AI News

1
1
The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning (arxiv.org)
2
1
Embodied-BenchClaw: An Autonomous Multi-Agent System for Embodied Spatial Intelligence Benchmark Construction (arxiv.org)
3
1
Skill-Augmented AI Agents for Medical Research Analysis: An Exploratory Multi-Model Human Evaluation in an NSCLC Transcriptomic Biomarker Task (arxiv.org)
4
1
Toward Trustworthy AI: Multi-Target Adversarial Attacks and Robust Defenses for Continuous Data Summarization (arxiv.org)
5
1
SVoT: State-aware Visualization-of-Thought for Spatial Reasoning via Reinforcement Learning (arxiv.org)
6
1
When Do Data-Driven Systems Exhibit the Capability to Infer? (arxiv.org)
7
1
Mind the Perspective: Let's Reason Recursively for Theory of Mind (arxiv.org)
8
1
Organize then Retrieve: Hierarchical Memory Navigation for Efficient Agents (arxiv.org)
9
1
Lung-R1: A Knowledge Graph-Guided LLM for Pulmonary Diagnostic Reasoning (arxiv.org)
10
1
TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search (arxiv.org)
11
1
TouchThinker: Scaling Tactile Commonsense Reasoning to the Open World with Large-scale Data and Action-aware Representation (arxiv.org)
12
1
HERO: Hindsight-Enhanced Reflection from Environment Observations for Agentic Self-Distillation (arxiv.org)
13
1
SkillJuror: Measuring How Agent Skill Organization Changes Runtime Behavior (arxiv.org)
14
1
MoCA-Agent: A Market-of-Claims Code Agent for Financial and Numerical Reasoning (arxiv.org)
15
1
Search Discipline for Long-Horizon Research Agents (arxiv.org)
16
1
Forecasting Future Behavior as a Learning Task (arxiv.org)
17
1
INFRAMIND: Infrastructure-Aware Multi-Agent Orchestration (arxiv.org)
18
1
Automated Mediator for Human Negotiation: Pre-Mediation via a Structured LLM Pipeline (arxiv.org)
19
1
Knowing When to Ask: Self-Gated Clarification for Hierarchical Language Agents (arxiv.org)
20
1
Can AI Agents Synthesize Scientific Conclusions? (arxiv.org)
21
1
From Explicit Elements to Implicit Intent: A Predefined Library for Auditable Behavioral Inference (arxiv.org)
22
1
Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning (arxiv.org)
23
1
StatefulDiscovery: Evidence-Calibrated Claim Formation in Open-Ended Scientific Discovery (arxiv.org)
24
1
Position: Hippocampal Explicit Memory Is the Cornerstone for AGI (arxiv.org)
25
1
CHORUS: Decentralized Multi-Embodiment Collaboration with One VLA Policy (arxiv.org)
26
1
Atlas H&E-TME: Scalable AI-Based Tissue Profiling at Expert Pathologist-Level Accuracy (arxiv.org)
27
1
On Subquadratic Architectures: From Applications to Principles (arxiv.org)
28
1
Soft-Prompt Tuning for Fair and Efficient LLM Benchmark Evaluation (arxiv.org)
29
1
Augmenting Molecular Language Models with Local $n$-gram Memory (arxiv.org)
30
1
Privacy-Preserving Federated Autoencoder for ECG Anomaly Detection on Edge Devices (arxiv.org)
31
1
From Consumption to Reflection: Designing Human-AI Relations for Stable Reasoning (arxiv.org)
32
1
Bootstrapped Monitoring: Leveraging Transparent Reasoning to Oversee Stronger AI Agents (arxiv.org)
33
1
Structure-Preserving Neural Surrogates with Tractable Uncertainty Quantification (arxiv.org)
34
1
AutoMine Solution for AV2 2026 Scenario Mining Challenge (arxiv.org)
35
1
TimeRouter: Efficient and Adaptive Routing of Time-Series Foundation Models (arxiv.org)
36
1
DeMix: Debugging Training Data with Mixed Data Error Types by Investigating Influence Vectors (arxiv.org)
37
1
Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary Deep Reinforcement Learning (arxiv.org)
38
1
Beyond the Golden Teacher: Enhancing Graph Learning through LLM-GNN Co-teaching (arxiv.org)
39
1
GraphInfer-Bench: Benchmarking LLM's Inference Capability on Graphs (arxiv.org)
40
1
Counterexample Guided Learning in the Large using Reasoning Agents (arxiv.org)
41
1
Probabilistic Contrastive Pretraining for Multi-task ADME Property Prediction (arxiv.org)
42
1
OmniLoc: A Geometry-Aware Foundation Model for Anchor-Free UE Localization Across Diverse Indoor Environments (arxiv.org)
43
1
Accurate and Resource-Efficient Federated Continual Learning (arxiv.org)
44
1
APEX: A Network-Native Time-Series Foundation Model for Forecasting and Anomaly Detection for Wireless Edge Operations (arxiv.org)
45
1
Mahalanobis-Guided Latent OOD Detection for Hybrid ES-DRL Control in Time-Varying Systems (arxiv.org)
46
1
LSTM-Based Detection of Structural Breaks in Property Insurance Loss Reserving: A Climate-Informed Approach (arxiv.org)
47
1
Mirror Descent Beyond Euclidean Stability: An Exponential Separation in Initialization Sensitivity (arxiv.org)
48
1
Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models (arxiv.org)
49
1
Recursive Binding on a Budget: Subspace Carving in Order-p Tensor Memories (arxiv.org)
50
1
GLACIER: A Multimodal Student-Teacher Foundation Model for Molecular Property Prediction (arxiv.org)