AI News

The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Embodied-BenchClaw: An Autonomous Multi-Agent System for Embodied Spatial Intelligence Benchmark Construction (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Skill-Augmented AI Agents for Medical Research Analysis: An Exploratory Multi-Model Human Evaluation in an NSCLC Transcriptomic Biomarker Task (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Toward Trustworthy AI: Multi-Target Adversarial Attacks and Robust Defenses for Continuous Data Summarization (arxiv.org)

by rss-bot · 1 week ago · 0 comments

SVoT: State-aware Visualization-of-Thought for Spatial Reasoning via Reinforcement Learning (arxiv.org)

by rss-bot · 1 week ago · 0 comments

When Do Data-Driven Systems Exhibit the Capability to Infer? (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Mind the Perspective: Let's Reason Recursively for Theory of Mind (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Organize then Retrieve: Hierarchical Memory Navigation for Efficient Agents (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Lung-R1: A Knowledge Graph-Guided LLM for Pulmonary Diagnostic Reasoning (arxiv.org)

by rss-bot · 1 week ago · 0 comments

TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search (arxiv.org)

by rss-bot · 1 week ago · 0 comments

TouchThinker: Scaling Tactile Commonsense Reasoning to the Open World with Large-scale Data and Action-aware Representation (arxiv.org)

by rss-bot · 1 week ago · 0 comments

HERO: Hindsight-Enhanced Reflection from Environment Observations for Agentic Self-Distillation (arxiv.org)

by rss-bot · 1 week ago · 0 comments

SkillJuror: Measuring How Agent Skill Organization Changes Runtime Behavior (arxiv.org)

by rss-bot · 1 week ago · 0 comments

MoCA-Agent: A Market-of-Claims Code Agent for Financial and Numerical Reasoning (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Search Discipline for Long-Horizon Research Agents (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Forecasting Future Behavior as a Learning Task (arxiv.org)

by rss-bot · 1 week ago · 0 comments

INFRAMIND: Infrastructure-Aware Multi-Agent Orchestration (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Automated Mediator for Human Negotiation: Pre-Mediation via a Structured LLM Pipeline (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Knowing When to Ask: Self-Gated Clarification for Hierarchical Language Agents (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Can AI Agents Synthesize Scientific Conclusions? (arxiv.org)

by rss-bot · 1 week ago · 0 comments

From Explicit Elements to Implicit Intent: A Predefined Library for Auditable Behavioral Inference (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning (arxiv.org)

by rss-bot · 1 week ago · 0 comments

StatefulDiscovery: Evidence-Calibrated Claim Formation in Open-Ended Scientific Discovery (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Position: Hippocampal Explicit Memory Is the Cornerstone for AGI (arxiv.org)

by rss-bot · 1 week ago · 0 comments

CHORUS: Decentralized Multi-Embodiment Collaboration with One VLA Policy (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Atlas H&E-TME: Scalable AI-Based Tissue Profiling at Expert Pathologist-Level Accuracy (arxiv.org)

by rss-bot · 1 week ago · 0 comments

On Subquadratic Architectures: From Applications to Principles (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Soft-Prompt Tuning for Fair and Efficient LLM Benchmark Evaluation (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Augmenting Molecular Language Models with Local $n$-gram Memory (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Privacy-Preserving Federated Autoencoder for ECG Anomaly Detection on Edge Devices (arxiv.org)

by rss-bot · 1 week ago · 0 comments

From Consumption to Reflection: Designing Human-AI Relations for Stable Reasoning (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Bootstrapped Monitoring: Leveraging Transparent Reasoning to Oversee Stronger AI Agents (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Structure-Preserving Neural Surrogates with Tractable Uncertainty Quantification (arxiv.org)

by rss-bot · 1 week ago · 0 comments

AutoMine Solution for AV2 2026 Scenario Mining Challenge (arxiv.org)

by rss-bot · 1 week ago · 0 comments

TimeRouter: Efficient and Adaptive Routing of Time-Series Foundation Models (arxiv.org)

by rss-bot · 1 week ago · 0 comments

DeMix: Debugging Training Data with Mixed Data Error Types by Investigating Influence Vectors (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary Deep Reinforcement Learning (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Beyond the Golden Teacher: Enhancing Graph Learning through LLM-GNN Co-teaching (arxiv.org)

by rss-bot · 1 week ago · 0 comments

GraphInfer-Bench: Benchmarking LLM's Inference Capability on Graphs (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Counterexample Guided Learning in the Large using Reasoning Agents (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Probabilistic Contrastive Pretraining for Multi-task ADME Property Prediction (arxiv.org)

by rss-bot · 1 week ago · 0 comments

OmniLoc: A Geometry-Aware Foundation Model for Anchor-Free UE Localization Across Diverse Indoor Environments (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Accurate and Resource-Efficient Federated Continual Learning (arxiv.org)

by rss-bot · 1 week ago · 0 comments

APEX: A Network-Native Time-Series Foundation Model for Forecasting and Anomaly Detection for Wireless Edge Operations (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Mahalanobis-Guided Latent OOD Detection for Hybrid ES-DRL Control in Time-Varying Systems (arxiv.org)

by rss-bot · 1 week ago · 0 comments

LSTM-Based Detection of Structural Breaks in Property Insurance Loss Reserving: A Climate-Informed Approach (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Mirror Descent Beyond Euclidean Stability: An Exponential Separation in Initialization Sensitivity (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models (arxiv.org)

by rss-bot · 1 week ago · 0 comments

Recursive Binding on a Budget: Subspace Carving in Order-p Tensor Memories (arxiv.org)

by rss-bot · 1 week ago · 0 comments

GLACIER: A Multimodal Student-Teacher Foundation Model for Molecular Property Prediction (arxiv.org)

by rss-bot · 1 week ago · 0 comments

← prev p.110/2203 next →