AI News

1
1
RASER: Recoverability-Aware Selective Escalation Router for Multi-Hop Question Answering (arxiv.org)
2
1
Iteris: Agentic Research Loops for Computational Mathematics (arxiv.org)
3
1
Temporal Motif Signatures for Temporal Graph Neural Networks (arxiv.org)
4
1
Fairness in two-player zero-sum games with bandit feedback (arxiv.org)
5
1
When Data Is Scarce: Scaling Sparse Language Models with Repeated Training (arxiv.org)
6
1
Lagrangian Perturbation Diffusion Steering: Latent Reinforcement Learning for Generative Policies (arxiv.org)
7
1
STARFISH: faST Accuracy Recovery in pruned networks From Internal State Healing (arxiv.org)
8
1
MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation (arxiv.org)
9
1
AGENTCL: Toward Rigorous Evaluation of Continual Learning in Language Agents (arxiv.org)
10
1
Beyond One-shot: AI Agents for Learning in Field Experiments (arxiv.org)
11
1
HLL: Can Agents Cross Humanity's Last Line of Verification? (arxiv.org)
12
1
Bridging the Sim-to-Real Gap in Semiconductor Visual Program Synthesis via Input Binarization (arxiv.org)
13
1
From Reward-Free Representations to Preferences: Rethinking Offline Preference-Based Reinforcement Learning (arxiv.org)
14
1
A Per-Component Diagnostic Protocol for Neural HJB-PIDE Solvers under Control-Dependent Lévy Jumps (arxiv.org)
15
1
HASTE: Hardware-Aware Dynamic Sparse Training for Large Output Spaces (arxiv.org)
16
1
LeAP: Learnable Adaptive Permutation for Feature Selection in Heterogeneous and Sparse Recommender Systems (arxiv.org)
17
1
Soft-NBCE: Entropy-Weighted Chunk Fusion for Long-Context (arxiv.org)
18
1
A Fiber Criterion for Representation Identifiability in Supervised Learning (arxiv.org)
19
1
AgentPLM: Agentic Protein Language Models with Reasoning-Augmented Decoding for Protein Sequence Design (arxiv.org)
20
1
A Mathematical Conflict Framework for Contextual Data Modulation (arxiv.org)
21
1
Spatial Representation Learning Beyond Pixels: Unifying Raster Data and Vector Semantics for Human-Centric Geospatial Foundation Models (arxiv.org)
22
1
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses (arxiv.org)
23
1
COMAP: Co-Evolving World Models and Agent Policies for LLM Agents (arxiv.org)
24
1
PaCX-MAE: Physiology-Augmented Chest X-Ray Masked Autoencoder (arxiv.org)
25
1
MViewRouter: Internalizing Geometric Equivariance via Multi-view Alternating Attention for Combinatorial Routing (arxiv.org)
26
1
Decision-Focused On-Policy Learning for Contextual Linear Optimization with Partial Feedback (arxiv.org)
27
1
ThinkSwitch: Context Distillation with LoRA and Weight Interpolation for Specific-Purpose Reasoning Tasks (arxiv.org)
28
1
Non-Vacuous Certification of Transport MCMC via Oscillation-Controlled Normalizing Flows (arxiv.org)
29
1
Improving Hospital Process Management through Process Mining: A Case Study on COVID-19 Clinical Pathways (arxiv.org)
30
1
MOC: Multi-Order Communication in LLM-based Multi-Agent Systems (arxiv.org)
31
1
SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training (arxiv.org)
32
1
Coordination Graphs for Constrained Multi-Agent Reinforcement Learning (arxiv.org)
33
1
Repair Before Veto: Repair-Augmented Constraint Learning for Contextual Decisions (arxiv.org)
34
1
Interaction-Limited Safe Continuous-Time RL for Dynamical Medical Treatment (arxiv.org)
35
1
Plausibility Is Not Prediction: Contrastive Evidence for LLM-Based Cellular Perturbation Reasoning (arxiv.org)
36
1
OPD+: Rethinking the Advantage Design for On-Policy Distillation (arxiv.org)
37
1
MedGym: A Unified Continuous-Time Benchmark for Dynamic Medical Treatment Reinforcement Learning (arxiv.org)
38
1
Strong Stochastic Flow Maps (arxiv.org)
39
1
POIROT: Interrogating Agents for Failure Detection in Multi-Agent Systems (arxiv.org)
40
1
CEON: Circular Economy Ontology Network (arxiv.org)
41
1
From Capability Models to Automated Planning: An AAS-Native Approach for Automatic PDDL Generation (arxiv.org)
42
1
LLM-Evolved Pattern Generators for Optimal Classical Planning (arxiv.org)
43
1
S3TS: Stochastic Scenario-Structured Tree Search for Advanced Planning Under Uncertainty (arxiv.org)
44
1
Beyond Task-Agnostic: Task-Aware Grouping for Communication-Efficient Multi-Task MoE Inference (arxiv.org)
45
1
Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher (arxiv.org)
46
1
Data Enrichment for Symbolic Regression Using Diffusion Models (arxiv.org)
47
1
Profiling Privacy Preservation Against Gradient Inversion Attacks in Tabular Federated Learning (arxiv.org)
48
1
CryoProt: A Protein Pretraining Framework with Cross-Box Interactions on Cryo-EM Density Maps (arxiv.org)
49
1
BADGER: Bridging Agentic and Deterministic Evaluation for Generative Enterprise Reasoning (arxiv.org)
50
1
eMoT: evolving Memory-of-Thought via Symbolic Anchoring and Memory Corrosion (arxiv.org)