AI News

⚡ just now
1
1
SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes (arxiv.org)
2
1
Bayesian Spectral Emotion Transition Discovery from Multi-Annotator Disagreement (arxiv.org)
3
1
Escaping the Mode Lottery: Multi-Response Training Improves Language Model Generalization (arxiv.org)
4
1
Physically-Constrained Mamba-SDE for Remaining Useful Life Prediction under Irregular Observations (arxiv.org)
5
1
Absorbing Complexity: An Interaction-Native Knowledge Harness for Financial LLM Agents (arxiv.org)
6
1
EVA-Net: Subject-Independent EEG Motor Decoding with Video-Derived Motor Priors (arxiv.org)
7
1
WorldCoder-Bench: Benchmarking Physically Grounded 3D World Synthesis (arxiv.org)
8
1
Dive into Waves: Morlet Spectral Transformer for Cross-Subject Emotion Decoding from EEG (arxiv.org)
9
1
Task diversity produces systematic transfer but inhibits continual reinforcement learning (arxiv.org)
10
1
Enhancing LLM Metacognition via Cognitive Pairwise Training (arxiv.org)
11
1
CUPID in the Model Zoo: Online Matchmaking for Selecting Your Dream LLM (arxiv.org)
12
1
Online Packet Scheduling with Deadlines and Learning (arxiv.org)
13
1
Does Compression Preserve Uncertainty? A Unified Benchmark for Quantized and Sparse LLMs via Conformal Prediction (arxiv.org)
14
1
Evaluation of Baseline Methods for IDD-based SSD External Memory Search (arxiv.org)
15
1
CAPF: Guiding Search-Agent Rollouts with Credit-Attenuated Privileged Feedback (arxiv.org)
16
1
An NLP-Driven Framework for Curriculum-Labor Market Alignment: Schema-Constrained LLM Extraction, ESCO-Anchored Semantic Matching, and Multi-Dimensional Gap Quantification (arxiv.org)
17
1
Token Predictors Are Not Planners: Building Physically Grounded Causal Reasoners (arxiv.org)
18
1
OctoT2I: A Self-Evolving Agentic Text-to-Image Router (arxiv.org)
19
1
Limits of Resolution Equivariance in Fourier Neural Operators (arxiv.org)
20
1
Mapping the evolution of small reservoirs in Brazil from 1984 to 2025 using deep learning (arxiv.org)
21
1
What Makes a Strong Model? A Unified Spectral Analysis of Knowledge Transfer over High-dimensional Linear Regression (arxiv.org)
22
1
The Paradox of Outcome Optimization: A Causal Information-Theoretic Bound on Reasoning Shortcuts in LLMs (arxiv.org)
23
1
Demystifying the Optimal Fair Classifier in Multi-Class Classification (arxiv.org)
24
1
Don't Ask the LLM to Track Freshness: A Deterministic Recipe for Memory Conflict Resolution (arxiv.org)
25
1
GovAI-Pipe: A Layered AI Governance Pipeline for Citizen-Facing AI in Turkey's e-Government Gateway (arxiv.org)
26
1
Self-Healing Agentic Orchestrators for Reliable Tool-Augmented Large Language Model Systems (arxiv.org)
27
1
GuidaPA: Privacy-Preserving Chatbot for Public Administration via Federated Learning (arxiv.org)
28
1
Early Diagnosis of Wasted Computation in Multi-Agent LLM Systems via Failure-Aware Observability (arxiv.org)
29
1
MESA: Improving MoE Safety Alignment via Decentralized Expertise (arxiv.org)
30
1
How Neural Losses Shape VAE Latents (arxiv.org)
31
1
CARE-RL: Capability-Aware Reinforcement Learning for Mitigating Cross-Domain Conflicts (arxiv.org)
32
1
LASER: Loss-Aware Singular-value Decomposition and Rank Allocation for Efficient Low-Precision Vision-Language Models (arxiv.org)
33
1
Spatiotemporal Multi-Task Graph Transformer for Trip-Level Transit Prediction (arxiv.org)
34
1
FlowTime: Towards Continuous Generative Watch Time Prediction via Flow-based Personalized Priors (arxiv.org)
35
1
Recognize Your Orchestrator: An Entropy Dynamics Perspective for LLM Multi-Agent Systems (arxiv.org)
36
1
Science Earth: Towards A Planet-Scale Operating System for AI-Native Scientific Discovery (arxiv.org)
37
1
SkillSmith: Co-Evolving Skills and Tools for Self-Improving Agent Systems (arxiv.org)
38
1
ANDES: Agent Native Data Evolving Synthesis Tool for Autonomous Instruction Alignment (arxiv.org)
39
1
SIRIUS-SQL: Anchoring Multi-Candidate Text-to-SQL in Execution Feedback (arxiv.org)
40
1
Characterization of Multi-Model Agentic AI Systems on General Tasks via Trace-Driven Simulation (arxiv.org)
41
1
On the Difficulty of Learning a Meta-network for Training Data Selection (arxiv.org)
42
1
On the Recoverability of Causal Relations from Bulk Gene Expression Data (arxiv.org)
43
1
Same Payload, Different Channel: Measuring Trust Asymmetry in Tool-Using Language Models (arxiv.org)
44
1
Interpretable Policy Distillation for Power Grid Topology Control (arxiv.org)
45
1
Richer Representations for Neural Algorithmic Reasoning via Auxiliary Reconstruction (arxiv.org)
46
1
Semi-Supervised Noise Adaptation: Transferring Knowledge from Noise Domain (arxiv.org)
47
1
Normalized Relevance Measure as a Unifying Framework to Explain Neural Network Latent Structures (arxiv.org)
48
1
The Assistant as a Privileged Persona: A canonical reference in cross-persona self-recognition (arxiv.org)
49
1
WildCat: Near-Linear Attention in Theory and Practice (arxiv.org)
50
1
Collaborative and Efficient Fine-tuning: Leveraging Task Similarity (arxiv.org)