AI News

⚡ 6 minutes ago
1
1
Value-Aware Stochastic KV Cache Eviction for Reasoning Models (arxiv.org)
2
1
Enhancing Protein-Protein Interaction Prediction with Hierarchical Motif-based Multimodal Protein Embedding (arxiv.org)
3
1
PRISM: Synergizing Vision Foundation Models via Self-organized Expert Specialization (arxiv.org)
4
1
Finding Needles in the Haystack: Transductive Active Labeling in Ecology (arxiv.org)
5
1
PHASE: Physiology-Aware Hyperspectral Reconstruction via Object-to-Human Domain Adaptation (arxiv.org)
6
1
Generalizing Graph Foundation Models via Hyperbolic Retrieval-Augmented Generation (arxiv.org)
7
1
Fast-dLLM++: Fr\'{e}chet Profile Decoding for Faster Diffusion LLM Inference (arxiv.org)
8
1
Causal Evidence of Stack Representations in Modeling Counter Languages Using Transformers (arxiv.org)
9
1
Reproducibility is the New Copyleft: Defining AGI-oriented Reproducible Builds (arxiv.org)
10
1
NeuroArmor: Safe-Variant-Guided Representation Consistency for Selective Re-Anchoring in Jailbreak Defense (arxiv.org)
11
1
Resource-Constrained Adaptive Inference for Sequential Pricing (arxiv.org)
12
1
Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward (arxiv.org)
13
1
Rethinking the Role of Tensor Decompositions in Post-Training LLM Compression (arxiv.org)
14
1
A Training-Free Mixture-of-Agents Framework for Multi-Document Summarization using LLMs and Knowledge Graphs (arxiv.org)
15
1
Target Updates May Stabilize Linear Q-Learning: Periodic and Soft Dynamics (arxiv.org)
16
1
PrimeSVT: An Automated Memory-aware Pruning Framework with Prioritized Compression Policy for Spiking Vision Transformers (arxiv.org)
17
1
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning (arxiv.org)
18
1
Denoise First, Orthogonalize Later: Understanding Momentum in Muon via Spectral Filtering (arxiv.org)
19
1
PerchRL: Vision-Based Agile Perching on Inclined Platforms under Rapid and Irregular Motion (arxiv.org)
20
1
The Epi-LLM Framework: probing LLM behavioral priors through epidemiological agent-based models (arxiv.org)
21
1
Too Much of a Good Thing: When sim2real Efforts Impede Policy Learning (And What to Do About It) (arxiv.org)
22
1
Towards Non-Monotonic Entailment in Propositional Defeasible Standpoint Logic (arxiv.org)
23
1
Diagnosing Knowledge Gaps in LLM Tool Use: An Agentic Benchmark for Novel API Acquisition (arxiv.org)
24
1
WRIT: Write-Read Intensive Trajectory Synthesis for Multi-Turn User-Facing Agents (arxiv.org)
25
1
LLM-Assisted Reranking to Operationalize Nuanced Objectives in Recommender Systems (arxiv.org)
26
1
RobotValues: Evaluating Household Robots When Human Values Conflict (arxiv.org)
27
1
From Answers to States: Verifiable Process-Level Evaluation of Chemical Reasoning in Large Language Models (arxiv.org)
28
1
Synthetic Hallucinations, Real Gains: Hard Negatives from Frontier Models for FIM Hallucination Mitigation (arxiv.org)
29
1
Easy-to-Use Shielding for Reinforcement Learning (arxiv.org)
30
1
Learning to See via Epiretinal Implant Stimulation in silico with Model-Based Deep Reinforcement Learning (arxiv.org)
31
1
FlashbackCL: Mitigating Temporal Forgetting in Federated Learning (arxiv.org)
32
1
TiWeaver: Unified Temporal Dynamics Modeling via Contextual Patching (arxiv.org)
33
1
FGRPO: Federated GRPO with Adaptive Aggregation on Non-IID Data (arxiv.org)
34
1
A Geometric Lens on Physics-Aligned Data Compression (arxiv.org)
35
1
Constitutional On-Policy Safe Distillation (arxiv.org)
36
1
The DeepSpeak-Agentic Dataset (arxiv.org)
37
1
Dynamic Objective Selection with Safeguards and LLM Oversight for Financial Decision-Making (arxiv.org)
38
1
Glass Box at Orbit: A Constitutional AI Verification Framework for Trustworthy Autonomous CubeSat Intelligence (arxiv.org)
39
1
Dynamic Short Convolutions Improve Transformers (arxiv.org)
40
1
BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents (arxiv.org)
41
1
EvoDrive: Pareto Evolution for Safety-Critical Autonomous Driving via Self-Improving LLM Agents (arxiv.org)
42
1
Using Reward Uncertainty to Induce Diverse Behaviour in Reinforcement Learning (arxiv.org)
43
1
Code-on-Graph: Iterative Programmatic Reasoning via Large Language Models on Knowledge Graphs (arxiv.org)
44
1
DECA: Decentralizing Block-Wise Adam for Efficient LLM Full-Parameter Fine-Tuning on Non-IID Data (arxiv.org)
45
1
Fast Organic Crystal Structure Prediction with Unit Cell Flow Matching (arxiv.org)
46
1
GFFMERGE: Efficient Merging of Graph Neural Force Fields and Beyond (arxiv.org)
47
1
When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning (arxiv.org)
48
1
FederatedSkill: Federated Learning for Agentic Skill Evolution (arxiv.org)
49
1
IdEst: Assessing Self-Supervised Learning Representations via Intrinsic Dimension (arxiv.org)
50
1
Leveraging BART to Assess CS1 C++ Programming Assignments using Rubric-based Criteria (arxiv.org)