AI News

⚡ 15 minutes ago
1
1
How Much Orthogonalization Does Muon Need? (arxiv.org)
2
1
CRMA: A Spectrally-Bounded Backbone for Modular Continual Fine-Tuning of LLMs (arxiv.org)
3
1
Detector-Evasive LLM Paraphrasing via Constrained Policy Optimization (arxiv.org)
4
1
PR2: Predictive Routing Replay for MoE-Based LLM Reinforcement Learning (arxiv.org)
5
1
Quantum Reservoir Computing and Risk Bounds (arxiv.org)
6
1
Subliminal Learning Is Steering Vector Distillation (arxiv.org)
7
1
Property Prediction of Stacked Bilayer Materials: A Multimodal Learning Approach (arxiv.org)
8
1
Can AI Review Improve Paper Drafting? An Empirical Study on 20 Computer Architecture Submissions (arxiv.org)
9
1
Tackling the Root of Misinformation by Teaching Laypeople about Logical Fallacies via Socratic Questioning and Critical Argumentation (arxiv.org)
10
1
TriLens: Per-Layer Logit-Lens Entropy for White-Box Hallucination Detection (arxiv.org)
11
1
Dynamic Proxy-Mixing: Transferring Replay Controllers from Small to Large Models for Continual Instruction Tuning (arxiv.org)
12
1
Auditing Near-Optimal Policies Can Be Exponentially Hard: Conditional Query Lower Bounds via Occupancy Rashomon Capacity (arxiv.org)
13
1
Finer Parameter Steps for Low-Rank PEFT: A Controlled Study with CP Tensor Adapters (arxiv.org)
14
1
Canonicalized Stable-List Replay for Private Federated Continual Learning over Language-Model Embeddings (arxiv.org)
15
1
VERA: Variational Inference Framework for Jailbreaking Large Language Models (arxiv.org)
16
1
Targeted Data Fusion for Region-Specific Survival Effects in the AMP HIV Prevention Trials (arxiv.org)
17
1
Non-vacuous Generalization Bounds for Deep Neural Networks without any modification to the trained models (arxiv.org)
18
1
Advancing Local Clustering on Graphs via Compressive Sensing: Semi-supervised and Unsupervised Methods (arxiv.org)
19
1
Global Convergence of Adaptive Sensing for Principal Eigenvector Estimation (arxiv.org)
20
1
Human in the Loop Adaptive Optimization for Improved Time Series Forecasting (arxiv.org)
21
1
Interpretable Self-Supervised Learning via Representer Landmarks and Nyström Approximation (arxiv.org)
22
1
Trajectory Data Suffices for Statistically Efficient Policy Evaluation in Fixed-Horizon Offline RL with Linear $q^\pi$-Realizability and Concentrability (arxiv.org)
23
1
Safeguarded Stochastic Polyak Step Sizes for Non-smooth Optimization: Robust Performance Without Small (Sub)Gradients (arxiv.org)
24
1
Independent Component Discovery in Temporal Count Data (arxiv.org)
25
1
Universal One-third Time Scaling in Learning Peaked Distributions (arxiv.org)
26
1
Collaborative and Efficient Fine-tuning: Leveraging Task Similarity (arxiv.org)
27
1
WildCat: Near-Linear Attention in Theory and Practice (arxiv.org)
28
1
The Assistant as a Privileged Persona: A canonical reference in cross-persona self-recognition (arxiv.org)
29
1
Normalized Relevance Measure as a Unifying Framework to Explain Neural Network Latent Structures (arxiv.org)
30
1
Semi-Supervised Noise Adaptation: Transferring Knowledge from Noise Domain (arxiv.org)
31
1
Richer Representations for Neural Algorithmic Reasoning via Auxiliary Reconstruction (arxiv.org)
32
1
Interpretable Policy Distillation for Power Grid Topology Control (arxiv.org)
33
1
Same Payload, Different Channel: Measuring Trust Asymmetry in Tool-Using Language Models (arxiv.org)
34
1
On the Recoverability of Causal Relations from Bulk Gene Expression Data (arxiv.org)
35
1
On the Difficulty of Learning a Meta-network for Training Data Selection (arxiv.org)
36
1
Characterization of Multi-Model Agentic AI Systems on General Tasks via Trace-Driven Simulation (arxiv.org)
37
1
SIRIUS-SQL: Anchoring Multi-Candidate Text-to-SQL in Execution Feedback (arxiv.org)
38
1
ANDES: Agent Native Data Evolving Synthesis Tool for Autonomous Instruction Alignment (arxiv.org)
39
1
SkillSmith: Co-Evolving Skills and Tools for Self-Improving Agent Systems (arxiv.org)
40
1
Science Earth: Towards A Planet-Scale Operating System for AI-Native Scientific Discovery (arxiv.org)
41
1
Recognize Your Orchestrator: An Entropy Dynamics Perspective for LLM Multi-Agent Systems (arxiv.org)
42
1
FlowTime: Towards Continuous Generative Watch Time Prediction via Flow-based Personalized Priors (arxiv.org)
43
1
Spatiotemporal Multi-Task Graph Transformer for Trip-Level Transit Prediction (arxiv.org)
44
1
LASER: Loss-Aware Singular-value Decomposition and Rank Allocation for Efficient Low-Precision Vision-Language Models (arxiv.org)
45
1
CARE-RL: Capability-Aware Reinforcement Learning for Mitigating Cross-Domain Conflicts (arxiv.org)
46
1
How Neural Losses Shape VAE Latents (arxiv.org)
47
1
MESA: Improving MoE Safety Alignment via Decentralized Expertise (arxiv.org)
48
1
Early Diagnosis of Wasted Computation in Multi-Agent LLM Systems via Failure-Aware Observability (arxiv.org)
49
1
GuidaPA: Privacy-Preserving Chatbot for Public Administration via Federated Learning (arxiv.org)
50
1
Self-Healing Agentic Orchestrators for Reliable Tool-Augmented Large Language Model Systems (arxiv.org)