AI News

⚡ 13 minutes ago
1
1
DECA: Decentralizing Block-Wise Adam for Efficient LLM Full-Parameter Fine-Tuning on Non-IID Data (arxiv.org)
2
1
Multi-Modal Machine Learning for Breast Cancer Recurrence Prediction (arxiv.org)
3
1
When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning (arxiv.org)
4
1
Proof-Refactor: Refactoring Generated Formal Proofs into Modular Artifacts (arxiv.org)
5
1
LAP: An Agent-to-Instrument Protocol for Autonomous Science (arxiv.org)
6
1
From Control Boundary to Insurance Claim: Reconstructing AI-Mediated Losses Through the CER Framework (arxiv.org)
7
1
Enhancing Operational Safety via Agentic Dialogue Hazard Identification Analysis (arxiv.org)
8
1
Bayesian Tensor Decomposition with Diffusion Model Prior (arxiv.org)
9
1
Learning Temporal Causal Structure via Smooth Differentiable Optimization (arxiv.org)
10
1
GFFMERGE: Efficient Merging of Graph Neural Force Fields and Beyond (arxiv.org)
11
1
IdEst: Assessing Self-Supervised Learning Representations via Intrinsic Dimension (arxiv.org)
12
1
Right Makes Might: Aligning Verified Hidden States Empowers RL Reasoning (arxiv.org)
13
1
SkillPyramid: A Hierarchical Skill Consolidation Framework for Self-Evolving Agents (arxiv.org)
14
1
Leveraging BART to Assess CS1 C++ Programming Assignments using Rubric-based Criteria (arxiv.org)
15
1
Calibrating Urban Traffic Simulation from Sparse Road Observations via Genetic Optimization (arxiv.org)
16
1
BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents (arxiv.org)
17
1
EvoDS: Self-Evolving Autonomous Data Science Agent with Skill Learning and Context Management (arxiv.org)
18
1
Rethinking the Role of Tensor Decompositions in Post-Training LLM Compression (arxiv.org)
19
1
When RLHF Fails: A Mechanistic Taxonomy of Reward Hacking, Collapse, and Evaluator Gaming (arxiv.org)
20
1
EqGINO: Equivariant Geometry-Informed Fourier Neural Operators for 3D PDEs (arxiv.org)
21
1
Let There Be Light: Reflection, Refraction and Scattering for Neural Operators (arxiv.org)
22
1
Are Common Substructures Transferable? Riemannian Graph Foundation Model with Neural Vector Bundles (arxiv.org)
23
1
A Geometric Lens on Physics-Aligned Data Compression (arxiv.org)
24
1
scTranslation: A Comprehensive Benchmark for Single-Cell Multi-Omics Modality Translation (arxiv.org)
25
1
Hedge-Bench: Benchmarking Agents on Hard, Realistic Tasks Pertaining to Financial Reasoning (arxiv.org)
26
1
Entropy Is Not Enough: Unlocking Effective Reinforcement Learning for Visual Reasoning via Vision-Anchored Token Selection (arxiv.org)
27
1
Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models (arxiv.org)
28
1
TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches (arxiv.org)
29
1
Message Tuning Outshines Graph Prompt Tuning: A Prismatic Space Perspective (arxiv.org)
30
1
Learning Multi-Scale Hypergraph for High-Order Brain Connectivity Analysis (arxiv.org)
31
1
A Graph Foundation Model with Spectral Parsing and Prototype-Guided Spatial Propagation (arxiv.org)
32
1
Validation-Gated Multi-Agent Governance for Online Adaptation of Thermal-Hydraulic Surrogate Models under Operating-Regime Shift (arxiv.org)
33
1
Cost-Aware Query Routing in RAG: Empirical Analysis of Retrieval Depth Tradeoffs (arxiv.org)
34
1
IdiomX A Multilingual Benchmark for Idiom Understanding, Retrieval, and Interpretation (arxiv.org)
35
1
Lean-GAP: A Dataset of Formalized Graduate Algebra Problems (arxiv.org)
36
1
Tracking Urban Atmospheric Pollutants using Sentinel-5P Satellite Data (arxiv.org)
37
1
Samudra 2: Scaling Ocean Emulators across Resolutions (arxiv.org)
38
1
Margin Play: A Multi-Agent System For Public Policy Analysis In The Brazilian Equatorial Margin (arxiv.org)
39
1
FSA-GRPO: Teaching Auditory LLMs to Use Few-shot Demonstrations (arxiv.org)
40
1
Closed-Loop Molecular Design with Calibrated Deference (arxiv.org)
41
1
Oscillatory State-Space Models as Inductive Biases for Physics-Informed Neural PDE Solvers (arxiv.org)
42
1
TadA-Bench: A Million-Variant Benchmark for Future-Round Discovery Toward Agentic Protein Engineering (arxiv.org)
43
1
DXA-Derived Skeletal Phenotypes and Hip Fracture Risk: A Backdoor-Adjusted Causal Analysis (arxiv.org)
44
1
DMF: A Deterministic Memory Framework for Conversational AI Agents (arxiv.org)
45
1
Calibration Data Trade-offs Across Capability Dimensions: Why Multi-Source Mixing Matters for High-Sparsity LLM Pruning (arxiv.org)
46
1
MultiTurnPSB: Evaluating Multi-Turn Jailbreak Attacks an dClassifier-Based Defenses for Medical AI Safety (arxiv.org)
47
1
FLIPS: Instance-Fingerprinting for LLMs via Pseudo-random Sequences (arxiv.org)
48
1
Wavelet as Tokenizer: Preliminary Results on a Shared Wavelet Token Schema for Natural Signals (arxiv.org)
49
1
SegTune: Structured and Fine-Grained Control for Song Generation (arxiv.org)
50
1
Sparse-View Lung Nodule Volumetry from Digitally Reconstructed Radiographs via AReT: Anatomy-Regularized TensoRF (arxiv.org)