AI News

⚡ 2 minutes ago
1
1
Observability for Delegated Execution in Agentic AI Systems (arxiv.org)
2
1
An 84-Format Numeric Catalog with Bit-Exact Conformance Vectors: A Vendor-Neutral Reference for FP8, BF16, MXFP4, and Microscaling Formats (arxiv.org)
3
1
A Graphop Analysis of Graph Neural Networks on Sparse Graphs: Generalization and Universal Approximation (arxiv.org)
4
1
Foundation Inference Models for Ordinary Differential Equations (arxiv.org)
5
1
On the Superlinear Relationship between SGD Noise Covariance and Loss Landscape Curvature (arxiv.org)
6
1
The Label Horizon Paradox: Rethinking Supervision Targets in Financial Forecasting (arxiv.org)
7
1
Your Self-Play Algorithm is Secretly an Adversarial Imitator: Understanding LLM Self-Play through the Lens of Imitation Learning (arxiv.org)
8
1
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents (arxiv.org)
9
1
Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance (arxiv.org)
10
1
Optimal Fair Aggregation of Crowdsourced Noisy Labels using Demographic Parity Constraints (arxiv.org)
11
1
Latent Spherical Flow Policy for Reinforcement Learning with Combinatorial Actions (arxiv.org)
12
1
Cosmo3DFlow: Wavelet Flow Matching for Spatial-to-Spectral Compression in Reconstructing the Early Universe (arxiv.org)
13
1
Nonparametric LLM Evaluation from Preference Data (arxiv.org)
14
1
Alcmean's: Unsupervised community detection using local Laplacian, automatic detection of the number of centers (arxiv.org)
15
1
MilliVid: Hierarchical Latents for Long-Range Consistency in Video Generation (arxiv.org)
16
1
Families of Control-Cost-Parametrized Inverse-Optimal Universal Stabilizers (arxiv.org)
17
1
MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering (arxiv.org)
18
1
UA-DCM: Uncertainty-aware Causal Decision Making via Effect Bound Decomposition (arxiv.org)
19
1
Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units (arxiv.org)
20
1
Comparative evaluation of training strategies using partially labelled datasets for segmentation of white matter hyperintensities and stroke lesions in FLAIR MRI (arxiv.org)
21
1
A systematic investigation of molecular encoding methods for drug property predictions across neural network and Transformer encoder-based model (arxiv.org)
22
1
Hardening Agent Benchmarks with Adversarial Hacker-Fixer Loops (arxiv.org)
23
1
RAM: Reachability Across Morphologies (arxiv.org)
24
1
Fourier Neural Operators with rank-1 lattice points and hyperbolic cross (arxiv.org)
25
1
CHROMA: Detecting AI-Generated Images through Inter-Channel Color-Space Correlations (arxiv.org)
26
1
Structure-Aware Modeling of Multiple-Choice Questions Improves Automatic Difficulty Estimation (arxiv.org)
27
1
Meeting SLOs, Slashing Hours: Automated Enterprise LLM Optimization with OptiKIT (arxiv.org)
28
1
Generative Reasoning Re-ranker (arxiv.org)
29
1
CURE: Curriculum-guided Multi-task Training for Reliable Anatomy Grounded Report Generation (arxiv.org)
30
1
XCR-Bench: Benchmarking Cross-Cultural Reasoning in LLMs via Culture-Specific Items and Hall's Triad (arxiv.org)
31
1
Multimodal Generative Engine Optimization: Rank Manipulation for Vision-Language Model Rankers (arxiv.org)
32
1
CLONE: A 3DGS-Based Closed-Loop Differentiable Optimization Framework for Single-Image Normal Estimation (arxiv.org)
33
1
AMix-1: A Pathway to Test-Time Scalable Protein Foundation Model (arxiv.org)
34
1
Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones (arxiv.org)
35
1
ACTIVE-o3: Empowering MLLMs with Active Perception via Pure Reinforcement Learning (arxiv.org)
36
1
From A to B to A: Palindromic Zero-Shot Voice Conversion with Non-Parallel Data (arxiv.org)
37
1
IR-SIM: A Lightweight Skill-Native Simulator for Navigation, Learning, and Benchmarking (arxiv.org)
38
1
Compositional Approximation Can Strictly Outperform Superpositional Approximation (arxiv.org)
39
1
Discovering and decoding latent mean-field structure with variational autoencoders (arxiv.org)
40
1
Balancing Real and Synthetic Data for CNN-based Masonry Crack Detection (arxiv.org)
41
1
Harmonia: End-to-End RAG Serving Optimization (arxiv.org)
42
1
Robust Renal Mass Segmentation on CT: A Validation Study of an AI-Based Framework (arxiv.org)
43
1
Deep Tree Tensor Networks (arxiv.org)
44
1
FIT-Print: Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint (arxiv.org)
45
1
Complement or substitute? How AI increases the demand for human skills (arxiv.org)
46
1
Strategic Integration of Artificial Intelligence in the C-Suite: The Role of the Chief AI Officer (arxiv.org)
47
1
Rule-based autocorrection of Piping and Instrumentation Diagrams (P&IDs) on graphs (arxiv.org)
48
1
GRPO Does Not Close the Multi-Agent Coordination Gap (arxiv.org)
49
1
Latent Structural Categorical Matrix Completion with Application to Quasispecies Analysis (arxiv.org)
50
1
Toward autocorrection of chemical process flowsheets using large language models (arxiv.org)