AI News

Observability for Delegated Execution in Agentic AI Systems (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

An 84-Format Numeric Catalog with Bit-Exact Conformance Vectors: A Vendor-Neutral Reference for FP8, BF16, MXFP4, and Microscaling Formats (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Graphop Analysis of Graph Neural Networks on Sparse Graphs: Generalization and Universal Approximation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Foundation Inference Models for Ordinary Differential Equations (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

On the Superlinear Relationship between SGD Noise Covariance and Loss Landscape Curvature (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

The Label Horizon Paradox: Rethinking Supervision Targets in Financial Forecasting (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Your Self-Play Algorithm is Secretly an Adversarial Imitator: Understanding LLM Self-Play through the Lens of Imitation Learning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Optimal Fair Aggregation of Crowdsourced Noisy Labels using Demographic Parity Constraints (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Latent Spherical Flow Policy for Reinforcement Learning with Combinatorial Actions (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Cosmo3DFlow: Wavelet Flow Matching for Spatial-to-Spectral Compression in Reconstructing the Early Universe (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Nonparametric LLM Evaluation from Preference Data (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Alcmean's: Unsupervised community detection using local Laplacian, automatic detection of the number of centers (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

MilliVid: Hierarchical Latents for Long-Range Consistency in Video Generation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Families of Control-Cost-Parametrized Inverse-Optimal Universal Stabilizers (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

UA-DCM: Uncertainty-aware Causal Decision Making via Effect Bound Decomposition (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Comparative evaluation of training strategies using partially labelled datasets for segmentation of white matter hyperintensities and stroke lesions in FLAIR MRI (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A systematic investigation of molecular encoding methods for drug property predictions across neural network and Transformer encoder-based model (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Hardening Agent Benchmarks with Adversarial Hacker-Fixer Loops (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

RAM: Reachability Across Morphologies (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Fourier Neural Operators with rank-1 lattice points and hyperbolic cross (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

CHROMA: Detecting AI-Generated Images through Inter-Channel Color-Space Correlations (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Structure-Aware Modeling of Multiple-Choice Questions Improves Automatic Difficulty Estimation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Meeting SLOs, Slashing Hours: Automated Enterprise LLM Optimization with OptiKIT (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Generative Reasoning Re-ranker (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

CURE: Curriculum-guided Multi-task Training for Reliable Anatomy Grounded Report Generation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

XCR-Bench: Benchmarking Cross-Cultural Reasoning in LLMs via Culture-Specific Items and Hall's Triad (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Multimodal Generative Engine Optimization: Rank Manipulation for Vision-Language Model Rankers (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

CLONE: A 3DGS-Based Closed-Loop Differentiable Optimization Framework for Single-Image Normal Estimation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

AMix-1: A Pathway to Test-Time Scalable Protein Foundation Model (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

ACTIVE-o3: Empowering MLLMs with Active Perception via Pure Reinforcement Learning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

From A to B to A: Palindromic Zero-Shot Voice Conversion with Non-Parallel Data (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

IR-SIM: A Lightweight Skill-Native Simulator for Navigation, Learning, and Benchmarking (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Compositional Approximation Can Strictly Outperform Superpositional Approximation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Discovering and decoding latent mean-field structure with variational autoencoders (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Balancing Real and Synthetic Data for CNN-based Masonry Crack Detection (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Harmonia: End-to-End RAG Serving Optimization (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Robust Renal Mass Segmentation on CT: A Validation Study of an AI-Based Framework (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Deep Tree Tensor Networks (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

FIT-Print: Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Complement or substitute? How AI increases the demand for human skills (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Strategic Integration of Artificial Intelligence in the C-Suite: The Role of the Chief AI Officer (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Rule-based autocorrection of Piping and Instrumentation Diagrams (P&IDs) on graphs (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

GRPO Does Not Close the Multi-Agent Coordination Gap (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Latent Structural Categorical Matrix Completion with Application to Quasispecies Analysis (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Toward autocorrection of chemical process flowsheets using large language models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

← prev p.217/2264 next →