AI News

⚡ 14 minutes ago
1
1
Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals (arxiv.org)
2
1
Skill-Based Mixture-of-Experts: Adaptive Routing for Heterogeneous Reasoning via Inferred Skills (arxiv.org)
3
1
EuroBERT: Scaling Multilingual Encoders for European Languages (arxiv.org)
4
1
ShapeLib: Designing a library of programmatic 3D shape abstractions with Large Language Models (arxiv.org)
5
1
Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Latent Priors (arxiv.org)
6
1
Online Learning in MDPs with Partially Adversarial Transitions and Losses (arxiv.org)
7
1
Learning to Remember, Learn, and Forget in Attention-Based Models (arxiv.org)
8
1
AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection (arxiv.org)
9
1
Learning To Sample From Diffusion Models Via Inverse Reinforcement Learning (arxiv.org)
10
1
Generalization of Gibbs and Langevin Monte Carlo Algorithms in the Interpolation Regime (arxiv.org)
11
1
How to Correctly Report LLM-as-a-Judge Evaluations (arxiv.org)
12
1
It does what it says on the tin: safe synthetic data from coarsened margins (arxiv.org)
13
1
Inverse Depth Scaling From Most Layers Being Similar (arxiv.org)
14
1
Error Bounds for a Diffusion Model-Based Drift Estimator (arxiv.org)
15
1
ProbRes: Volatility Learning for Probabilistic Time-Series Forecasting (arxiv.org)
16
1
When Softmax Fails at the Top: Extreme Value Corrections for InfoNCE (arxiv.org)
17
1
ERICA: Quantifying Replicability of Cluster Analysis (arxiv.org)
18
1
Riemannian Stochastic Optimization for Sufficient Dimension Reduction (arxiv.org)
19
1
Interpreting FCDNNs via RG on Exponential Family (arxiv.org)
20
1
Out-of-Distribution generalization of quantile regression with heavy tailed inputs: an SVM approach (arxiv.org)
21
1
Is Zero-Shot Super-Resolution Possible in Operator Learning? (arxiv.org)
22
1
Parameter-Free and Group Conditional Online Conformal Prediction (arxiv.org)
23
1
Spectra-Guided Neural Tucker Factorization (arxiv.org)
24
1
Taming the Loss Landscape of PINNs with Noisy Feynman-Kac Supervision: Operator Preconditioning and Non-Asymptotic Error Bounds (arxiv.org)
25
1
On Median of Incomplete U-Statistics (arxiv.org)
26
1
Statistical Testing on Directed Graphs by Surrogate Data Generation (arxiv.org)
27
1
Statistical Analysis of using the Shapley Value for Sensor Anomaly Localization with Accurate Classifiers (arxiv.org)
28
1
Bandit Simulation for Average Reward Inference (arxiv.org)
29
1
Efficient Synthetic Network Generation via Latent Embedding Reconstruction (arxiv.org)
30
1
Practical and Optimal Algorithm for Linear Contextual Bandits with Rare Parameter Updates (arxiv.org)
31
1
Reinforcement Learning for Optimal Experiment Design in Parameter Identification of Mechatronic Systems (arxiv.org)
32
1
Efficient Approximation for Encoder--Decoder Neural Operators via Variation Spaces (arxiv.org)
33
1
Distribution-free changepoint localization after sequential change detection (arxiv.org)
34
1
On the Uncertainty Quantification Ability of Tabular Foundation Models (arxiv.org)
35
1
Computation-Aware Kalman Filtering with Model Selection for Neural Dynamics (arxiv.org)
36
1
Self-Regulating Annealing in Heavy-Tailed Diffusion Models (arxiv.org)
37
1
Knowledge-Intensive Video Generation (arxiv.org)
38
1
Provable Data Scaling Law for Meta Learning via Complexity Minimization (arxiv.org)
39
1
Convex Distance Operator Transport: A Convex and Geometry-Preserving Formulation (arxiv.org)
40
1
Bayesian meta-learning for modeling Alzheimer's disease progression (arxiv.org)
41
1
Identifiable Markov Switching Models with Instantaneous Effects and Exponential Families (arxiv.org)
42
1
ShaplEIG: Bayesian Experimental Design for Shapley Value Estimation (arxiv.org)
43
1
Doing well with less! On Sampling Techniques for Empirical Pairwise Loss Estimation/Minimization (arxiv.org)
44
1
Sharpness-Aware Hybrid Model Learning for Architecture-Agnostic Parameter Estimation (arxiv.org)
45
1
Hoeffding Concept Bottleneck Models with Applications to Overhead Images (arxiv.org)
46
1
Physics from Video: Identifiability of Time-Invariant Second-Order ODEs under Minimal Trajectory Conditions (arxiv.org)
47
1
Agentic Transformers Provably Learn to Search via Reinforcement Learning (arxiv.org)
48
1
InfoAtlas: A Foundation Model for Zero-Shot Statistical Dependence Estimate (arxiv.org)
49
1
Dynamics and Representation Structure of Local Approximations to Gradient-Based Learning in Linear Recurrent Neural Networks (arxiv.org)
50
1
Accurate Large-sample Uncertainty Quantification using Stochastic Gradient Markov Chain Monte Carlo (arxiv.org)