AI News

1
1
Collaborative and Efficient Fine-tuning: Leveraging Task Similarity (arxiv.org)
2
1
Universal One-third Time Scaling in Learning Peaked Distributions (arxiv.org)
3
1
Independent Component Discovery in Temporal Count Data (arxiv.org)
4
1
Safeguarded Stochastic Polyak Step Sizes for Non-smooth Optimization: Robust Performance Without Small (Sub)Gradients (arxiv.org)
5
1
Trajectory Data Suffices for Statistically Efficient Policy Evaluation in Fixed-Horizon Offline RL with Linear $q^\pi$-Realizability and Concentrability (arxiv.org)
6
1
Interpretable Self-Supervised Learning via Representer Landmarks and Nyström Approximation (arxiv.org)
7
1
Human in the Loop Adaptive Optimization for Improved Time Series Forecasting (arxiv.org)
8
1
Global Convergence of Adaptive Sensing for Principal Eigenvector Estimation (arxiv.org)
9
1
Advancing Local Clustering on Graphs via Compressive Sensing: Semi-supervised and Unsupervised Methods (arxiv.org)
10
1
Non-vacuous Generalization Bounds for Deep Neural Networks without any modification to the trained models (arxiv.org)
11
1
Targeted Data Fusion for Region-Specific Survival Effects in the AMP HIV Prevention Trials (arxiv.org)
12
1
Towards Simple and Provable Parameter-Free Adaptive Gradient Methods (arxiv.org)
13
1
Challenges in the calibration of tree-based models for imbalanced classification (arxiv.org)
14
1
Fixed-Mean Gaussian Processes for Post-hoc Bayesian Deep Learning (arxiv.org)
15
1
Dimension Reduction via Sum-of-Squares and Improved Clustering Algorithms for Non-Spherical Mixtures (arxiv.org)
16
1
A Likelihood Approach for Inference of Population Heterogeneity in Particle Ensembles with Second-Order Langevin Dynamics (arxiv.org)
17
1
Near-Optimal and Tractable Estimation under Shift-Invariance (arxiv.org)
18
1
Optimizing accuracy and diversity: a multi-task approach to forecast combinations (arxiv.org)
19
1
Adaptive Querying with AI Persona Priors (arxiv.org)
20
1
Beyond Procedure: Substantive Fairness in Conformal Prediction (arxiv.org)
21
1
Rethinking Bregman Divergences in Kronecker-Factored Optimizers (arxiv.org)
22
1
PropLLM: Propagation-Aware Scene Reconstruction for Network Fault Diagnosis (arxiv.org)
23
1
HomeFlow: A Data Flywheel for Smart Home Agent Training with Verifiable Simulation (arxiv.org)
24
1
Application of Algorithms in Energy-Efficient Design Platforms for Green Building (arxiv.org)
25
1
Can LLM Agents Sustain Long-Horizon Organizational Dynamics? (arxiv.org)
26
1
The Case for Model Science: Verify, Explore, Steer, Refine (arxiv.org)
27
1
"Skill issues": data-centric optimization of lakehouse agents (arxiv.org)
28
1
Deft Scheduling of Dynamic Cloud Workflows with Varying Deadlines via Mixture-of-Experts (arxiv.org)
29
1
Emergent Ordinal Geometry in Transformers Trained on Local Comparisons (arxiv.org)
30
1
DREAM-S: Speculative Decoding with Searchable Drafting and Target-Aware Refinement for Multimodal Generation (arxiv.org)
31
1
Generate in Reconstruction Space, Match in Semantic Space: Transport Geometry for One-Step Generation (arxiv.org)
32
1
Saliency-Aware Model Merging (arxiv.org)
33
1
TabChange: Precise Attribute Changes in Tabular Data (arxiv.org)
34
1
Torus Graphs for Large Scale Neural Phase Analysis (arxiv.org)
35
1
Expected Value Alignment for Generative Reward Modeling in Formal Mathematics Verification (arxiv.org)
36
1
Reasoning4Sciences: Bridging Reasoning Language Models to All Scientific Branches (arxiv.org)
37
1
SkillRevise: Improving LLM-Authored Agent Skills via Trace-Conditioned Skill Revision (arxiv.org)
38
1
Diagnosing LLM Arbitration Behavior over Pre-evidence Epistemic States in RAG-based Fact-Checking (arxiv.org)
39
1
CAREAgent: Clinical Agent with Structured Reasoning and Tool-Integrated for Order Generation (arxiv.org)
40
1
ProjQ: Project-and-Quantize for Adapter-Aware LLM Compression (arxiv.org)
41
1
EST-PRM: Stress-Testing Process Reward Models Before They Become Load-Bearing (arxiv.org)
42
1
Grounded Decoding: Retrieval-Anchored Probability Fusion for Faithful RAG (arxiv.org)
43
1
Variance-sensitive Thompson sampling for generalised linear bandits, revisited (arxiv.org)
44
1
Topology-Aware State Abstraction with Tangle Cores for Markov Decision Processes (arxiv.org)
45
1
Deep Learning as the Disciplined Construction of Tame Objects (arxiv.org)
46
1
Before the Model Learns the Bug: Fuzzing RLVR Verifiers (arxiv.org)
47
1
MindClaw: Closed-Loop Embodied Mental-State Reasoning for Precision Intervention (arxiv.org)
48
1
DAG-MoE: From Simple Mixture to Structural Aggregation in Mixture-of-Experts (arxiv.org)
49
1
AnyEdit++: Adaptive Long-Form Knowledge Editing via Bayesian Surprise (arxiv.org)
50
1
TravelEval: A Comprehensive Benchmarking Framework for Evaluating LLM-Powered Travel Planning Agents (arxiv.org)