AI News

⚡ 2 minutes ago
1
1
Improvise, Adapt, Overcome: An On-The-Fly Multifidelity Algorithm for Efficient Machine Learning (arxiv.org)
2
1
AdaWeather: Adaptively Mixing Probabilistic Weather Forecasts with Logarithmic Regret (arxiv.org)
3
1
When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning (arxiv.org)
4
1
NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation (arxiv.org)
5
1
A Scoping Review of the Ethical Perspectives on Anthropomorphising Large Language Model-Based Conversational Agents (arxiv.org)
6
1
Auditable Climate Risk Intelligence from Fragmented ESG Data: Deterministic Orchestration and Imbalance-Aware Learning for Scope 1-3 Validation (arxiv.org)
7
1
EvoDS: Self-Evolving Autonomous Data Science Agent with Skill Learning and Context Management (arxiv.org)
8
1
When RLHF Fails: A Mechanistic Taxonomy of Reward Hacking, Collapse, and Evaluator Gaming (arxiv.org)
9
1
IdEst: Assessing Self-Supervised Learning Representations via Intrinsic Dimension (arxiv.org)
10
1
Visual Graph Scaffolds for Structural Reasoning in Large Language Models (arxiv.org)
11
1
BehaviorBench: Modeling Real-World User Decisions from Behavioral Traces (arxiv.org)
12
1
Building Better Activation Oracles (arxiv.org)
13
1
CL-DMDF:Dynamic Multimodal Data Fusion Model Based on Contrastive Learning (arxiv.org)
14
1
Locality Does Not Imply Reachability: Boundary Repair in Block-Sparse Causal Attention (arxiv.org)
15
1
Aligning Data-Driven Predictors with Allocation: A Decision-Focused Approach to Survival Analysis (arxiv.org)
16
1
Thinking Past the Answer: Evaluating Harmful Overthinking in Large Reasoning Models (arxiv.org)
17
1
Before Fusion, Ask What to Keep: Contextual Calibration of Multimodal Signals (arxiv.org)
18
1
$\Psi$-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues (arxiv.org)
19
1
Diagnosing Knowledge Gaps in LLM Tool Use: An Agentic Benchmark for Novel API Acquisition (arxiv.org)
20
1
FGRPO: Federated GRPO with Adaptive Aggregation on Non-IID Data (arxiv.org)
21
1
Rethinking Neural Width for Alternating Current Optimal Power Flow Proxies (arxiv.org)
22
1
Dynamic Objective Selection with Safeguards and LLM Oversight for Financial Decision-Making (arxiv.org)
23
1
SkillPyramid: A Hierarchical Skill Consolidation Framework for Self-Evolving Agents (arxiv.org)
24
1
DECA: Decentralizing Block-Wise Adam for Efficient LLM Full-Parameter Fine-Tuning on Non-IID Data (arxiv.org)
25
1
FederatedSkill: Federated Learning for Agentic Skill Evolution (arxiv.org)
26
1
Making Brain-Computer Interfaces More Secure (arxiv.org)
27
1
BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents (arxiv.org)
28
1
Hybrid Adaptive Kalman Filtering for Data-Efficient Joint Tracking and Classification (arxiv.org)
29
1
Right Makes Might: Aligning Verified Hidden States Empowers RL Reasoning (arxiv.org)
30
1
GFFMERGE: Efficient Merging of Graph Neural Force Fields and Beyond (arxiv.org)
31
1
Leveraging BART to Assess CS1 C++ Programming Assignments using Rubric-based Criteria (arxiv.org)
32
1
Spectral Asymptotics of Neural Network Loss Landscapes: An Exact Decomposition of the Curvature Exponent (arxiv.org)
33
1
Are Common Substructures Transferable? Riemannian Graph Foundation Model with Neural Vector Bundles (arxiv.org)
34
1
"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems (arxiv.org)
35
1
Theoretical Aspects of Lie Groupoid and Lie Algebroid Equivariant Convolutional Neural Networks (arxiv.org)
36
1
Causal Preference Elicitation (arxiv.org)
37
1
Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill (arxiv.org)
38
1
Neuron Populations Exhibit Divergent Selectivity with Scale (arxiv.org)
39
1
ROBUST-WT: Robust Uncertainty-aware Segmentation Transform via Whitening and Training Enhancements (arxiv.org)
40
1
Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates (arxiv.org)
41
1
Applying Two-Grid Preconditioner for Subsurface Flow Simulation using Attention-enhanced Hybrid Network to Accelerate Multiscale Discretization in High-contrast Media (arxiv.org)
42
1
Bayesian Tensor Decomposition with Diffusion Model Prior (arxiv.org)
43
1
Calibrating Urban Traffic Simulation from Sparse Road Observations via Genetic Optimization (arxiv.org)
44
1
High-Dimensional Latents Should Be Diagnosed Through Phase Structure (arxiv.org)
45
1
Cross-Modal Contrastive Learning of ECG and Angiography Representations for Severe Stenosis Classification (arxiv.org)
46
1
Learning Self-Interpretation from Interpretability Artifacts: Training Lightweight Adapters on Vector-Label Pairs (arxiv.org)
47
1
Learning Temporal Causal Structure via Smooth Differentiable Optimization (arxiv.org)
48
1
Social Caption: Evaluating Social Understanding in Multimodal Models (arxiv.org)
49
1
Plan, Verify and Fill: A Structured Parallel Decoding Approach for Diffusion Language Models (arxiv.org)
50
1
Spike-Aware C++ INT8 Inference for Sparse Spiking Language Models on Commodity CPUs (arxiv.org)