AI News

⚡ 7 minutes ago
1
1
SPADE: Sketch-guided Path Planning Augmented with Diffusion Experts (arxiv.org)
2
1
Towards Blind Lens Aberration Correction via Large LensLib Pre-training and Discrete Degradation Priors (arxiv.org)
3
1
BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents (arxiv.org)
4
1
Suboptimality bounds for trace-bounded SDPs enable a faster and scalable low-rank SDP solver SDPLR+ (arxiv.org)
5
1
Learning Power Flow with Confidence: A Probabilistic Guarantee Framework for Voltage Risk (arxiv.org)
6
1
GFFMERGE: Efficient Merging of Graph Neural Force Fields and Beyond (arxiv.org)
7
1
Assessing and Mitigating Miscalibration in LLM-Based Social Science Measurement (arxiv.org)
8
1
FutureWeaver: Planning Test-Time Compute for Multi-Agent Systems with Modularized Collaboration (arxiv.org)
9
1
Causal Preference Elicitation (arxiv.org)
10
1
Scalable On-Hardware Training of Quantum Neural Networks and Application to Clinical Data Imputation (arxiv.org)
11
1
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model Enhancement (arxiv.org)
12
1
SAIL: Sound Abstract Interpreters with LLMs (arxiv.org)
13
1
Sample-Size Scaling of the African Languages NLI Evaluation (arxiv.org)
14
1
From Control Boundary to Insurance Claim: Reconstructing AI-Mediated Losses Through the CER Framework (arxiv.org)
15
1
PerchRL: Vision-Based Agile Perching on Inclined Platforms under Rapid and Irregular Motion (arxiv.org)
16
1
Self-Soupervision: Cooking Model Soups without Labels (arxiv.org)
17
1
PieArena: Ranking and Profiling Language Agents in Realistic Negotiation Scenarios (arxiv.org)
18
1
TimeOmni-VL: Unified Models for Time Series Understanding and Generation (arxiv.org)
19
1
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation (arxiv.org)
20
1
When Should LLMs Be Less Specific? Selective Abstraction for Reliable Long-Form Text Generation (arxiv.org)
21
1
When RLHF Fails: A Mechanistic Taxonomy of Reward Hacking, Collapse, and Evaluator Gaming (arxiv.org)
22
1
IdEst: Assessing Self-Supervised Learning Representations via Intrinsic Dimension (arxiv.org)
23
1
Signed Spiking Neuron Enabled by an Orthogonal-Easy-Axis Magnetic Tunnel Junction (arxiv.org)
24
1
MemVerse: Multimodal Memory for Lifelong Learning Agents (arxiv.org)
25
1
A Cartesian-3j Framework for Machine Learning Interatomic Potentials (arxiv.org)
26
1
Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions (arxiv.org)
27
1
Training a Predictive Coding Network on ImageNet using Equilibrium Propagation (arxiv.org)
28
1
Target Updates May Stabilize Linear Q-Learning: Periodic and Soft Dynamics (arxiv.org)
29
1
Generalizing Graph Foundation Models via Hyperbolic Retrieval-Augmented Generation (arxiv.org)
30
1
MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models (arxiv.org)
31
1
A Single-Loop Bilevel Deep Learning Method for Optimal Control of Obstacle Problems (arxiv.org)
32
1
Inference Cost Attacks for Retrieval-Augmented Large Language Models (arxiv.org)
33
1
Spatial Transcriptomics-Guided Alignment Enhances Molecular Profiling in Pathology Foundation Model (arxiv.org)
34
1
EqGINO: Equivariant Geometry-Informed Fourier Neural Operators for 3D PDEs (arxiv.org)
35
1
Grasp-Then-Plan with Failure Attribution: A Closed Two-Stage Framework for Precise and Generalizable Robotic Manipulation (arxiv.org)
36
1
PRISM: Synergizing Vision Foundation Models via Self-organized Expert Specialization (arxiv.org)
37
1
Critical evaluation of PINN for FWD inverse analysis and differentiable FEM as an alternative (arxiv.org)
38
1
Bayesian Tensor Decomposition with Diffusion Model Prior (arxiv.org)
39
1
Resource-Constrained Adaptive Inference for Sequential Pricing (arxiv.org)
40
1
ScoreStop: Gradient-based early stopping using functional score tests (arxiv.org)
41
1
Gender-Dependent Diagnostic Substitution in LLM Medical Triage: Same Symptoms, Unequal Urgency (arxiv.org)
42
1
HARVE: Hacking-Aware Reward-Head Vector Editing for Robust Reward Models (arxiv.org)
43
1
From Answers to States: Verifiable Process-Level Evaluation of Chemical Reasoning in Large Language Models (arxiv.org)
44
1
FGRPO: Federated GRPO with Adaptive Aggregation on Non-IID Data (arxiv.org)
45
1
Constitutional On-Policy Safe Distillation (arxiv.org)
46
1
Synthetic Hallucinations, Real Gains: Hard Negatives from Frontier Models for FIM Hallucination Mitigation (arxiv.org)
47
1
Rethinking Neural Width for Alternating Current Optimal Power Flow Proxies (arxiv.org)
48
1
Dynamic Objective Selection with Safeguards and LLM Oversight for Financial Decision-Making (arxiv.org)
49
1
EvoDrive: Pareto Evolution for Safety-Critical Autonomous Driving via Self-Improving LLM Agents (arxiv.org)
50
1
Towards Fair Graph Prompting: A Dual-Prompt Mechanism for Mitigating Attribute and Structural Bias (arxiv.org)