AI News

1
1
Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward (arxiv.org)
2
1
Critical evaluation of PINN for FWD inverse analysis and differentiable FEM as an alternative (arxiv.org)
3
1
Coupled Local and Global World Models for Efficient First Order RL (arxiv.org)
4
1
vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models (arxiv.org)
5
1
Auditing Engagement Incentives in the Kidfluencer Ecosystem: A Multimodal Weak Supervision Approach (arxiv.org)
6
1
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning (arxiv.org)
7
1
Introduction to optimization methods for training SciML models (arxiv.org)
8
1
FRED: A Multi-Modal Autonomous Driving Dataset for Flooded Road Environments (arxiv.org)
9
1
Mitigating False Credit Propagation: Probabilistic Graphical Reward Aggregation for Rubric-Based Reinforcement Learning (arxiv.org)
10
1
A Robust Optimization Approach to Sparse Principal Component Analysis (arxiv.org)
11
1
Few-Shot Prediction for Pulsar Noise with Long Short-Term Memory Network (arxiv.org)
12
1
Set-Preserving Calibration from Conformal P-Values to E-Values (arxiv.org)
13
1
Estimating Bidirectional Causal Effects with Large Scale Online Kernel Learning (arxiv.org)
14
1
Text-attributed Graph Condensation via Text Selection and Attribute Matching (arxiv.org)
15
1
An Asymptotic Theory of Chain-of-Thought in In-Context Learning (arxiv.org)
16
1
Local and Global Contraction Principles for MCMC Mixing (arxiv.org)
17
1
Analytical Evaluation of DCA Convergence Properties for Minimizing Prediction Functions of Gaussian RBF Support Vector Regression (arxiv.org)
18
1
Do Real-World Datasets Contain Natural Experiments? An Empirical Study Using Causal Feature Selection (arxiv.org)
19
1
Online Learning with Gradient-Variation Interval Regret (arxiv.org)
20
1
Phantom Transfer: Data Poisoning can Survive Data-Level Defences (arxiv.org)
21
1
Neural Networks Provably Learn Spectral Representations for Group Composition (arxiv.org)
22
1
PrimeSVT: An Automated Memory-aware Pruning Framework with Prioritized Compression Policy for Spiking Vision Transformers (arxiv.org)
23
1
Qift: Shift-Friendly No-Zero W2 Post-Training Quantization for Rotated W2A4/KV4 LLM Inference (arxiv.org)
24
1
Link Prediction or Perdition: the Seeds of Instability in Knowledge Graph Embeddings (arxiv.org)
25
1
High-Precision APT Malware Attribution with Out-of-Scope Resilience (arxiv.org)
26
1
Generating Rectifiable Measures through Neural Networks (arxiv.org)
27
1
Physics-informed diffusion models in spectral space (arxiv.org)
28
1
Analyzing Stream Collapse in Hyper-Connections: From Diagnosis to Mitigation (arxiv.org)
29
1
Wavelet as Tokenizer: Preliminary Results on a Shared Wavelet Token Schema for Natural Signals (arxiv.org)
30
1
Handoff Debt: The Rediscovery Cost When Coding Agents Take Over Interrupted Tasks (arxiv.org)
31
1
FinStressTS: A Parametric Synthetic Benchmark for Time-Series Forecasting in Finance (arxiv.org)
32
1
AUDITFLOW: Executable Symbolic Environments for Structured Financial Reporting Verification (arxiv.org)
33
1
Inducing Reasoning Primitives from Agent Traces (arxiv.org)
34
1
WISE-HAR: A Generalizable Ensemble Deep Learning Framework for WiFi-Based Human Activity Recognition (arxiv.org)
35
1
Validation-Gated Multi-Agent Governance for Online Adaptation of Thermal-Hydraulic Surrogate Models under Operating-Regime Shift (arxiv.org)
36
1
Assistax: A Multi-Agent Hardware-Accelerated Reinforcement Learning Benchmark for Assistive Robotics (arxiv.org)
37
1
GTBench: A Curriculum-Grounded Benchmark for Evaluating LLMs as Mathematical Research Assistants in Graph Theory (arxiv.org)
38
1
RogueMerge: Robust and Unified Attacks against LLM Model Merging (arxiv.org)
39
1
Localized, High-resolution Geographic Representations with Slepian Functions (arxiv.org)
40
1
Multi-Modal Graph Neural Network with Transformer-Guided Adaptive Diffusion for Preclinical Alzheimer Classification (arxiv.org)
41
1
From Long News to Accurate Forecast: Importance-Aware Fusion and PRM-Guided Reflection for Time Series Forecasting (arxiv.org)
42
1
DELTAMEM: Incremental Experience Memory for LLM Agents via Residual Trees (arxiv.org)
43
1
RRISE: Robust Radius Inference via a Surrogate Estimator (arxiv.org)
44
1
ClinicalMC: A Benchmark for Multi-Course Clinical Decision-Making with Large Language Models (arxiv.org)
45
1
Cost-Aware Query Routing in RAG: Empirical Analysis of Retrieval Depth Tradeoffs (arxiv.org)
46
1
DeskCraft: Benchmarking Desktop Agents on Professional Workflows and Human-in-the-Loop Collaboration (arxiv.org)
47
1
CARVE: Certified Affordable Repair of Vetoed Maneuvers via Envelopes for Interactive Driving (arxiv.org)
48
1
Human-in-the-Loop Contextual Bandits for Short-Term Rental Dynamic Pricing: Structural Equivalence of Historical Warm-Up and Approval-Gated Live Learning (arxiv.org)
49
1
Clustered Self-Assessment: A Simple yet Effective Method for Uncertainty Quantification in Large Language Models (arxiv.org)
50
1
The Violation Situation Pattern: A Knowledge-Graph Pattern for Compliance Violations (arxiv.org)