AI News

1. From Script to Semantics: Prompting Strategies for African NLI (arxiv.org)
2. Introduction to optimization methods for training SciML models (arxiv.org)
3. KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators (arxiv.org)
4. Outsmarting the Chameleon: Counterfactual Decoupling for Tactical OOD Shifts in Live Streaming Risk Assessment (arxiv.org)
5. Sparse-View Lung Nodule Volumetry from Digitally Reconstructed Radiographs via AReT: Anatomy-Regularized TensoRF (arxiv.org)
6. Efficient Hyperparameter Optimization for LLM Reinforcement Learning (arxiv.org)
7. Hierarchical RBF-KAN and RBF-SKAN Architectures for Multidimensional Function Approximation and Random Field Learning (arxiv.org)
8. Cross-Lingual Token Arbitrage: Optimizing Code Agent Context Windows via Local LLM Preprocessing (arxiv.org)
9. Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward (arxiv.org)
10. Coupled Local and Global World Models for Efficient First Order RL (arxiv.org)
11. Easy-to-Use Shielding for Reinforcement Learning (arxiv.org)
12. Perceive Before Reasoning: A Pre-Reasoning Perception Framework for Efficient and Reliable Proactive Mobile Agents (arxiv.org)
13. CP-Agent: Context-Aware Multimodal Reasoning for Cellular Morphological Profiling under Chemical Perturbations (arxiv.org)
14. Weak Diffusion Priors Can Still Achieve Strong Inverse-Problem Performance (arxiv.org)
15. Signed Spiking Neuron Enabled by an Orthogonal-Easy-Axis Magnetic Tunnel Junction (arxiv.org)
16. DXA-Derived Skeletal Phenotypes and Hip Fracture Risk: A Backdoor-Adjusted Causal Analysis (arxiv.org)
17. Overlaying Governance: A Compositional Authorization Framework for Delegation and Scope in Agentic AI (arxiv.org)
18. Effect of Demographic Bias on Skin Lesion Classification (arxiv.org)
19. LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks (arxiv.org)
20. FFR: Forward-Forward Learning for Regression (arxiv.org)
21. LLM-Assisted Reranking to Operationalize Nuanced Objectives in Recommender Systems (arxiv.org)
22. MedCUA-Bench: A Screenshot-Only Benchmark for Clinical Computer-Use Agents (arxiv.org)
23. Think-Before-Speak: From Internal Evaluation to Public Expression in Multi-Agent Social Simulation (arxiv.org)
24. A Negative Result on Cross-Model Activation Transfer in a Pythia Multi-Hop Setting (arxiv.org)
25. Efficient ASR Training with Conversations that Never Happened (arxiv.org)
26. Distilling Answer-Set Programming Rules from LLMs for Neurosymbolic Visual Question Answering (arxiv.org)
27. PURGE: Projected Unlearning via Retain-Guided Erasure (arxiv.org)
28. Human-in-the-Loop Contextual Bandits for Short-Term Rental Dynamic Pricing: Structural Equivalence of Historical Warm-Up and Approval-Gated Live Learning (arxiv.org)
29. Trans GAN-WT: A Feature Extraction and Interactive Learning-Based Anomaly Detection Model for Wind Turbine Time Series Data (arxiv.org)
30. Low-Frequency Shortcuts in Texture-Driven Visual Learning (arxiv.org)
31. The Epi-LLM Framework: probing LLM behavioral priors through epidemiological agent-based models (arxiv.org)
32. Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking (arxiv.org)
33. Uncertainty-Aware Clarification in LLM Agents with Information Gain (arxiv.org)
34. Quantifying Faithful Confidence Expression in Large Reasoning Models (arxiv.org)
35. Relational Linearity is a Predictor of Hallucinations (arxiv.org)
36. MAdam: Metric-Aware Multi-Objective Adam (arxiv.org)
37. MUSE: A Unified Agentic Harness for MLLMs (arxiv.org)
38. FlowGuard: Flow Matching for Identity-Independent Detection of Data-Free Model Stealing Attacks on Energy System Intrusion Detection Systems (arxiv.org)
39. dstack-capsule: Pod-Level Remote Attestation for Confidential Workloads on Kubernetes (arxiv.org)
40. Multi-component Causal Tracing in Large Language Models (arxiv.org)
41. A Nonmonotone Gradient-Based Algorithm for Symmetric Nonnegative Matrix Factorization and Graph Clustering (arxiv.org)
42. Towards Fair Graph Prompting: A Dual-Prompt Mechanism for Mitigating Attribute and Structural Bias (arxiv.org)
43. Grasp-Then-Plan with Failure Attribution: A Closed Two-Stage Framework for Precise and Generalizable Robotic Manipulation (arxiv.org)
44. RRISE: Robust Radius Inference via a Surrogate Estimator (arxiv.org)
45. TreeFlash: Parallel AR-Approximation for Faster Speculative Decoding (arxiv.org)
46. A Systematic Evaluation of Current Architectures in Wind Power Forecasting (arxiv.org)
47. Learning without training: The implicit dynamics of in-context learning (arxiv.org)
48. Using Reward Uncertainty to Induce Diverse Behaviour in Reinforcement Learning (arxiv.org)
49. From Long News to Accurate Forecast: Importance-Aware Fusion and PRM-Guided Reflection for Time Series Forecasting (arxiv.org)
50. DELTAMEM: Incremental Experience Memory for LLM Agents via Residual Trees (arxiv.org)