AI News

1
1
EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents (arxiv.org)
2
1
Piper: A Programmable Distributed Training System (arxiv.org)
3
1
Flaws in the LLM Automation Narrative (arxiv.org)
4
1
Provenance-Grounded Gating and Adaptive Recovery in Synthetic Post-Training Data Curation (arxiv.org)
5
1
Towards Autonomous Accelerator Design: FPGA Accelerator Generation with SECDA (arxiv.org)
6
1
FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model (arxiv.org)
7
1
PhantomBench: Benchmarking the Non-existential Threat of Language Models (arxiv.org)
8
1
RoboNaldo: Accurate, Stable and Powerful Humanoid Soccer Shooting via Motion-Guided Curriculum Reinforcement Learning (arxiv.org)
9
1
Modeling Complex Behaviors: Multi-Personality Composition and Dynamic Switching in Vision-Language Models (arxiv.org)
10
1
T1-Bench: Benchmarking Multi-Scenario Agents in Real-World Domains (arxiv.org)
11
1
Assessment of Personality Dimensions Across Situations in Dyadic Role-Play Scenarios (arxiv.org)
12
1
LLM-Aided Joint Secrecy Precoding and Trajectory for RSMA-Based Heterogeneous UAV Networks (arxiv.org)
13
1
A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design (arxiv.org)
14
1
ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark (arxiv.org)
15
1
Attacks on Machine-Text Detectors Retain Stylistic Fingerprints (arxiv.org)
16
1
CleanPatrick: A Benchmark for Image Data Cleaning (arxiv.org)
17
1
Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines (arxiv.org)
18
1
NuWa: Deriving Lightweight Class-Specific Vision Transformers for Edge Devices (arxiv.org)
19
1
Whisper-GPT -- Continuous Discrete Hybrid Representation Language Models For Speech And Music (arxiv.org)
20
1
Visual-TCAV: Concept-based Attribution and Saliency Maps for Post-hoc Explainability in Image Classification (arxiv.org)
21
1
BadRobot: Jailbreaking Embodied LLM Agents in the Physical World (arxiv.org)
22
1
Atomic Intent Reasoning: Bringing LLM Semantics to Industrial Cross-Domain Recommendations (arxiv.org)
23
1
Designed by Journalists, but Is It for Readers? Rethinking AI Disclosures and Transparency in News (arxiv.org)
24
1
Routing-Aware Expert Calibration for Machine Unlearning in Mixture-of-Experts Language Models (arxiv.org)
25
1
Building Change Detection in Earthquake: A Multi-Scale Interaction Network and A Change Detection Dataset (arxiv.org)
26
1
Content-Induced Spatial-Spectral Aggregation Network for Change Detection in Remote Sensing Images (arxiv.org)
27
1
Baseline-Free Policy Optimization for Neural Combinatorial Optimization (arxiv.org)
28
1
The Confident Liar: Diagnosing Multi-Agent Debate with Log-Probabilities and LLM-as-Judge (arxiv.org)
29
1
LLM-Guided Neural Architecture Search for Robust Co-Design of Physical Neural Networks (arxiv.org)
30
1
Towards Robust Arabic Speech Emotion Recognition with Deep Learning (arxiv.org)
31
1
Infini Memory: Maintainable Topic Documents for Long-Term LLM Agent Memory (arxiv.org)
32
1
Business World Model (arxiv.org)
33
1
Efficient AI-Inspired Reduction of Feynman Integrals via Tube Seeding (arxiv.org)
34
1
Do LLMs Make Neural Distinguishers Wise? (arxiv.org)
35
1
An adaptive framework for the axisymmetric pulsar magnetosphere using physics-informed Kolmogorov-Arnold networks (arxiv.org)
36
1
ClusBench: The Clustering Benchmark Data Resource You've All Been Waiting For (?) (arxiv.org)
37
1
Profy: Interpretable Visualization of Expertise-Dependent Motor Skills Toward Supporting Piano Practice (arxiv.org)
38
1
A Bayesian Network Approach for Enhancing Security-Focused Decision Support Systems (arxiv.org)
39
1
The Role of Feedback Alignment in Self-Distillation (arxiv.org)
40
1
ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models (arxiv.org)
41
1
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity (arxiv.org)
42
1
Monte Carlo Pass Search: Using Trajectory Generation for 3D Counterfactual Pass Evaluation in Football (arxiv.org)
43
1
Generating Concept Lexicalizations via Dictionary-Based Cross-Lingual Sense Projection (arxiv.org)
44
1
Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations (arxiv.org)
45
1
Dmsh: A Multi-Agent Reinforcement Learning Framework for All-Quad Mesh Generation (arxiv.org)
46
1
Toward Proactive RF Charging Scheduling: Generative AI for Decision Support (arxiv.org)
47
1
Accelerating SAV-based optimization via randomized low-rank Hessian approximation (arxiv.org)
48
1
Few-step Generative Models as Lossy Compression (arxiv.org)
49
1
ASTRA-sim 3.0: Next-Level Distributed Machine Learning Simulations via High-Fidelity GPU and Infrastructure Modeling (arxiv.org)
50
1
PADD: Path-Aligned Decompression Distillation for Non-Router Teacher to Guide MoE Student Learning (arxiv.org)