AI News

⚡ 9 minutes ago
1
1
Assessment of Personality Dimensions Across Situations in Dyadic Role-Play Scenarios (arxiv.org)
2
1
T1-Bench: Benchmarking Multi-Scenario Agents in Real-World Domains (arxiv.org)
3
1
Modeling Complex Behaviors: Multi-Personality Composition and Dynamic Switching in Vision-Language Models (arxiv.org)
4
1
RoboNaldo: Accurate, Stable and Powerful Humanoid Soccer Shooting via Motion-Guided Curriculum Reinforcement Learning (arxiv.org)
5
1
PhantomBench: Benchmarking the Non-existential Threat of Language Models (arxiv.org)
6
1
FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model (arxiv.org)
7
1
Towards Autonomous Accelerator Design: FPGA Accelerator Generation with SECDA (arxiv.org)
8
1
Provenance-Grounded Gating and Adaptive Recovery in Synthetic Post-Training Data Curation (arxiv.org)
9
1
Flaws in the LLM Automation Narrative (arxiv.org)
10
1
Piper: A Programmable Distributed Training System (arxiv.org)
11
1
EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents (arxiv.org)
12
1
Belief Acquisition as Stochastic Filtering (arxiv.org)
13
1
A Survey on Semantic Modeling for Building Energy Management (arxiv.org)
14
1
A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications (arxiv.org)
15
1
Position: The ML Community Must Build an AI-Augmented Peer-Review Ecosystem (arxiv.org)
16
1
Constructing coherent spatial memory in LLM agents through graph rectification (arxiv.org)
17
1
Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction (arxiv.org)
18
1
Integrated Real-Time Motion Tracking and AI Analysis for Athletic Performance Optimization (arxiv.org)
19
1
An LLM-Native Psychometric Instrument Does Not Predict LLM Behavior: Evidence Across 25 Models (arxiv.org)
20
1
The Interlocutor Effect: Why LLMs Leak More Personal Data to Agents Than Humans (arxiv.org)
21
1
CANVAS: Captioning Art with Narrative Visual-Audio AI Systems (arxiv.org)
22
1
Hyperbolic Neural Population Geometry Benefits Computation (arxiv.org)
23
1
Conformal Risk Prediction for Non-Alcoholic Fatty Liver Disease Using Gradient Boosting with Distribution-Free Coverages (arxiv.org)
24
1
Time Series as Language: A Universal Tokenizer for General-Purpose Time Series Foundation Models (arxiv.org)
25
1
Blurry Window Attention (arxiv.org)
26
1
From Confident Closing to Silent Failure: Characterizing False Success in LLM Agents (arxiv.org)
27
1
Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation (arxiv.org)
28
1
SHAPE: Coalition-Aware Expert Pruning for Sparse Mixture-of-Experts LLMs (arxiv.org)
29
1
SocraticPO: Policy Optimization via Interactive Guidance (arxiv.org)
30
1
PreAct-Bench: Benchmarking Predictive Monitoring in LLMs (arxiv.org)
31
1
Representation Curriculum: Stagewise Training for Robust Ranking and Allocation (arxiv.org)
32
1
LLM-as-a-Discriminator: When Synthetic Tables Still Look Real (arxiv.org)
33
1
Two to Tango: Coupled Task-Reference Selection for Safe LLM Fine-tuning (arxiv.org)
34
1
SPACE: Source-free Proxy Anchor Concept Erasure for MLLMs (arxiv.org)
35
1
QSplitFL: Capability Aware Deep Q-Learning for Optimal Split Point Selection in Split Federated Learning (arxiv.org)
36
1
TENP: Trapezoidal Expert Neuron Pruning For Mixture-of-Experts (arxiv.org)
37
1
Tractogram foundation model (arxiv.org)
38
1
HMAF: A Hierarchical Multi-Slot GD-RTB Allocation Framework (arxiv.org)
39
1
When Attribution Patching Lies: Diagnosis and a Second-Order Correction (arxiv.org)
40
1
Less Context, More Accuracy: A Bi-Temporal Memory Engine for LLM Agents Where a Lean Retrieved Context Beats the Full History (arxiv.org)
41
1
The Whale That Outswam Evolution: Swarm Intelligence Maximises Memory in Connectome Reservoirs (arxiv.org)
42
1
RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference (arxiv.org)
43
1
IntentKV: Cross-Turn Intent-Aware KV Cache Pruning for Agent Inference (arxiv.org)
44
1
SPDM: Geometry-Modulated State Space Modeling with Manifold Constraints for Time Series Forecasting (arxiv.org)
45
1
Co-GLANCE: Uncertainty-Aware Active Perception for Heterogeneous Robot Teaming (arxiv.org)
46
1
Conformal Prediction for Neural Operators: Distribution-Free Uncertainty Quantification in Physics Simulation (arxiv.org)
47
1
Interactions Between Crosscoder Features: A Compact Proofs Perspective (arxiv.org)
48
1
Anomaly Detection and Root Cause Analysis for Microservice Systems (arxiv.org)
49
1
GAGI: A Gini-Adjusted GDP-per-Capita Index for Distribution-Aware Macroeconomic Welfare Monitoring (arxiv.org)
50
1
Learning Where to Simulate: Generative Active Sampling for Online PDE Surrogate Training (arxiv.org)