AI News

Assessment of Personality Dimensions Across Situations in Dyadic Role-Play Scenarios (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

T1-Bench: Benchmarking Multi-Scenario Agents in Real-World Domains (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Modeling Complex Behaviors: Multi-Personality Composition and Dynamic Switching in Vision-Language Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

RoboNaldo: Accurate, Stable and Powerful Humanoid Soccer Shooting via Motion-Guided Curriculum Reinforcement Learning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

PhantomBench: Benchmarking the Non-existential Threat of Language Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Towards Autonomous Accelerator Design: FPGA Accelerator Generation with SECDA (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Provenance-Grounded Gating and Adaptive Recovery in Synthetic Post-Training Data Curation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Flaws in the LLM Automation Narrative (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Piper: A Programmable Distributed Training System (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Belief Acquisition as Stochastic Filtering (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Survey on Semantic Modeling for Building Energy Management (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Position: The ML Community Must Build an AI-Augmented Peer-Review Ecosystem (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Constructing coherent spatial memory in LLM agents through graph rectification (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Integrated Real-Time Motion Tracking and AI Analysis for Athletic Performance Optimization (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

An LLM-Native Psychometric Instrument Does Not Predict LLM Behavior: Evidence Across 25 Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

The Interlocutor Effect: Why LLMs Leak More Personal Data to Agents Than Humans (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

CANVAS: Captioning Art with Narrative Visual-Audio AI Systems (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Hyperbolic Neural Population Geometry Benefits Computation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Conformal Risk Prediction for Non-Alcoholic Fatty Liver Disease Using Gradient Boosting with Distribution-Free Coverages (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Time Series as Language: A Universal Tokenizer for General-Purpose Time Series Foundation Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Blurry Window Attention (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

From Confident Closing to Silent Failure: Characterizing False Success in LLM Agents (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

SHAPE: Coalition-Aware Expert Pruning for Sparse Mixture-of-Experts LLMs (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

SocraticPO: Policy Optimization via Interactive Guidance (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

PreAct-Bench: Benchmarking Predictive Monitoring in LLMs (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Representation Curriculum: Stagewise Training for Robust Ranking and Allocation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

LLM-as-a-Discriminator: When Synthetic Tables Still Look Real (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Two to Tango: Coupled Task-Reference Selection for Safe LLM Fine-tuning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

SPACE: Source-free Proxy Anchor Concept Erasure for MLLMs (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

QSplitFL: Capability Aware Deep Q-Learning for Optimal Split Point Selection in Split Federated Learning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

TENP: Trapezoidal Expert Neuron Pruning For Mixture-of-Experts (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Tractogram foundation model (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

HMAF: A Hierarchical Multi-Slot GD-RTB Allocation Framework (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

When Attribution Patching Lies: Diagnosis and a Second-Order Correction (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Less Context, More Accuracy: A Bi-Temporal Memory Engine for LLM Agents Where a Lean Retrieved Context Beats the Full History (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

The Whale That Outswam Evolution: Swarm Intelligence Maximises Memory in Connectome Reservoirs (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

IntentKV: Cross-Turn Intent-Aware KV Cache Pruning for Agent Inference (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

SPDM: Geometry-Modulated State Space Modeling with Manifold Constraints for Time Series Forecasting (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Co-GLANCE: Uncertainty-Aware Active Perception for Heterogeneous Robot Teaming (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Conformal Prediction for Neural Operators: Distribution-Free Uncertainty Quantification in Physics Simulation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Interactions Between Crosscoder Features: A Compact Proofs Perspective (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Anomaly Detection and Root Cause Analysis for Microservice Systems (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

GAGI: A Gini-Adjusted GDP-per-Capita Index for Distribution-Aware Macroeconomic Welfare Monitoring (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Learning Where to Simulate: Generative Active Sampling for Online PDE Surrogate Training (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments
