AI News

datacenter latest today hot

A systematic investigation of molecular encoding methods for drug property predictions across neural network and Transformer encoder-based model (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Comparative evaluation of training strategies using partially labelled datasets for segmentation of white matter hyperintensities and stroke lesions in FLAIR MRI (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

UA-DCM: Uncertainty-aware Causal Decision Making via Effect Bound Decomposition (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Families of Control-Cost-Parametrized Inverse-Optimal Universal Stabilizers (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

MilliVid: Hierarchical Latents for Long-Range Consistency in Video Generation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Alcmean's: Unsupervised community detection using local Laplacian, automatic detection of the number of centers (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Nonparametric LLM Evaluation from Preference Data (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Cosmo3DFlow: Wavelet Flow Matching for Spatial-to-Spectral Compression in Reconstructing the Early Universe (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Latent Spherical Flow Policy for Reinforcement Learning with Combinatorial Actions (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Optimal Fair Aggregation of Crowdsourced Noisy Labels using Demographic Parity Constraints (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Your Self-Play Algorithm is Secretly an Adversarial Imitator: Understanding LLM Self-Play through the Lens of Imitation Learning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

The Label Horizon Paradox: Rethinking Supervision Targets in Financial Forecasting (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

On the Superlinear Relationship between SGD Noise Covariance and Loss Landscape Curvature (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Foundation Inference Models for Ordinary Differential Equations (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Graphop Analysis of Graph Neural Networks on Sparse Graphs: Generalization and Universal Approximation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

An 84-Format Numeric Catalog with Bit-Exact Conformance Vectors: A Vendor-Neutral Reference for FP8, BF16, MXFP4, and Microscaling Formats (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Observability for Delegated Execution in Agentic AI Systems (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

FASE: Fast Adaptive Semantic Entropy for Code Quality (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Who Earns the Safety? Intervention-Aware Quantum Predictive Control with Safety Attribution (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Scaling Laws for Masked-Reconstruction Transformers on Single-Cell Transcriptomics (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Operationalising the Superficial Alignment Hypothesis via Task Complexity (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Geometry-Aware Uncertainty Quantification via Conformal Prediction on Manifolds (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

GraphER: An Efficient Graph-Based Enrichment and Reranking Method for Retrieval-Augmented Generation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

AHA-WAM:Asynchronous Horizon-Adaptive World-Action Modeling with Observation-Guided Context Routing (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

PTL-Diffusion: Manifold-Aware Diffusion with Periodic Terminal Laws (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Survey on Large Language Model-Based Game Agents (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

TQA-Bench: Evaluating LLMs for Multi-Table Question Answering (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

IDEQ -- Improving Diffusion Models for the Traveling Salesman Problem (TSP) by Leveraging the Structure of the Solution Space (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Can Global XAI Methods Reveal Injected Behaviours in LLMs? SHAP vs Rule Extraction vs RuleSHAP (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Sound and Complete Neurosymbolic Reasoning with LLM-Grounded Interpretations (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Geometric Theory of Cognition for Machine Intelligence (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Discovering heuristics in a complex SAT solver with large language models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

CLPO: Curriculum Learning meets Policy Optimization for LLM Reasoning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

MixReasoning: Switching Modes to Think (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

MAR:Multi-Agent Reflexion Improves Reasoning Abilities in LLMs (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

TempoBench: Evaluating Temporal Causal Reasoning in Large Language Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Label-Conditioned Cross-Modal Fusion for Adult-to-Pediatric ECG Transfer via Curriculum-Gated Contrastive Alignment (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Geometric Unification of Concept Learning with Concept Cones (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Understanding Benchmark Language Under Weakened Formal Semantics (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Projection and Quantisation: A Unifying View of Learning to Hash, from Random Projections to the RAG Era (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Correcting Mean Bias in Text Embeddings: A Refined Renormalization with Training-Free Improvements on MMTEB (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

MedVision: Benchmarking Quantitative Medical Image Analysis (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

← prev p.218/2265 next →