AI News

datacenter latest today hot

TianJi-Environ: An Autonomous AI Scientist for Atmospheric Environmental Research (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual Vision-Language Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Cross-View Urban Traffic Dataset: Drone-Supervised Ground Truth for Monocular Bird's-Eye View Localization (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Scaling Decision-Focused Learning to Large Problems with Lagrangian Decomposition (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Active Flow Expansion for Out-of-Distribution Discovery: from Theory to Molecules (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Knowledge Graphs and Reasoning LLMs for Finding Simple Yet Effective Transcriptomic Perturbation Predictors (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Diffuse AI Control on Fuzzy Tasks (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Cheap Reward Hacking Detection (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Synthetic but Not Realistic: The Evaluation Challenge in Generative Modelling for Structured Electronic Medical Records (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Quantum-Enhanced Similarity Measures for Polarimetric Materials Classification (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Beyond Point Estimates: Benchmarking Uncertainty Quantification Methods on the AION-1 Astronomical Foundation Model (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Memetic Capture: A Pluralistic Policy Framework for Governing AI-Driven Cultural Disempowerment (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

SLMJury: Can Small Language Models Judge as Well as Large Ones? (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

C$^3$ache: Accelerating World Action Models with Cross Inference Chunk Cache (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

The ACUTE Protocol: Operationalizing Language Model Activations for Better Calibration, Utility, and Trust (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Beyond English benchmarks: clinical llm evaluation in Brazilian Portuguese (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Model Multiplicity for Adversarial Detection in Small Language Model Training on Edge Devices (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

The Last Visible Pixel: Probing Fine-Scale Perception in Vision-Language Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

The Cross-Architecture Substrate: A Domain-Transcendent, Calibration-Surviving Geometric Invariant of Modern Vision Encoders (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Generalized Rank-based Evaluation for Knowledge Graph Completion: Perspectives, Framework, and Analyses (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

PROBE-Web: An Interactive System for Probing Evaluation Landscapes of Knowledge Graph Completion Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

From Hazard Functions to Language Space: Cox-Supervised Distillation of Survival Risk into a Large Language Model (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Self-Consistent Generative Paths via Admissible Random Variational Transport (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

From inverse problems to neural operators: prediction, mechanism, and generalization of data-driven models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Online Learning with Recency: Algorithms for Sliding-window Streaming Multi-armed Bandits (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

LEAF: A Learning-Enabled ADMM Framework for Accelerated Convex Optimization (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Structural Grid Descriptors Predict Within-Task Solver Success on ARC-AGI (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

TRIAGE: Dialectical Reasoning for Explainable Risk Prediction on Irregularly Sampled Medical Time Series with LLMs (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

DynaCF: Mitigating Shortcut Learning in Reward Models via Dynamic Counterfactual Sensitivity (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Decoy-Calibrated Failure Audits for Language Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Larch: Learned Query Optimization for Semantic Predicates (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Decoupling Semantics and Logic: A Training-Free Coarse-to-Fine Pipeline for Video Retrieval-Augmented Generation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Illusions of the Gold Standard: A Large-scale Analysis of Human Evaluation Protocols for Long-form Text Generation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

POISE: Position-Aware Undetectable Skill Injection on LLM Agents (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

From `May' to `Is': Certainty Distortion in Language Model Rewriting (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

RecurGuard: Runtime Monitoring for Reasoning-Token Consumption Attacks (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Auditable Graph-Guided Root Cause Analysis for Kubernetes Incidents (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

HARBOR: A Harness Framework for Agentic Robot Reinforcement Learning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Beyond Convolution: Advancing Hypergraph Neural Networks with Hypergraph U-Nets (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Stage-1 Controls the Entropy Regime, Not the Outcome (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

OnlyDense: Reduced-Order Modeling for Lagrangian simulation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Unifying Lens on Reward Uncertainty in RLHF (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

SHIELD-IDS: Structurally Heterogeneous Ensemble with Integrated Layered Defense for Intrusion Detection Systems (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Steer Where It Matters: Token-Level Visual-Sensitivity Steering for LVLMs Hallucination Mitigation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

SC3: The Multi-Solvent Solubility Challenge and Benchmark (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

QDS-SNN: Energy-efficient Quantum Deeply-Supervised Spiking Neural Network Algorithm for Traffic Sign Recognition (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

What neurosurgeons need to see: synthetic intra-operative MRI from ultrasound for brain-shift compensation in brain tumour surgery (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Reconstructing Synthetic SDO/AIA 193 A EUV Images from He I 10830 A Observations with Diffusion Model Translator (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

FiberTune: Preserving Action-Fiber Visual Residuals in Vision-Language-Action Fine-Tuning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Latent Diffusion Policy: Shaping Latent Spaces for Diffusion-Based Robotic Manipulation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

← prev p.202/2262 next →