AI News

1
1
AutoEval Done Right: Using Synthetic Data for Model Evaluation (arxiv.org)
2
1
Construction of Historical Knowledge Graphs Based on BERT and Graph Neural Networks (arxiv.org)
3
1
SECUREVENT: Hybrid AI/ML Security Monitoring for Distributed Event-Based Systems (arxiv.org)
4
1
THRD: A Training-Free Multi-Turn Defense Framework for Jailbreak Attacks on Large Language Models (arxiv.org)
5
1
Argument Collapse: LLMs Flatten Long-Form Public Debate (arxiv.org)
6
1
RoboBenchMart: Benchmarking Robots in Retail Environment (arxiv.org)
7
1
Latent Reasoning in TRMs is Secretly a Policy Improvement Operator (arxiv.org)
8
1
Evaluating the Performance of Deep Learning Models in Whole-body Dynamic 3D Posture Prediction During Load-reaching Activities (arxiv.org)
9
1
T-POP: Test-Time Personalization with Online Preference Feedback (arxiv.org)
10
1
Verifying Meta-Awareness via Predictive Rewards in Reasoning Models (arxiv.org)
11
1
Latent Collaboration in Multi-Agent Systems (arxiv.org)
12
1
SpeedAug: Policy Acceleration via Tempo-Enriched Policy and RL Fine-Tuning (arxiv.org)
13
1
Retrieval-aligned Tabular Foundation Models Enable Robust Clinical Risk Prediction in Electronic Health Records Under Real-world Constraints (arxiv.org)
14
1
Neural Low-Discrepancy Sequences (arxiv.org)
15
1
From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model (arxiv.org)
16
1
ShelfAware: Real-Time Semantic Localization in Quasi-Static Environments with Low-Cost Sensors (arxiv.org)
17
1
VocSim: A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio (arxiv.org)
18
1
Perspective on Bias in Biomedical AI: Preventing Downstream Healthcare Disparities (arxiv.org)
19
1
Calibrating Uncertainty for Zero-Shot Adversarial CLIP (arxiv.org)
20
1
Control of a Twin Rotor using Twin Delayed Deep Deterministic Policy Gradient (TD3) (arxiv.org)
21
1
SilentDrift: Exploiting Action Chunking for Stealthy Backdoor Attacks on Vision-Language-Action Models (arxiv.org)
22
1
MGRegBench: A Novel Benchmark Dataset with Anatomical Landmarks for Mammography Image Registration (arxiv.org)
23
1
RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography (arxiv.org)
24
1
Reinforcement Learning Position Control of a Quadrotor Using Soft Actor-Critic (SAC) (arxiv.org)
25
1
Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization (arxiv.org)
26
1
Dynamic Entropy Tuning in Reinforcement Learning Low-Level Quadcopter Control: Stochasticity vs Determinism (arxiv.org)
27
1
Uncovering Competency Gaps in Large Language Models and Their Benchmarks (arxiv.org)
28
1
VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models (arxiv.org)
29
1
ANDRE: An Attention-based Neuro-symbolic Differentiable Rule Extractor for Inductive Logic Programming (arxiv.org)
30
1
Paradoxical noise preference in RNNs (arxiv.org)
31
1
Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics (arxiv.org)
32
1
Hot-Start Chinese Language Modeling: Visual Glyphs Accelerate Sample-Efficient Learning (arxiv.org)
33
1
MASCOT: Towards Multi-Agent Socio-Collaborative Companion Systems (arxiv.org)
34
1
A Monosemantic Attribution Framework for Stable Interpretability in Clinical Neuroscience Transformer-Based Language Models (arxiv.org)
35
1
ELF: A Family of Encoder-Free ECG-Language Models (arxiv.org)
36
1
ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition (arxiv.org)
37
1
Demystifying Multi-Agent Debate: The Role of Confidence and Diversity (arxiv.org)
38
1
How Much Progress Has There Been in NVIDIA Datacenter GPUs? (arxiv.org)
39
1
APB-V: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention (arxiv.org)
40
1
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training (arxiv.org)
41
1
Global Geometry Is Not Enough for Vision Representations (arxiv.org)
42
1
Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching (arxiv.org)
43
1
GottBERT: a pure German Language Model (arxiv.org)
44
1
Incentivized Collaboration in Active Learning (arxiv.org)
45
1
Discovering Nonlinear Static Relationships in Unlabeled Dataset using Autoencoder with Ordered Variance (arxiv.org)
46
1
Synthesizing Neural Network Controllers with Closed-Loop Dissipativity Guarantees (arxiv.org)
47
1
Domain Adaptation with a Single Vision-Language Embedding (arxiv.org)
48
1
Efficient Hamiltonian, structure and trace distance learning of Gaussian states (arxiv.org)
49
1
Embedding-Space Diffusion for Zero-Shot Environmental Sound Classification (arxiv.org)
50
1
UrbanFusion: Stochastic Multimodal Fusion for Contrastive Learning of Robust Spatial Representations (arxiv.org)