AI News

⚡ 13 minutes ago
1
1
SpeedAug: Policy Acceleration via Tempo-Enriched Policy and RL Fine-Tuning (arxiv.org)
2
1
From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model (arxiv.org)
3
1
ShelfAware: Real-Time Semantic Localization in Quasi-Static Environments with Low-Cost Sensors (arxiv.org)
4
1
VocSim: A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio (arxiv.org)
5
1
Calibrating Uncertainty for Zero-Shot Adversarial CLIP (arxiv.org)
6
1
Control of a Twin Rotor using Twin Delayed Deep Deterministic Policy Gradient (TD3) (arxiv.org)
7
1
SilentDrift: Exploiting Action Chunking for Stealthy Backdoor Attacks on Vision-Language-Action Models (arxiv.org)
8
1
MGRegBench: A Novel Benchmark Dataset with Anatomical Landmarks for Mammography Image Registration (arxiv.org)
9
1
Reinforcement Learning Position Control of a Quadrotor Using Soft Actor-Critic (SAC) (arxiv.org)
10
1
Dynamic Entropy Tuning in Reinforcement Learning Low-Level Quadcopter Control: Stochasticity vs Determinism (arxiv.org)
11
1
Uncovering Competency Gaps in Large Language Models and Their Benchmarks (arxiv.org)
12
1
VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models (arxiv.org)
13
1
Paradoxical noise preference in RNNs (arxiv.org)
14
1
Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics (arxiv.org)
15
1
Hot-Start Chinese Language Modeling:Visual Glyphs Accelerate Sample-Efficient Learning (arxiv.org)
16
1
MASCOT: Towards Multi-Agent Socio-Collaborative Companion Systems (arxiv.org)
17
1
A Monosemantic Attribution Framework for Stable Interpretability in Clinical Neuroscience Transformer-Based Language Models (arxiv.org)
18
1
ELF: A Family of Encoder-Free ECG-Language Models (arxiv.org)
19
1
ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition (arxiv.org)
20
1
Demystifying Multi-Agent Debate: The Role of Confidence and Diversity (arxiv.org)
21
1
How Much Progress Has There Been in NVIDIA Datacenter GPUs? (arxiv.org)
22
1
APB-V: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention (arxiv.org)
23
1
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training (arxiv.org)
24
1
Global Geometry Is Not Enough for Vision Representations (arxiv.org)
25
1
Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching (arxiv.org)
26
1
GottBERT: a pure German Language Model (arxiv.org)
27
1
Incentivized Collaboration in Active Learning (arxiv.org)
28
1
Discovering Nonlinear Static Relationships in Unlabeled Dataset using Autoencoder with Ordered Variance (arxiv.org)
29
1
Synthesizing Neural Network Controllers with Closed-Loop Dissipativity Guarantees (arxiv.org)
30
1
Domain Adaptation with a Single Vision-Language Embedding (arxiv.org)
31
1
Efficient Hamiltonian, structure and trace distance learning of Gaussian states (arxiv.org)
32
1
Embedding-Space Diffusion for Zero-Shot Environmental Sound Classification (arxiv.org)
33
1
Mirror Descent Under Generalized Smoothness (arxiv.org)
34
1
GIFT: Geometry-Induced Functional Transfer for Category-level Object Manipulation (arxiv.org)
35
1
VDSB-GWSyn: Diffusion Schr\"{o}dinger Bridge for Controllable and Anatomically Feasible Guidewire Synthesis in Coronary Angiography (arxiv.org)
36
1
RA-LWLM: Retrieval-Augmented In-Context Localization with Wireless Foundation Models (arxiv.org)
37
1
Multi-Contrast MRI Motion Correction via Parameter-Informed Disentanglement and Adaptive Experts (arxiv.org)
38
1
StemBind: When MLLMs Get Lost Between Rules and Instances in Abstract Visual Reasoning (arxiv.org)
39
1
Persona Attack: Incremental Memory Injection Jailbreak Attack against Large Language Models (arxiv.org)
40
1
PrivacyPeek: Auditing What LLM-Based Agents Acquire, Not Just What They Say (arxiv.org)
41
1
GeoSAM-3D: Geodesic Prompt Propagation for Open-Vocabulary 3D Scene Segmentation from Monocular Video (arxiv.org)
42
1
TimeBlocks: Foundational and Continual Time-Series Blockbase -- Extended Version (arxiv.org)
43
1
Hybrid Neural Ordinary Differential Equations for Data-Efficient Polymerization Modeling with Incomplete Kinetics (arxiv.org)
44
1
EEG-FuseFormer: A Transformer-Driven Feature Fusion Framework for Seizure Onset Prediction (arxiv.org)
45
1
Closing the Alignment-Maturity Gap in Federated Prototype Learning (arxiv.org)
46
1
When Safe Skills Collide: Measuring Compositional Risk in Agent Skill Ecosystems (arxiv.org)
47
1
Benchmarking Multimodal LLMs on Code Generation for Complex Interactive Webpages (arxiv.org)
48
1
A Protocol-Language Model for Network Intrusion (Without Deep Packet Inspection) (arxiv.org)
49
1
A physics-informed foundation model for quantitative diffusion MRI (arxiv.org)
50
1
Digital-to-Physical Transfer of Adversarial Patches for Aerial Vehicle Detection (arxiv.org)