AI News

1
1
DetailMaster: Can Your Text-to-Image Model Handle Long Prompts? (arxiv.org)
2
1
Simulating Macroeconomic Expectations in Survey Experiments with LLM-based Economic Agents (arxiv.org)
3
1
GFlowGR: Fine-tuning Generative Recommendation Frameworks with Generative Flow Networks (arxiv.org)
4
1
AblationBench: Evaluating Automated Planning of Ablations in Empirical AI Research (arxiv.org)
5
1
FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving (arxiv.org)
6
1
From Graph Retrieval to Schema Realization: Counterfactual Validation for Text-to-SPARQL over Heterogeneous Knowledge Graphs (arxiv.org)
7
1
FastSLM: Hierarchical Temporal Abstraction for Efficient Long-Form Speech Adaptation (arxiv.org)
8
1
Characterizing Web Search in The Age of Generative AI (arxiv.org)
9
1
Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations (arxiv.org)
10
1
Optimizing Diversity and Quality through Base-Aligned Model Collaboration (arxiv.org)
11
1
A Theoretical Framework for Statistical Evaluability of Generative Models (arxiv.org)
12
1
Consistent Diffusion Language Models (arxiv.org)
13
1
Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities (arxiv.org)
14
1
Unsat Core Prediction through Polarity-Aware Representation Learning over Clause-Literal Hypergraphs (arxiv.org)
15
1
NILC: Discovering New Intents with LLM-assisted Clustering (arxiv.org)
16
1
RoboBenchMart: Benchmarking Robots in Retail Environment (arxiv.org)
17
1
Latent Reasoning in TRMs is Secretly a Policy Improvement Operator (arxiv.org)
18
1
Evaluating the Performance of Deep Learning Models in Whole-body Dynamic 3D Posture Prediction During Load-reaching Activities (arxiv.org)
19
1
Latent Collaboration in Multi-Agent Systems (arxiv.org)
20
1
SpeedAug: Policy Acceleration via Tempo-Enriched Policy and RL Fine-Tuning (arxiv.org)
21
1
From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model (arxiv.org)
22
1
ShelfAware: Real-Time Semantic Localization in Quasi-Static Environments with Low-Cost Sensors (arxiv.org)
23
1
VocSim: A Training-free Benchmark for Zero-shot Content Identity in Single-source Audio (arxiv.org)
24
1
Calibrating Uncertainty for Zero-Shot Adversarial CLIP (arxiv.org)
25
1
Control of a Twin Rotor using Twin Delayed Deep Deterministic Policy Gradient (TD3) (arxiv.org)
26
1
SilentDrift: Exploiting Action Chunking for Stealthy Backdoor Attacks on Vision-Language-Action Models (arxiv.org)
27
1
MGRegBench: A Novel Benchmark Dataset with Anatomical Landmarks for Mammography Image Registration (arxiv.org)
28
1
Reinforcement Learning Position Control of a Quadrotor Using Soft Actor-Critic (SAC) (arxiv.org)
29
1
Dynamic Entropy Tuning in Reinforcement Learning Low-Level Quadcopter Control: Stochasticity vs Determinism (arxiv.org)
30
1
Uncovering Competency Gaps in Large Language Models and Their Benchmarks (arxiv.org)
31
1
VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models (arxiv.org)
32
1
Paradoxical noise preference in RNNs (arxiv.org)
33
1
Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics (arxiv.org)
34
1
Hot-Start Chinese Language Modeling: Visual Glyphs Accelerate Sample-Efficient Learning (arxiv.org)
35
1
MASCOT: Towards Multi-Agent Socio-Collaborative Companion Systems (arxiv.org)
36
1
A Monosemantic Attribution Framework for Stable Interpretability in Clinical Neuroscience Transformer-Based Language Models (arxiv.org)
37
1
ELF: A Family of Encoder-Free ECG-Language Models (arxiv.org)
38
1
ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition (arxiv.org)
39
1
Demystifying Multi-Agent Debate: The Role of Confidence and Diversity (arxiv.org)
40
1
How Much Progress Has There Been in NVIDIA Datacenter GPUs? (arxiv.org)
41
1
APB-V: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention (arxiv.org)
42
1
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training (arxiv.org)
43
1
Global Geometry Is Not Enough for Vision Representations (arxiv.org)
44
1
Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching (arxiv.org)
45
1
GottBERT: a pure German Language Model (arxiv.org)
46
1
Incentivized Collaboration in Active Learning (arxiv.org)
47
1
Discovering Nonlinear Static Relationships in Unlabeled Dataset using Autoencoder with Ordered Variance (arxiv.org)
48
1
Synthesizing Neural Network Controllers with Closed-Loop Dissipativity Guarantees (arxiv.org)
49
1
Domain Adaptation with a Single Vision-Language Embedding (arxiv.org)
50
1
Efficient Hamiltonian, structure and trace distance learning of Gaussian states (arxiv.org)