AI News

⚡ just now
1
1
Evaluation of ML Resource Utilization Requires Model Life Cycle Assessment (arxiv.org)
2
1
Trait-space Monitoring for Emergent Misalignment During Supervised Finetuning (arxiv.org)
3
1
Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences (arxiv.org)
4
1
Semantic Cache Distillation: Efficient State Transfer via Reuse and Selective Patching (arxiv.org)
5
1
DOME: Learning Transferable Domain Variables from Sparse Supervision for Test-Time Adaptation (arxiv.org)
6
1
LogNEO: A GPT-Neo Reinforcement Learning Framework for Accurate Real-Time Log Anomaly Detection (arxiv.org)
7
1
Cost-Aware Speculative Execution for LLM-Agent Workflows: An Integrated Five-Dimension Method (arxiv.org)
8
1
Does Persona Make LLMs K-pop Fans? A Pilot Study of LLM-Based Online Concert Audience Agents (arxiv.org)
9
1
Agentic multi-fidelity learning of quasiparticle and excitonic properties (arxiv.org)
10
1
Learning Transfers: Kan Extensions for Neural Invariants (arxiv.org)
11
1
Sequential statistical inference for Large Language Models: Representation, validity, and monitoring (arxiv.org)
12
1
Finite Certificates for In-Context Determinacy and a Threshold Theory of Emergence in Language Models (arxiv.org)
13
1
Airport Terminal Passenger Queue Forecasting for Departure Gates and Security Checkpoints (arxiv.org)
14
1
HASA: Subnet Allocation for Compute-Constrained Model-Heterogeneous Federated Learning (arxiv.org)
15
1
Test-Time Adaptive Composition for Machine Learning as a Service (MLaaS) in IoT Environments (arxiv.org)
16
1
Cherry-pick Override: Unsafe Directional Commitment in LLM Judges under Mixed Evidence (arxiv.org)
17
1
Beyond Pass/Fail: Using Process Mining to Understand How LLMs Resist (and Fail) Red Team Attacks (arxiv.org)
18
1
Beyond Accuracy: Interpreting Topic Representation in Suicide Ideation Detection Models (arxiv.org)
19
1
FineGen: A VLM-based Multi-Agent Framework for Fine-Grained Image-Text Dataset Construction (arxiv.org)
20
1
Position: Anthropomorphic Misalignment Research Needs Stronger Evidence (arxiv.org)
21
1
From Coarse to Fine: Managing Temporal Granularity in Spatio-Temporal Data for Fine-Grained Traffic Prediction (arxiv.org)
22
1
Graph Neural Networks for Predicting Solvability of Finite Groups (arxiv.org)
23
1
ScaleSweep: Accurate NVFP4 Post-Training Quantization of LLMs via Block Scale Initialization (arxiv.org)
24
1
Query Lens: Interpreting Sparse Key-Value Features with Indirect Effects (arxiv.org)
25
1
Item Response Scaling Laws: A Measurement Theory Approach for Efficient and Generalizable Neural Scaling Estimation (arxiv.org)
26
1
Beyond Neural Collapse: Task-Intrinsic Geometry Governs Neural Representations in Modular Arithmetic (arxiv.org)
27
1
In-Context Learning for Latent Space Bayesian Optimization (arxiv.org)
28
1
On Choosing the $\mu$ Parameter in Gaussian Differential Privacy (arxiv.org)
29
1
BSTabDiff: Block-Subunit Diffusion Priors for High-Dimensional Tabular Data Generation (arxiv.org)
30
1
Asymptotic Optimality of Thompson Sampling for Risk-Averse Bandits with Sub-Gaussian Rewards (arxiv.org)
31
1
Data augmented bootstrap: Unifying confidence interval construction by approximate invariance (arxiv.org)
32
1
Understanding Quantization-Aware Training: Gradients at Quantized Weights Bias to the Low-Loss Basin (arxiv.org)
33
1
Capability-Aligned Hierarchical Learning for Tool-Augmented LLMs (arxiv.org)
34
1
Backward Coherence and Hidden-State Stability in Recurrent Neural Networks: A Quasi-Reverse-Martingale Theory (arxiv.org)
35
1
sGPO: Trading Inference FLOPs for Training Efficiency in RLVR (arxiv.org)
36
1
Intrinsic Selection and Particle Resampling for Inference-Time Scaling Beyond Domain Verifiability (arxiv.org)
37
1
TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution (arxiv.org)
38
1
CP-factorization for high dimensional tensor time series and double projection iterations (arxiv.org)
39
1
Querying Counterfactuals on Tissue Graphs with Supervised Disentanglement (arxiv.org)
40
1
Nonparametric undirected graphical model selection using diffusion models (arxiv.org)
41
1
When Are Neural Interaction Discoveries Real? Identifiability, Recoverability, and a Pre-Fit Diagnostic (arxiv.org)
42
1
The Spectral Dynamics and Noise Geometry of Muon (arxiv.org)
43
1
A Switching Beamformer for Highly Non-Stationary Environments (arxiv.org)
44
1
How Deep Are Deep GPs, Really? A Sharp Threshold and a Non-Gaussian Limit for Compositional GPs (arxiv.org)
45
1
Assessing model calibration with boosting trees (arxiv.org)
46
1
Inference for High-Dimensional Sparse Spectral Precision Matrices (arxiv.org)
47
1
Partially Performative Prediction (arxiv.org)
48
1
Instrumented data for causal scientific machine learning (arxiv.org)
49
1
Large-scale empirical tuning and comparison of default optimizers for variational inference (arxiv.org)
50
1
A Framework for Evaluating and Benchmarking Concept Drift Detection Methods (arxiv.org)