AI News

Failures Reveal What Metrics Miss: An Evidence-Driven Agent for Recursive Refinement of ECG Classifiers (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

PANOPTICON: A PII-Based Assemblage of Naturalistic Output Tokens for Investigating Privacy Leakage Within LLM Context Window (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

The Scaffold Effect in Coding Agents: Harness Choice as a Hidden Variable in Coding-Agent Evaluation (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

TriSP: Tri-Signal Structured Pruning for Large Language Models (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

AI-Assisted Causal Inference and Mediation Analyses of Environmental and Psychosocial Determinants of Subjective Cognitive Difficulties in the All of Us Research Program (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Plato-Bio: verification-first biological novelty screening with temporal rediscovery and structural benchmarks (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Traceable LLM Reasoning for Fake-Order Fraud Detection (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Loss-Aware Feature-Map Pruning in Convolutional Neural Networks Using Multi-Armed Bandits (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Falsifiable Commitment Planning for Self-Correcting Web Agents (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Retrieval-Augmented Generation of Ontologies from Relational Databases (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

CSV-Decode: Certifiable Sub-Vocabulary Decoding for Efficient Large Language Model Inference (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

TLRNet: Estimating Individual Treatment Effect based on Local Information and Single Learner Structure (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Debiased Machine Learning: Identification, Estimation, and Shape Constraints (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

MIITA: Memory-Induced Inference-Time Adaptation for Continual Learning with Small Language Models (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

EmotionAI: A Privacy-Preserving Computational Intelligence Pipeline for Speech-Emotion-Grounded Conversational Analysis (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Multi-Objective Structured Pruning of LLMs for Latency and Model Size Optimization (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Distributional Split Criteria for Random Forests: Extensions, Shrinkage, and the Robustness of Mean Splitting (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Lions and Muons: Optimization via Stochastic Frank-Wolfe under Heavy-Tailed Noise (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Robust Conformalized Selection with Noisy Responses (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Modeling Memory-Dependent Reliability of LLMs: A Hidden Markov Model (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Amortized Bayesian Causal Discovery of Extended Factor Graphs (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Two-Timescale Hierarchical Reinforcement Learning for Resilient Operations (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Agent-UCT: Upper Confidence Bounds Applied to Trees for Agentic Workflow Optimization with Cost-Awareness (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Covariance-Boosted Gaussian Processes for Spatiotemporal Irregularities (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

GNN-based Multi-Agent Control of Traffic Shockwaves in Sparse Vehicular Ad-hoc Networks (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Lexical discovery in unknown environments orchestrated by Large Language Models (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Scoping Review of AI, Metrology, and ESG in the Semiconductor Sector: Implications for Safe and Sustainable by Design (SSbD) (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

MM-ShiftKV: Decode-Aware Prefill-Stage KV Selection for Multimodal Large Language Models (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

MIME: Multimodal Interactive Motion Encoder (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

TRUAV: Distributed Multi-Agent Reinforcement Learning for Trajectory Planning and Routing Enhancement in UAV-Aided IoT-Enabled VANETs (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

What CLIP Knows but Cannot Say: Recovering Negation from Frozen Intermediate Features (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Understanding Human-like Solutions in Combinatorial Optimization via Learning and Search (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

TriShieldRAG: A Three-Ring Defense-in-Depth Framework Against Knowledge Corruption in Retrieval-Augmented Generation (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

RoleMix: Unifying Sequential and Non-Sequential Features via Semantic Tokenization for Post-Click Conversion Rate Prediction (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

HiLLTS: Zero-Shot Hierarchical LLM-Guided Traffic Signal Control for Sustainable Transportation (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Decision trees, Frobenius traces, and Weierstrass coefficients of elliptic curves (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Source-Aware Reranking for Retrieval-Augmented Generation: A Reliability Prior Approach (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

ARdena: Scenario-driven control of real-time LLM agents (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

FedSLIM: Privacy-Preserving Federated MDL-Based Descriptive Pattern Mining Across Data Silos (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

A Fixed-Effects Causal Forest for Staggered Adoption, with an Application to Medicaid Expansion (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Epistemic Norms for AI Safety and Alignment Research (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

ADAGE: A Language-Agnostic Pipeline for Analogical Reasoning Evaluation (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

GeoDecider: An Evidence-Grounded Agent for Geological Interpretation via Deliberative Reasoning (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

From RLVR to RLSVR: Task Transformation Induces Self-Verifiable Rewards for Open-Ended LLM Self-Improvement (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Imprompt: A Language Framework for Prompt Programming (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Realizing Scaling Laws in Recommender Systems: A Foundation-Expert Paradigm for Hyperscale Model Deployment (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

HydroAgent: Formalizing Forecaster Expertise into Skill-Orchestrated Flood Forecasting Workflows (arxiv.org)

by rss-bot · 17 hours ago · 0 comments

Nanbeige4.2-3B: Unlocking Agentic Capabilities in a Compact Model (arxiv.org)

by rss-bot · 17 hours ago · 0 comments
