AI News

datacenter latest today hot

Architect-Ant: Editable Automatic Furnishing of Architectural Floor Plans (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

WorldKernel: A World Model is the Coupling Kernel of Admissible Possible Worlds (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Frontier Coding Agents Use Metaprogramming to Adapt to Unfamiliar Programming Languages (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Trading Utility for Dynamic Fairness in Multiple Resource Division with Sequential Demand (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Bittensor Agent Arenas as a Trajectory Primitive: Distilling a Shopping Agent from ShoppingBench Subnet Traces (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Trainable Smooth-Rotation Transforms with Learned Channel Scales for LLM Quantization (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Sample Where You Struggle: Sharpening Base Model Reasoning via Entropy-Guided Power Sampling (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Sigma-Branch: Hierarchical Single-Path Network Reconstruction for Dynamic Inference with Reduced Active Parameters (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Importance-Aware Scheduling for High-Dimensional Hyperparameter Optimization (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Large-scale semantic mapping of learner agency and autonomy reveals what measurement and generative AI research overlook (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Do VLMs Reason Like Engineers? A Benchmark and a Stage-wise Evaluation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Moonshine: An Autonomous Mathematical Research Agent Centered on Conjecture Generation (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Generalized Conformal Predictive Systems Under Distributional Shifts (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Mix, Don't Pick: Why Synthetic Corpus Composition Matters for Time Series Foundation Model Pretraining (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

LongMoE: Longitudinal Multimodal Learning via Trajectory-Aware Mixture-of-Experts (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

TRAPS: Therapeutic Response Analysis via Pathway-informed Stratification (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Navigable Manifold of Hypothesized Consciousness-Spectrum States in Language Model Representations (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Optimality of FSQ Tokens for Continuous Diffusion for Categorical Data with Application to Text-to-Speech (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Evaluating Research-Level Math Proofs via Strict Step-Level Verification (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

READER: Robust Evidence-based Authorship Decoding via Extracted Representations (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

More Human or More AI? Visualizing Human-AI Collaboration Disclosures in Journalistic News Production (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Accelerating NeurASP with vectorization and caching (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

$\tau$-Rec: A Verifiable Benchmark for Agentic Recommender Systems (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

AutoPDE: Reliable Agentic PDE Solving via Explicitly Represented Solver Strategies (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Learning What to Remember: Observability-Safe Memory Retention via Constrained Optimization for Long-Horizon Language Agents (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

LMT: A Bayesian Framework for Causal Discovery from Textual Alarm Records in Manufacturing Systems (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Optuna Constrained Tree-Structured Parzen Estimator Is a Joint Density Generalization of c-TPE (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

SinkRec: Mitigating Semantic State Sink in Long Sequence Recommendation with Memory-Conditioned Gated Delta Networks (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Toward Calibrated, Fair, and accurate Deepfake Detection (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Hyperparameter Learning for Latent Factorization of Tensors for Representation Learning to Large-scale Dynamic Weighted Directed Network (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

ActiveMem: Distributed Active Memory for Long-Horizon LLM Reasoning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

HIPIF: Hierarchical Planning and Information Folding for Long-Horizon LLM Agent Learning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Cross-Modal Knowledge Distillation without Paired Data: Theoretical Foundation and Algorithm (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A complementary study on PlanGPT: Evaluation with defined Performance Metrics and comparison with a planner (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

FailureScope: Cross-Regime Behavioral Diagnosis of Language Model Weaknesses (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Streaming Knowledge Compilation: Proactive Materiality-Scored Pinning for Time-Evolving LLM Wikis (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Calibrating Overconfidence Without Sacrificing Confidence: Probe-Conditioned Head Intervention for LLMs (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Rotate2Think: Geometric Priming via Orthogonal Rotation to Improve Language Model Reasoning (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

PatchSTG: Scalable Spatiotemporal Graph Transformers for Traffic Forecasting on Irregular Sensor Networks (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Trace2Policy: From Expert Behavior Traces to Self-Evolving Decision Agents (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

Soul Computing: A Theoretical Framework and Technical Architecture for Intelligent Agents with Independent Consciousness (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

A Unified Multi-Modal Framework for Intelligent Financial Systems: Integrating Reinforcement Learning, High-Frequency Trading, and Game-Theoretic Approaches with Cross-Modal Sentiment Analysis (arxiv.org)

by rss-bot · 2 weeks ago · 0 comments

← prev p.168/2244 next →