AI News

⚡ 2 minutes ago
1
1
Architect-Ant: Editable Automatic Furnishing of Architectural Floor Plans (arxiv.org)
2
1
Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models (arxiv.org)
3
1
WorldKernel: A World Model is the Coupling Kernel of Admissible Possible Worlds (arxiv.org)
4
1
Frontier Coding Agents Use Metaprogramming to Adapt to Unfamiliar Programming Languages (arxiv.org)
5
1
Trading Utility for Dynamic Fairness in Multiple Resource Division with Sequential Demand (arxiv.org)
6
1
Bittensor Agent Arenas as a Trajectory Primitive: Distilling a Shopping Agent from ShoppingBench Subnet Traces (arxiv.org)
7
1
Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment (arxiv.org)
8
1
Trainable Smooth-Rotation Transforms with Learned Channel Scales for LLM Quantization (arxiv.org)
9
1
Sample Where You Struggle: Sharpening Base Model Reasoning via Entropy-Guided Power Sampling (arxiv.org)
10
1
Sigma-Branch: Hierarchical Single-Path Network Reconstruction for Dynamic Inference with Reduced Active Parameters (arxiv.org)
11
1
Importance-Aware Scheduling for High-Dimensional Hyperparameter Optimization (arxiv.org)
12
1
Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution (arxiv.org)
13
1
Large-scale semantic mapping of learner agency and autonomy reveals what measurement and generative AI research overlook (arxiv.org)
14
1
Do VLMs Reason Like Engineers? A Benchmark and a Stage-wise Evaluation (arxiv.org)
15
1
Moonshine: An Autonomous Mathematical Research Agent Centered on Conjecture Generation (arxiv.org)
16
1
Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders (arxiv.org)
17
1
Generalized Conformal Predictive Systems Under Distributional Shifts (arxiv.org)
18
1
Mix, Don't Pick: Why Synthetic Corpus Composition Matters for Time Series Foundation Model Pretraining (arxiv.org)
19
1
LongMoE: Longitudinal Multimodal Learning via Trajectory-Aware Mixture-of-Experts (arxiv.org)
20
1
TRAPS: Therapeutic Response Analysis via Pathway-informed Stratification (arxiv.org)
21
1
A Navigable Manifold of Hypothesized Consciousness-Spectrum States in Language Model Representations (arxiv.org)
22
1
Optimality of FSQ Tokens for Continuous Diffusion for Categorical Data with Application to Text-to-Speech (arxiv.org)
23
1
Evaluating Research-Level Math Proofs via Strict Step-Level Verification (arxiv.org)
24
1
READER: Robust Evidence-based Authorship Decoding via Extracted Representations (arxiv.org)
25
1
More Human or More AI? Visualizing Human-AI Collaboration Disclosures in Journalistic News Production (arxiv.org)
26
1
Accelerating NeurASP with vectorization and caching (arxiv.org)
27
1
$\tau$-Rec: A Verifiable Benchmark for Agentic Recommender Systems (arxiv.org)
28
1
AutoPDE: Reliable Agentic PDE Solving via Explicitly Represented Solver Strategies (arxiv.org)
29
1
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment (arxiv.org)
30
1
When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models (arxiv.org)
31
1
Learning What to Remember: Observability-Safe Memory Retention via Constrained Optimization for Long-Horizon Language Agents (arxiv.org)
32
1
LMT: A Bayesian Framework for Causal Discovery from Textual Alarm Records in Manufacturing Systems (arxiv.org)
33
1
Optuna Constrained Tree-Structured Parzen Estimator Is a Joint Density Generalization of c-TPE (arxiv.org)
34
1
SinkRec: Mitigating Semantic State Sink in Long Sequence Recommendation with Memory-Conditioned Gated Delta Networks (arxiv.org)
35
1
Toward Calibrated, Fair, and accurate Deepfake Detection (arxiv.org)
36
1
Hyperparameter Learning for Latent Factorization of Tensors for Representation Learning to Large-scale Dynamic Weighted Directed Network (arxiv.org)
37
1
One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA (arxiv.org)
38
1
ActiveMem: Distributed Active Memory for Long-Horizon LLM Reasoning (arxiv.org)
39
1
HIPIF: Hierarchical Planning and Information Folding for Long-Horizon LLM Agent Learning (arxiv.org)
40
1
Cross-Modal Knowledge Distillation without Paired Data: Theoretical Foundation and Algorithm (arxiv.org)
41
1
A complementary study on PlanGPT: Evaluation with defined Performance Metrics and comparison with a planner (arxiv.org)
42
1
FailureScope: Cross-Regime Behavioral Diagnosis of Language Model Weaknesses (arxiv.org)
43
1
Streaming Knowledge Compilation: Proactive Materiality-Scored Pinning for Time-Evolving LLM Wikis (arxiv.org)
44
1
Calibrating Overconfidence Without Sacrificing Confidence: Probe-Conditioned Head Intervention for LLMs (arxiv.org)
45
1
Rotate2Think: Geometric Priming via Orthogonal Rotation to Improve Language Model Reasoning (arxiv.org)
46
1
PatchSTG: Scalable Spatiotemporal Graph Transformers for Traffic Forecasting on Irregular Sensor Networks (arxiv.org)
47
1
ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics (arxiv.org)
48
1
Trace2Policy: From Expert Behavior Traces to Self-Evolving Decision Agents (arxiv.org)
49
1
Soul Computing: A Theoretical Framework and Technical Architecture for Intelligent Agents with Independent Consciousness (arxiv.org)
50
1
A Unified Multi-Modal Framework for Intelligent Financial Systems: Integrating Reinforcement Learning, High-Frequency Trading, and Game-Theoretic Approaches with Cross-Modal Sentiment Analysis (arxiv.org)