AI News

1
1
Creating and Evaluating K-12 GenAI Assessment Graders Through Context Engineering (arxiv.org)
2
1
The AI Legal Specialist: A Juridically Autonomous Professional Profile for AI Governance (arxiv.org)
3
1
GeoDial: A Multimodal Conversational Tutoring Dataset for Geometry Problem-Solving with Visual Tutor Turns (arxiv.org)
4
1
AI SciBrief as a Gateway to Research: A Framework for Onboarding Students into New Research Areas (arxiv.org)
5
1
Automated reproducibility assessments in the social and behavioral sciences using large language models (arxiv.org)
6
1
Agents-K1: Towards Agent-native Knowledge Orchestration (arxiv.org)
7
1
EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery (arxiv.org)
8
1
Before You Think: System 0, AI-Mediated Cognition and Cognitive Colonization (arxiv.org)
9
1
Beyond Runtime Enforcement: Shield Synthesis as Defensibility Analysis for Adversarial Networks (arxiv.org)
10
1
AgentBeats: Agentifying Agent Assessment for Openness, Standardization, and Reproducibility (arxiv.org)
11
1
Reasoning as Pattern Matching: Shared Mechanisms in Human and LLM Everyday Reasoning (arxiv.org)
12
1
Multi-Agent Reinforcement Learning from Delayed Marketplace Feedback for Objective-Weight Adaptation in Three-Sided Dispatch (arxiv.org)
13
1
Eigenism: Ethics for a Human-AI Future (arxiv.org)
14
1
Versioned Late Materialization for Ultra-Long Sequence Training in Recommendation Systems at Scale (arxiv.org)
15
1
DCD: Domain-Oriented Design for Controlled Retrieval-Augmented Generation (arxiv.org)
16
1
Fusion Learning from Dynamic Functional Connectivity: Combining the Amplitude and Phase of fMRI Signals to Identify Brain Disorders (arxiv.org)
17
1
InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem (arxiv.org)
18
1
Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents (arxiv.org)
19
1
TokaMark: A Comprehensive Benchmark for MAST Tokamak Plasma Models (arxiv.org)
20
1
Fin-RATE: A Real-world Financial Analytics and Tracking Evaluation Benchmark for LLMs on SEC Filings (arxiv.org)
21
1
Standardized Methods and Recommendations for Green Federated Learning (arxiv.org)
22
1
VDE Bench: Evaluating The Capability of Image Editing Models to Modify Visual Documents (arxiv.org)
23
1
Language Model Circuits Are Sparse in the Neuron Basis (arxiv.org)
24
1
When Smaller Wins: Dual-Stage Distillation and Pareto-Guided Compression of Liquid Neural Networks for Edge Battery Prognostics (arxiv.org)
25
1
CuMA: Aligning LLMs with Sparse Cultural Values via Demographic-Aware Mixture of Adapters (arxiv.org)
26
1
Rarity-Gated Context Conditioning for Offline Imitation Learning-Based Maritime Anomaly Detection (arxiv.org)
27
1
Mining Architectural Quality Under Agentic AI Adoption: A Causal Study of Java Repositories (arxiv.org)
28
1
HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers (arxiv.org)
29
1
Cross-Modal Masked Compositional Concept Modeling for Enhancing Visio-Linguistic Compositionality (arxiv.org)
30
1
Different Layers, Different Manifolds: Module-Wise Weight-Space Geometry in Transformer Optimization (arxiv.org)
31
1
Humor Style Drives Laughter, Topic Shapes Acceptability: Evaluating Bilingual Personal and Political Robot-Delivered AI Jokes (arxiv.org)
32
1
Towards Personalized Federated Learning for Dysarthric Speech Recognition (arxiv.org)
33
1
ComAct: Reframing Professional Software Manipulation via COM-as-Action Paradigm (arxiv.org)
34
1
Once-for-All: Scalable Simultaneous Forecasting via Equilibrium State Estimation (arxiv.org)
35
1
Decoding Insect Song: A Multitask Semisupervised Orthoptera Bioacoustic Classifier (arxiv.org)
36
1
ReSET: Accurate Latency-Critical NVFP4 Reasoning via Step-Aware Temperature Scaling (arxiv.org)
37
1
Proprioceptive-visual correspondence enables self-other distinction in humanoid robots (arxiv.org)
38
1
Transformer-Guided Graph Attention for Direct Cardiac Mesh Reconstruction: A Structural Digital Twin Framework (arxiv.org)
39
1
Ex-Omni: Enabling 3D Facial Animation Generation for Omni-modal Large Language Models (arxiv.org)
40
1
Modern analog computing for solving differential and matrix equations (arxiv.org)
41
1
MemRefine: LLM-Guided Compression for Long-Term Agent Memory (arxiv.org)
42
1
NTS-CoT: Mitigating Hallucinations in LLM-based News Timeline Summarization with Chain-of-Thought Reasoning (arxiv.org)
43
1
Iterative Visual Thinking: Teaching Vision-Language Models Spatial Self-Correction through Visual Feedback (arxiv.org)
44
1
Cascade Classification of Dermoscopic Images of Skin Neoplasms with Controllable Sensitivity and External Clinical Validation (arxiv.org)
45
1
MiniPIC: Flexible Position-Independent Caching in <100LOC (arxiv.org)
46
1
Multiagent Protocols with Aggregated Confidence Signals (arxiv.org)
47
1
A Three-Layer Framework for AI in Scientific Discovery (arxiv.org)
48
1
Is It You or Your Environment? A Bayesian Inference Framework for Genomically-Anchored Personalized Physiological Interpretation (arxiv.org)
49
1
Uncertainty-Aware Hybrid Retrieval for Long-Document RAG (arxiv.org)
50
1
CloudCons: A Comprehensive End-to-End Benchmark for Cloud Resource Consolidation (arxiv.org)