Information Physics of Intelligence: Unifying Logical Depth and Entropy under Thermodynamic Constraints

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

arXiv:2512.02551v2 Announce Type: replace-cross Abstract: In this paper, we propose CUDA-L2, a system that combines large language models (LLMs) and reinforcement learning (RL) to automatically

MiniScope: A Least Privilege Framework for Authorizing Tool Calling Agents

arXiv:2512.11147v1 Announce Type: cross Abstract: Tool calling agents are an emerging paradigm in LLM deployment, with major platforms such as ChatGPT, Claude, and Gemini adding

Reducing Fragmentation and Starvation in GPU Clusters through Dynamic Multi-Objective Scheduling

arXiv:2512.10980v1 Announce Type: cross Abstract: GPU clusters have become essential for training and deploying modern AI systems, yet real deployments continue to report average utilization

KathDB: Explainable Multimodal Database Management System with Human-AI Collaboration

arXiv:2512.11067v1 Announce Type: cross Abstract: Traditional DBMSs execute user- or application-provided SQL queries over relational data with strong semantic guarantees and advanced query optimization, but

Agile Flight Emerges from Multi-Agent Competitive Racing

arXiv:2512.11781v1 Announce Type: cross Abstract: Through multi-agent competition and the sparse high-level objective of winning a race, we find that both agile flight (e.g., high-speed