The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality

Development and Validation of a Revised Multidimensional Digital Health Literacy Scale: Secondary Analysis Using Cross-Sectional Data From the 2022 GetCheckedOnline Community Survey In British Columbia, Canada

Background: Digital technologies are reshaping health care, making digital health literacy (DHL) a critical competency for navigating online health information. Although widely conceived and measured

Cost-Utility Analysis and Value-Based Pricing of Digital Therapeutics for Pulmonary Rehabilitation in Chronic Respiratory Disease: Economic Evaluation Based on a Randomized Controlled Trial

Background: Pulmonary rehabilitation, a nonpharmacological treatment for chronic respiratory diseases, is underused due to limited access and time constraints. In a randomized controlled trial, the

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

arXiv:2512.02551v2 Announce Type: replace-cross Abstract: In this paper, we propose CUDA-L2, a system that combines large language models (LLMs) and reinforcement learning (RL) to automatically

MiniScope: A Least Privilege Framework for Authorizing Tool Calling Agents

arXiv:2512.11147v1 Announce Type: cross Abstract: Tool calling agents are an emerging paradigm in LLM deployment, with major platforms such as ChatGPT, Claude, and Gemini adding

Reducing Fragmentation and Starvation in GPU Clusters through Dynamic Multi-Objective Scheduling

arXiv:2512.10980v1 Announce Type: cross Abstract: GPU clusters have become essential for training and deploying modern AI systems, yet real deployments continue to report average utilization