The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference

Depression subtype classification from social media posts: few-shot prompting vs. fine-tuning of large language models

Background: Social media provides timely proxy signals of mental health, but reliable tweet-level classification of depression subtypes remains challenging due to short, noisy text, overlapping symptomatology,

Mobile Health App Attitudes and Adoption Among Oncology Providers: Cross-Sectional National Survey

Background: Mobile health (mHealth) apps can address health inequities and enhance access to care for individuals with immunocompromising conditions. Although hundreds of oncology apps exist,

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

arXiv:2603.19312v1 Announce Type: cross Abstract: Joint Embedding Predictive Architectures (JEPAs) offer a compelling framework for learning world models in compact latent spaces, yet existing methods

Do Post-Training Algorithms Actually Differ? A Controlled Study Across Model Scales Uncovers Scale-Dependent Ranking Inversions

arXiv:2603.19335v1 Announce Type: cross Abstract: Post-training alignment has produced dozens of competing algorithms — DPO, SimPO, KTO, GRPO, and others — yet practitioners lack controlled

Exploring Subnetwork Interactions in Heterogeneous Brain Network via Prior-Informed Graph Learning

arXiv:2603.19307v1 Announce Type: cross Abstract: Modeling the complex interactions among functional subnetworks is crucial for the diagnosis of mental disorders and the identification of functional