Balanced Accuracy: The Right Metric for Evaluating LLM Judges – Explained through Youden’s J statistic

Magnification-Aware Distillation (MAD): A Self-Supervised Framework for Unified Representation Learning in Gigapixel Whole-Slide Images

arXiv:2512.14796v1 Announce Type: cross Abstract: Whole-slide images (WSIs) contain tissue information distributed across multiple magnification levels, yet most self-supervised methods treat these scales as independent

Feature-Centric Unsupervised Node Representation Learning Without Homophily Assumption

arXiv:2512.15112v1 Announce Type: cross Abstract: Unsupervised node representation learning aims to obtain meaningful node embeddings without relying on node labels. To achieve this, graph convolution,

SGM: Safety Glasses for Multimodal Large Language Models via Neuron-Level Detoxification

arXiv:2512.15052v1 Announce Type: cross Abstract: Disclaimer: Samples in this paper may be harmful and cause discomfort. Multimodal large language models (MLLMs) enable multimodal generation but

DrugRAG: Enhancing Pharmacy LLM Performance Through A Novel Retrieval-Augmented Generation Pipeline

arXiv:2512.14896v1 Announce Type: cross Abstract: Objectives: To evaluate large language model (LLM) performance on pharmacy licensure-style question-answering (QA) tasks and develop an external knowledge integration

Let the Barbarians In: How AI Can Accelerate Systems Performance Research

arXiv:2512.14806v1 Announce Type: cross Abstract: Artificial Intelligence (AI) is beginning to transform the research process by automating the discovery of new solutions. This shift depends