Fantastic Bugs and Where to Find Them in AI Benchmarks

Sex and age estimation from cardiac signals captured via radar using data augmentation and deep learning: a privacy concern

IntroductionElectrocardiograms (ECGs) have long served as the standard method for cardiac monitoring. While ECGs are highly accurate and widely validated, they require direct skin contact,

Open LLM-based actionable incidental finding extraction from [18F]fluorodeoxyglucose PET-CT radiology reports

IntroductionWe developed an open, large language model (LLM)-based pipeline to extract actionable incidental findings (AIFs) from [18F]fluorodeoxyglucose positron emission tomography-computed tomography ([18F]FDG PET-CT) reports. This

Reassessing prediction in the brain: Pre-onset neural encoding during natural listening does not reflect pre-activation

arXiv:2412.19622v2 Announce Type: replace Abstract: Predictive processing theories propose that the brain continuously anticipates upcoming input. However, direct neural evidence for predictive pre-activation during natural

CharCom: Composable Identity Control for Multi-Character Story Illustration

arXiv:2510.10135v2 Announce Type: replace Abstract: Ensuring character identity consistency across varying prompts remains a fundamental limitation in diffusion-based text-to-image generation. We propose CharCom, a modular

ConCISE: A Reference-Free Conciseness Evaluation Metric for LLM-Generated Answers

arXiv:2511.16846v1 Announce Type: cross Abstract: Large language models (LLMs) frequently generate responses that are lengthy and verbose, filled with redundant or unnecessary details. This diminishes