A Study of Failure Modes in Two-Stage Human-Object Interaction Detection

CLIP Architecture for Abdominal CT Image-Text Alignment and Zero-Shot Learning: Investigating Batch Composition and Data Scaling

arXiv:2604.13561v1. Announce Type: cross. Abstract: Vision-language models trained with contrastive learning on paired medical images and reports show strong zero-shot diagnostic capabilities, yet the effect

SafeHarness: Lifecycle-Integrated Security Architecture for LLM-based Agent Deployment

arXiv:2604.13630v1. Announce Type: cross. Abstract: The performance of large language model (LLM) agents depends critically on the execution harness, the system layer that orchestrates tool

Monthly Diffusion v0.9: A Latent Diffusion Model for the First AI-MIP

arXiv:2604.13481v1. Announce Type: cross. Abstract: Here, we describe Monthly Diffusion at 1.5-degree grid spacing (MD-1.5 version 0.9), a climate emulator that leverages a spherical Fourier

Beyond Uniform Sampling: Synergistic Active Learning and Input Denoising for Robust Neural Operators

arXiv:2604.13316v1. Announce Type: cross. Abstract: Neural operators have emerged as fast surrogate models for physics simulations, yet they remain acutely vulnerable to adversarial perturbations, a

Minimax Optimality and Spectral Routing for Majority-Vote Ensembles under Markov Dependence

arXiv:2604.13414v1. Announce Type: cross. Abstract: Majority-vote ensembles achieve variance reduction by averaging over diverse, approximately independent base learners. When training data exhibits Markov dependence, as