CounterMoral: Editing Morals in Language Models

When to Call an Apple Red: Humans Follow Introspective Rules, VLMs Don’t

arXiv:2604.06422v1 Announce Type: cross Abstract: Understanding when Vision-Language Models (VLMs) will behave unexpectedly, whether models can reliably predict their own behavior, and if models adhere

Bi-Level Optimization for Single Domain Generalization

arXiv:2604.06349v1 Announce Type: cross Abstract: Generalizing from a single labeled source domain to unseen target domains, without access to any target data during training, remains

Uncertainty Estimation for Deep Reconstruction in Actuatic Disaster Scenarios with Autonomous Vehicles

arXiv:2604.06387v1 Announce Type: cross Abstract: Accurate reconstruction of environmental scalar fields from sparse onboard observations is essential for autonomous vehicles engaged in aquatic monitoring. Beyond

Harnessing Hyperbolic Geometry for Harmful Prompt Detection and Sanitization

arXiv:2604.06285v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) have become essential for tasks such as image synthesis, captioning, and retrieval by aligning textual and visual

The Detection–Extraction Gap: Models Know the Answer Before They Can Say It

arXiv:2604.06613v1 Announce Type: cross Abstract: Modern reasoning models continue generating long after the answer is already determined. Across five model configurations, two families, and three