Can LLMs Reason About Attention? Towards Zero-Shot Analysis of Multimodal Classroom Behavior

Expert-Annotated Embryo Image Dataset with Natural Language Descriptions for Evidence-Based Patient Communication in IVF

arXiv:2604.16528v1 Announce Type: cross Abstract: Embryo selection is one of multiple crucial steps in in-vitro fertilization, commonly based on morphological assessment by clinical embryologists. Although

Agentic Education: Using Claude Code to Teach Claude Code

arXiv:2604.17460v1 Announce Type: cross Abstract: AI coding assistants have proliferated rapidly, yet structured pedagogical frameworks for learning these tools remain scarce. Developers face a gap

Voronoi-guided Bilateral 2D Gaussian Splatting for Arbitrary-Scale Hyperspectral Image Super-Resolution

arXiv:2604.17727v1 Announce Type: cross Abstract: Most existing hyperspectral image super-resolution methods require modifications for different scales, limiting their flexibility in arbitrary-scale reconstruction. 2D Gaussian splatting

Comparing Human and Large Language Model Interpretation of Implicit Information

arXiv:2604.17085v1 Announce Type: cross Abstract: The interpretation of implicit meanings is an integral aspect of human communication. However, this framework may not transfer to interactions

Instinct vs. Reflection: Unifying Token and Verbalized Confidence in Multimodal Large Models

arXiv:2604.17274v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) have demonstrated exceptional capabilities in various perception and reasoning tasks. Despite this success, ensuring their