Generalizable Audio-Visual Navigation via Binaural Difference Attention and Action Transition Prediction

This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA

arXiv:2604.05051v1 Announce Type: cross Abstract: Patients are increasingly turning to large language models (LLMs) with medical questions that are complex and difficult to articulate clearly.

Generative AI for Video Trailer Synthesis: From Extractive Heuristics to Autoregressive Creativity

arXiv:2604.04953v1 Announce Type: cross Abstract: The domain of automatic video trailer generation is currently undergoing a profound paradigm shift, transitioning from heuristic-based extraction methods to

Architecture Without Architects: How AI Coding Agents Shape Software Architecture

arXiv:2604.04990v1 Announce Type: cross Abstract: AI coding agents select frameworks, scaffold infrastructure, and wire integrations, often in seconds. These are architectural decisions, yet almost no

Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems

arXiv:2604.04936v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) systems critically depend on effective document chunking strategies to balance retrieval quality, latency, and operational cost. Traditional

Improving Clinical Trial Recruitment using Clinical Narratives and Large Language Models

arXiv:2604.05190v1 Announce Type: cross Abstract: Screening patients for enrollment is a well-known, labor-intensive bottleneck that leads to under-enrollment and, ultimately, trial failures. Recent breakthroughs in