Wavelet analysis of human recombination rates demonstrates divergence on fine scales

Background: Recombination rates can be estimated across the genome, underpinning genetic analyses such as identification of regions under selection. Accurate recombination mapping requires observing a

Cortical geometry constrains the unimodal anchors of sensory integration

The human cerebral cortex is organized along a unimodal-to-transmodal hierarchy, which provides a putative substrate for the integration of sensory signals from primary cortical fields

Post-translational modifications in the brain are critical contributors to Alzheimers disease neuropathology and cognitive decline

Post-translational modifications (PTMs) in APP and MAPT contribute to plaques and tangles in Alzheimers disease (AD). Yet broader proteome-wide PTMs in the AD brain are

The Amygdalostriatal Transition Area Exhibits Lateral Amygdala-Like Spiking Activity and Tone-Shock Pairing-Induced Plasticity

During Pavlovian fear conditioning, presentation of a conditioned stimulus, such as a tone, together with an unconditioned stimulus, such as an electrical shock, excites neurons

The Logic of Thalamic Inputs onto the Molecular Taxonomy of Cortical Neurons Reveals a Visual Hierarchy

The hierarchical organization of sensory cortices and the rich molecular taxonomy of their cell types are defining features of the mammalian cortex. Cortical areas along

Think Like a Pilot: Fine-Grained Long-Horizon UAV Navigation

June 8, 2026

arXiv:2606.06836v1 Announce Type: cross
Abstract: Language-guided UAV agents must execute long-horizon semantic instructions while producing smooth, physically feasible continuous flight commands, yet existing Vision-Language Navigation (VLN) benchmarks typically use discrete or coarse actions and existing UAV Vision-Language-Action (VLA) tasks focus on short, atomic maneuvers. To address this gap in UAV task settings, we introduce textbfFLIGHT, a textbfFine-grained textbfLong-horizon textbfInstruction-textbfGuided benchmark for textbfHybrid UAV navigation and reasoning textbfTasks, which combines multi-stage instructions with dense 6-DoF trajectory annotations across two dataset splits: Fine-grained VLN and Long-horizon Flow. To endow the UAV agent with the capability of real-time in-flight reasoning over task execution status and mission planning, while simultaneously accommodating high-frequency, real-time precise control, we further propose textbfFLIGHT VLA, an asynchronous architecture that decouples a low-frequency Streaming Pilot Vision-Language Model (VLM) for task-state reasoning from a high-frequency diffusion action model for continuous control, supervised by explicit textbfPilot Reasoning texts that summarize the current flight state and anticipate the next subgoal. In closed-loop evaluation, FLIGHT VLA consistently surpasses representative VLN and VLA baselines on our FLIGHT benchmarks, achieving stronger multi-stage completion, subgoal adherence, and terminal control. Its trained Streaming Pilot Reasoning VLM further improves UAV video reasoning, validating the effectiveness of our design.

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd. dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844