Wavelet analysis of human recombination rates demonstrates divergence on fine scales

Background: Recombination rates can be estimated across the genome, underpinning genetic analyses such as identification of regions under selection. Accurate recombination mapping requires observing a

Cortical geometry constrains the unimodal anchors of sensory integration

The human cerebral cortex is organized along a unimodal-to-transmodal hierarchy, which provides a putative substrate for the integration of sensory signals from primary cortical fields

Post-translational modifications in the brain are critical contributors to Alzheimers disease neuropathology and cognitive decline

Post-translational modifications (PTMs) in APP and MAPT contribute to plaques and tangles in Alzheimers disease (AD). Yet broader proteome-wide PTMs in the AD brain are

The Amygdalostriatal Transition Area Exhibits Lateral Amygdala-Like Spiking Activity and Tone-Shock Pairing-Induced Plasticity

During Pavlovian fear conditioning, presentation of a conditioned stimulus, such as a tone, together with an unconditioned stimulus, such as an electrical shock, excites neurons

The Logic of Thalamic Inputs onto the Molecular Taxonomy of Cortical Neurons Reveals a Visual Hierarchy

The hierarchical organization of sensory cortices and the rich molecular taxonomy of their cell types are defining features of the mammalian cortex. Cortical areas along

Plan, Watch, Recover: A Benchmark and Architectures for Proactive Procedural Assistance

June 4, 2026

arXiv:2606.04970v1 Announce Type: cross
Abstract: We envision a proactive multi-modal assistant system which gives users real-time step-by-step guidance on a procedural task, autonomously deciding textitwhen to interrupt, and textithow to coach. However, progress is limited by the absence of large-scale, cross-domain benchmarks that reflect realistic conditions, particularly the common case in which users deviate from the expected step sequence. We address this gap with four contributions: textbf(1)~we release textbfEgoProactive, a large-scale wearable-egocentric dataset for proactive procedural assistance with explicit Out-of-Plan (OOP) annotations and recovery steps; textbf(2)~we augment five established benchmarks (Ego4D, EPIC-KITCHENS, EgoExo4D, HoloAssist, HowTo100M) into textbfProtextsuperscript2Bench under a unified proactive-guidance schema; textbf(3)~we propose a textbfdecoupled planner–interaction architecture specialized for procedural state, visual cues, and recovery injection; textbf(4)~we introduce a post-training recipe that transfers across model families, validated by cross-backbone replication on Llama~4 and Qwen-3.6-VL. In extensive experiments, our trained Llama-4 system substantially improves objective intervention quality over strong proprietary baselines (Claude Opus~4.6, Gemini~3.1~Pro, GPT~5.2) and open-weight baselines (Qwen3~VL~235B) baselines across all six datasets. Oracle-plan experiments further show that, when plan quality is controlled, the trained duplex model produces high-quality guidance and large gains on Out-of-Plan recovery.

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd. dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844