arXiv:2604.09604v1 Announce Type: new
Abstract: Exploration and goal-directed navigation in unknown layouts are central to inspection, logistics, and search-and-rescue. We ask whether large language models (LLMs) can function as text-only controllers under partial observability, without code execution, tools, or program synthesis. We introduce a reproducible benchmark with oracle localisation in fixed ASCII gridworlds: each step reveals only a local 5x5 window around the agent, and the model must select one of UP/RIGHT/DOWN/LEFT. Nine contemporary LLMs, spanning open and proprietary, dense and Mixture-of-Experts, and instruction- vs. reasoning-tuned variants, are evaluated on two tasks across three layouts of increasing difficulty: Exploration (maximising revealed cells) and Navigation (reaching the goal via the shortest path). Results are assessed with quantitative metrics, including success rate, efficiency measures such as normalised coverage, and path length relative to the oracle, alongside qualitative analysis. Reasoning-tuned models reliably complete navigation across all layouts, yet remain less efficient than oracle paths. Few-shot demonstrations in the prompt chiefly help these reasoning-tuned models by reducing invalid moves and shortening paths, while classic dense instruction models remain inconsistent. We observe characteristic action priors (UP/RIGHT) that can induce looping under partial observability. Overall, training regimen and test-time deliberation predict control ability better than raw parameter count. These findings suggest lightweight hybridisation with classical online planners as a practical route to deployable partial-map systems.
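The benchmark interface the abstract describes (a fixed ASCII gridworld, a 5x5 observation window centred on the agent, and four discrete moves where invalid ones leave the agent in place) can be sketched as follows. This is a minimal illustrative reconstruction under stated assumptions, not the paper's released code; the grid layout, cell symbols, and function names here are all hypothetical.

```python
# Illustrative sketch of the abstract's setup (not the paper's actual code):
# '#' = wall, '.' = free cell, 'G' = goal. Each step the controller sees only
# the 5x5 window around its position and picks one of four moves.

GRID = [
    "#######",
    "#.....#",
    "#.###.#",
    "#...#G#",
    "#######",
]

MOVES = {"UP": (-1, 0), "RIGHT": (0, 1), "DOWN": (1, 0), "LEFT": (0, -1)}

def window(grid, r, c, k=2):
    """Return the (2k+1)x(2k+1) local view around (r, c); off-map cells read as '#'."""
    view = []
    for dr in range(-k, k + 1):
        row = ""
        for dc in range(-k, k + 1):
            rr, cc = r + dr, c + dc
            inside = 0 <= rr < len(grid) and 0 <= cc < len(grid[0])
            row += grid[rr][cc] if inside else "#"
        view.append(row)
    return view

def step(grid, r, c, action):
    """Apply a move; a move into a wall is invalid and leaves the agent in place."""
    dr, dc = MOVES[action]
    rr, cc = r + dr, c + dc
    if grid[rr][cc] == "#":
        return r, c, False  # invalid move, position unchanged
    return rr, cc, True
```

Under this sketch, an Exploration score would count distinct cells ever covered by `window`, and a Navigation score would compare the executed path length against a shortest-path oracle on the full grid.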
Adaptation to free-living drives loss of beneficial endosymbiosis through metabolic trade-offs
Symbioses are widespread (1) and underpin the function of diverse ecosystems (2-6), but their evolutionary stability is challenging to explain (7,8). Fitness trade-offs between contrasting

