arXiv:2603.13351v1 Announce Type: new
Abstract: In a previous study [Jo, 2026], STAR reasoning (Situation, Task, Action, Result) raised accuracy on the car wash problem from 0% to 85% on Claude Sonnet 4.5, and to 100% with
additional prompt layers. This follow-up asks: does STAR maintain its effectiveness inside a production system prompt?
We tested STAR inside InterviewMate’s 60+ line production prompt, which had evolved through iterative additions of style guidelines, format instructions, and profile
features. We ran three conditions of 20 trials each on Claude Sonnet 4.6: (A) the production prompt with the Anthropic profile, (B) the production prompt with the default profile, and (C) the
original STAR-only prompt. Condition C scored 100% (verified at n=100); conditions A and B scored 0% and 30%, respectively.
Prompt complexity dilutes structured reasoning. STAR achieves 100% in isolation but degrades to 0-30% when surrounded by competing instructions. The mechanism: directives
like “Lead with specifics” force conclusion-first output, reversing the reason-then-conclude order that makes STAR effective. In one case, the model output “Short answer:
Walk.” and then executed STAR reasoning that correctly identified the constraint, proving that the model could reason correctly but had already committed to the wrong answer.
Cross-model comparison shows the STAR-only prompt improved from 85% (Sonnet 4.5) to 100% (Sonnet 4.6) with no prompt changes, suggesting that model upgrades amplify structured reasoning
in isolation.
These results imply structured reasoning frameworks should not be assumed to transfer from isolated testing to complex prompt environments. The order in which a model
reasons and concludes is a first-class design variable.
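As a minimal illustration of the ordering effect the abstract describes, the sketch below contrasts two hypothetical prompt templates (these are reconstructions for illustration, not the paper's actual InterviewMate or STAR prompts): one that asks the model to reason before concluding, and one whose "lead with specifics" directive forces the conclusion first.

```python
# Hypothetical prompt templates illustrating reason-then-conclude vs.
# conclusion-first ordering. Not the actual prompts from the study.

STAR_PROMPT = """\
Work through the problem in four labeled steps before answering:
Situation: describe the scenario and its constraints.
Task: state what is being asked.
Action: reason step by step through the options.
Result: only now state your final answer.
"""

CONCLUSION_FIRST_PROMPT = """\
Lead with specifics: give your short answer first,
then explain your reasoning.
"""

def answer_position(prompt: str) -> str:
    """Crude structural check: does the template request the final answer
    before or after the reasoning it asks for?"""
    lowered = prompt.lower()
    answer_idx = lowered.rindex("answer")   # last mention of the answer
    reasoning_idx = lowered.index("reason") # first mention of reasoning
    return "last" if answer_idx > reasoning_idx else "first"
```

Under this check, the STAR template places the answer last and the conclusion-first template places it first, which is the design variable the abstract argues must be controlled when embedding a framework like STAR into a larger prompt.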
Dissociable contributions of cortical thickness and surface area to cognitive ageing: evidence from multiple longitudinal cohorts.
Cortical volume, a widely used marker of brain ageing, is the product of two genetically and developmentally dissociable morphometric features: thickness and area. However, it remains


