A woman’s uterus has been kept alive outside the body for the first time

“Think of this as a human body,” says Javier González. In front of me is essentially a metal box on wheels. Standing at around a

Brain criticality emerges with developmental shifts in frequency-specific excitation-inhibition balance

Adolescent brain maturation involves structural changes effecting a shift in excitation/inhibition (E/I) balance, yet the functional implications of these changes remain unclear. One implication is

An oxytocin-gated circuit from the hypothalamus silences olfactory tubercle neurons to drive prosocial grooming

Spontaneous helping behaviors such as allogrooming are vital for survival in social species, yet their underlying neural mechanisms remain largely unknown. Although oxytocin (OXT) is

Human neurodevelopmental genes housed in massive, ancient gene deserts

Mammalian genomes contain a remarkable proportion of non-coding sequence relative to protein-coding sequence, including megabase-scale gene deserts. The origins of gene deserts and their functions

Open-source, Hardware-Independent GPU Acceleration for Scalable Nanopore Basecalling with Slorado and Openfish

Nanopore sequencing technologies are used widely in genomics research and their adoption continues to accelerate. ‘Basecalling’ is an essential step in the nanopore sequencing workflow,

The Autonomy Tax: Defense Training Breaks LLM Agents

March 23, 2026

arXiv:2603.19423v1 Announce Type: cross
Abstract: Large language model (LLM) agents increasingly rely on external tools (file operations, API calls, database transactions) to autonomously complete complex multi-step tasks. Practitioners deploy defense-trained models to protect against prompt injection attacks that manipulate agent behavior through malicious observations or retrieved content. We reveal a fundamental capability-alignment paradox: defense training designed to improve safety systematically destroys agent competence while failing to prevent sophisticated attacks. Evaluating defended models against undefended baselines across 97 agent tasks and 1,000 adversarial prompts, we uncover three systematic biases unique to multi-step agents. Agent incompetence bias manifests as immediate tool execution breakdown, with models refusing or generating invalid actions on benign tasks before observing any external content. Cascade amplification bias causes early failures to propagate through retry loops, pushing defended models to timeout on 99% of tasks compared to 13% for baselines. Trigger bias leads to paradoxical security degradation where defended models perform worse than undefended baselines while straightforward attacks bypass defenses at high rates. Root cause analysis reveals these biases stem from shortcut learning: models overfit to surface attack patterns rather than semantic threat understanding, evidenced by extreme variance in defense effectiveness across attack categories. Our findings demonstrate that current defense paradigms optimize for single-turn refusal benchmarks while rendering multi-step agents fundamentally unreliable, necessitating new approaches that preserve tool execution competence under adversarial conditions.

Copyright 2025 dijee Intelligence Ltd. dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844