The Central Coupler of the AAA+ ATPase ClpXP Controls Intersubunit Communication and Couples the Conversion of Chemical Energy into the Generation of Force

ClpX is a clockwise hexameric helical arrangement that hydrolyzes ATP to unfold proteins and translocate them into the proteolytic chamber. We investigate the central coupler,

Structural features of E. coli Stx bacteriophage phi24B revealed with cryo-electron microscopy

Shiga toxin-converting bacteriophages play a critical role in the emergence and virulence of pathogenic Escherichia coli strains. Despite their significance, detailed structural information on these

Optimization of AAV tools to target Müller glial cells for retinal gene therapy

Reprogramming of Müller glial (MG) cells into retinal neurons has the potential to treat vision loss by regenerating the retina. Development of efficient gene delivery

scMultiPreDICT: A single-cell predictive framework with transcriptomic and epigenetic signatures

Cellular responses to genetic perturbations depend on both transcriptional programs and the epigenetic landscape. While single-cell multiomics technologies enable simultaneous profiling of gene expression and

Engineering a Glucose-Inducible Whole-Cell Biosensor via CRISPRi-Based Promoter Reprogramming

Precise monitoring of intracellular glucose dynamics is essential for understanding carbon flux, optimizing microbial bioprocesses, and enabling responsive control of engineered metabolic pathways. Here, we

Measuring LLM Trust Allocation Across Conflicting Software Artifacts

April 7, 2026

arXiv:2604.03447v1 Announce Type: cross
Abstract: LLM-based software engineering assistants fail not only by producing incorrect outputs, but also by allocating trust to the wrong artifact when code, documentation, and tests disagree. Existing evaluations focus mainly on downstream outcomes and therefore cannot reveal whether a model recognized degraded evidence, identified the unreliable source, or calibrated its trust across artifacts. We present TRACE (Trust Reasoning over Artifacts for Calibrated Evaluation), a framework that elicits structured artifact-level trust traces over Javadoc, method signatures, implementations, and test prefixes under blind perturbations. Using 22,339 valid traces from seven models on 456 curated Java method bundles, we evaluate per-artifact quality assessment, inconsistency detection, affected artifact attribution, and source prioritization. Across all models, quality penalties are largely localized to the perturbed artifact and increase with severity, but sensitivity is asymmetric across artifact types: documentation bugs induce a substantially larger heavy-to-subtle gap than implementation faults (0.152-0.253 vs. 0.049-0.123). Models detect explicit documentation bugs well (67-94%) and Javadoc and implementation contradictions at 50-91%, yet show a systematic blind spot when only the implementation drifts while the documentation remains plausible, with detection dropping by 7-42 percentage points. Confidence is poorly calibrated for six of seven models. These findings suggest that current LLMs are better at auditing natural-language specifications than at detecting subtle code-level drift, motivating explicit artifact-level trust reasoning before correctness-critical downstream use.
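The two headline quantities in the abstract, the heavy-to-subtle gap and the detection rate, can be illustrated with a minimal sketch. It rests on assumptions the abstract does not confirm: that the gap is the mean quality penalty under heavy perturbation minus the mean under subtle perturbation, and that detection rate is the share of traces in which the model flagged an inconsistency. The `TrustTrace` record and all of its field names are hypothetical and are not TRACE's actual schema.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class TrustTrace:
    """Hypothetical artifact-level trace; field names are illustrative only."""
    perturbed_artifact: str      # e.g. "javadoc" or "implementation"
    severity: str                # assumed to be either "subtle" or "heavy"
    quality_penalty: float       # assumed: drop in the quality score given to the perturbed artifact
    flagged_inconsistency: bool  # whether the model reported any inconsistency


def heavy_to_subtle_gap(traces, artifact):
    """Assumed definition: mean heavy penalty minus mean subtle penalty."""
    penalties = {"heavy": [], "subtle": []}
    for trace in traces:
        if trace.perturbed_artifact == artifact:
            penalties[trace.severity].append(trace.quality_penalty)
    return mean(penalties["heavy"]) - mean(penalties["subtle"])


def detection_rate(traces, artifact):
    """Share of traces for this artifact in which an inconsistency was flagged."""
    flags = [t.flagged_inconsistency for t in traces if t.perturbed_artifact == artifact]
    return sum(flags) / len(flags)
```

Under these assumptions, a larger gap for `"javadoc"` than for `"implementation"` would correspond to the asymmetry the abstract reports, and a lower detection rate for implementation-only drift would correspond to the blind spot it describes.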

