The acid activity of enzymes, characterized by the minimum pH at which enzymes remain active (pHmin), is crucial for industrial applications in acidic environments. However, the rational design of acid-active enzymes remains challenging due to limited understanding of sequence-structure-activity relationships under acidic conditions. Here, we propose ACENet, a graph neural network that predicts enzyme pHmin by integrating surface features of protein structures with evolutionary representations derived from the large-scale protein language model ESM-2. ACENet achieved a Pearson correlation coefficient of 0.85 on the test dataset, significantly outperforming other deep learning baseline models and maintains stable pHmin predictions under various conditions. Even on a subset of the dataset with less than 20% homology, the PCC remains above 0.5, with an RMSE (Root mean square error) less than 1.4. ACENet also present excellent performance in the annotation of pHmin for homologous proteins and the predictive screening of minimal active pH in protein mutants. Remarkably, ACENet could identify the catalytic region as key determinants of acid activity through residue-level interpretability analysis. Overall, ACENet accelerates the development of highly efficient biocatalysts for diverse applications where acidic conditions predominate.
Toward terminological clarity in digital biomarker research
Digital biomarker research has generated thousands of publications demonstrating associations between sensor-derived measures and clinical conditions, yet clinical adoption remains negligible. We identify a foundational




