arXiv:2601.17102v1 Announce Type: new
Abstract: The accurate prediction of protein-ligand binding affinity is important for drug discovery yet remains challenging for multi-domain proteins, where inter-domain dynamics and flexible linkers govern molecular recognition. Current geometric deep learning methods typically treat proteins as monolithic graphs, failing to capture the distinct geometric and energetic signals at domain interfaces. To address this, we introduce DAGML (Domain-Aware Geometric Multimodal Learning), a hierarchical framework that explicitly models domain modularity. DAGML integrates a pre-trained protein language model with a novel domain-aware geometric encoder to distinguish intra- and inter-domain features, while a motif-centric ligand encoder captures pharmacophoric compatibility. We further curate a specialized multi-domain affinity benchmark, classifying complexes by binding topology (e.g., interface vs linker binders). Extensive experiments demonstrate that DAGML achieves a 21% reduction in MSE and a Pearson correlation of 0.726 compared to strong baselines. Ablation studies reveal that explicit modeling of domain interfaces is the primary driver of this improvement, particularly for ligands binding in the clefts between structural units. The code is available at https://github.com/jiankliu/DAGML.

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd.   dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registeration number 16808844