Telomere-to-telomere (T2T) and haplotype-resolved assembly are crucial for understanding eukaryotic genomes. For diploid species, this resolution is critical to uncover allelic variations, inheritance patterns, and functional genomic traits. Current scaffolding methods typically employ either sequence-based or graph-based strategies.Sequence-based approaches rely on proximity signals to yield high contiguity, but underutilize assembly graph information, resulting in more structural errors and chromosomal misassignments.Graph-based methods leverage graph topology for higher accuracy but frequently struggle to achieve chromosome-scale contiguity.However, neither strategy alone can overcome its inherent limitations to simultaneously achieve high contiguity and accuracy.To address these challenges, we introduce HapFold, the first hybrid scaffolding framework that synergistically leverages the complementary strengths of both graph-based and sequence-based approaches.By integrating the topological accuracy of assembly graphs with the proximity-guided contiguity of sequence models, HapFold achieves highly accurate, chromosome-scale or near-T2T haplotype reconstructions for diploid genomes.Compared to existing methods, HapFold achieves superior assembly quality while accelerating computation by an order of magnitude.Furthermore, in the haplotype reconstruction of diploid genomes using standard OxfordNanopore Technologies simplex reads, HapFold enables the reconstruction of a greater number of near-T2T assemblies.Our approach provides a robust and scalable solution for the high-fidelity reconstruction of haplotype-resolved diploid genomes.
Wavelet analysis of human recombination rates demonstrates divergence on fine scales
Background: Recombination rates can be estimated across the genome, underpinning genetic analyses such as identification of regions under selection. Accurate recombination mapping requires observing a
