Key points are not available for this paper at this time.
Modern pangenome graphs are built using haplotype-resolved genome assemblies. When mapping reads to a pangenome graph, prioritizing alignments that are consistent with the known haplotypes improves genotyping accuracy. However, the existing rigorous formulations for colinear chaining and alignment problems do not consider the haplotype paths in a pangenome graph. This often leads to spurious read alignments to those paths that are unlikely recombinations of the known haplotypes. In this paper, we develop novel formulations and algorithms for sequence-to-graph alignment and chaining problems. Inspired by the genotype imputation models, we assume that a query sequence is an imperfect mosaic of reference haplotypes. Accordingly, we introduce a recombination penalty in the scoring functions for each haplotype switch. First, we solve haplotype-aware sequence-to-graph alignment in Formula: see text time, where
Building similarity graph...
Analyzing shared references across papers
Loading...
Ghanshyam Chandra
Intel (United Kingdom)
Daniel Gibney
The University of Texas at Dallas
Chirag Jain
Amity University
Genome Research
The University of Texas at Dallas
Indian Institute of Science Bangalore
Building similarity graph...
Analyzing shared references across papers
Loading...
Chandra et al. (Tue,) studied this question.
synapsesocial.com/papers/68e6012eb6db643587594fa3 — DOI: https://doi.org/10.1101/gr.279143.124