A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information

A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information, Yuri Pirola, ICCABS 2012 (slides).

This work proposes the Min-Recombinant Haplotype Configuration with Bounded Errors problem (MRHCE), which extends the original Min-Recombinant Haplotype Configuration formulation by incorporating two common characteristics of real data: errors and missing genotypes (including untyped individuals). We describe a practical algorithm for MRHCE that is based on a reduction to the Satisfiability problem (SAT) and exploits recent advances in the constraint programming literature. An experimental analysis demonstrates the soundness of our model and the effectiveness of the algorithm under several scenarios. The analysis of real data and the comparison with state-of-the-art programs reveal that our approach couples better scalability to large and complex pedigrees with the explicit inclusion of genotyping errors into the model. The software, released under the GNU General Public License, can be freely downloaded from this page.
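To make the quantity being minimized more concrete, here is a minimal Python sketch of recombination counting for a single parent-to-child haplotype transmission. It is not the SAT reduction described above and is not taken from the released software: it brute-forces every assignment of loci to the two parental haplotypes, which is only feasible for a handful of loci. The function names `count_recombinations` and `min_recombinations`, and the encoding of haplotypes as strings of allele characters, are assumptions made for illustration.

```python
from itertools import product


def count_recombinations(source_vector):
    """Count switches between the two parental haplotypes along the loci."""
    return sum(1 for a, b in zip(source_vector, source_vector[1:]) if a != b)


def min_recombinations(parent_haps, child_hap):
    """Fewest recombinations that explain child_hap from parent_haps.

    parent_haps: pair of equal-length allele sequences (hypothetical encoding).
    child_hap:   allele sequence inherited from that parent.
    Returns None when no assignment is consistent, which under error-free
    Mendelian inheritance would point to an error in the data.
    """
    best = None
    # Each source vector says, per locus, which parental haplotype (0 or 1)
    # the child's allele was copied from.
    for src in product((0, 1), repeat=len(child_hap)):
        consistent = all(
            parent_haps[s][i] == child_hap[i] for i, s in enumerate(src)
        )
        if consistent:
            r = count_recombinations(src)
            if best is None or r < best:
                best = r
    return best


if __name__ == "__main__":
    parent = ("0011", "1100")
    for child in ("0011", "0000", "0101"):
        print(child, min_recombinations(parent, child))
```

In the full problem this count is summed over all parent-child transmissions in the pedigree, and the haplotypes themselves are unknowns to be inferred from possibly erroneous or missing genotypes, which is why exhaustive search like the one above does not scale and a solver-based approach is attractive.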