We have applied and implemented HMM (Hidden Markov Model) algorithms to calculate QTL genotype robabilities from marker and pedigree data in general population structures. These algorithms have a linear complexity in memory. In nearly all experimental pedigrees they result in more precise genotype estimates than the most commonly used approaches for determining genotypes at non-marker positions in QTL analysis in outbred F2 line intercrosses [1], which include an exponential complexity factor as well as a data-reducing sampling step [2]. With a proper choice of parameters, the results from the existing methods can also be reproduced exactly. We show how the relative run times differ by a factor of 50 when 24 SNP markers are used, with our run time practically independent of marker count. The new method can also provide multi-generational probability estimates and perform haplotype inference from unphased data, which further improves accuracy and flexibility. An important future application of this method is for computationally efficient QTL genotype estimation in maps based on data from SNP chips containing 1000s of markers with mixed information content, for which there are no other suitable methods available at present.
CITATION STYLE
Nettelblad, C., Holmgren, S., Crooks, L., & Carlborg, Ö. (2009). cnF2freq: Efficient determination of genotype and haplotype probabilities in outbred populations using markov models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5462 LNBI, pp. 307–319). https://doi.org/10.1007/978-3-642-00727-9_29
Mendeley helps you to discover research relevant for your work.