VNTRseek - A computational tool to detect tandem repeat variants in high-throughput sequencing data

34Citations
Citations of this article
81Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

DNA tandem repeats (TRs) are ubiquitous genomic features which consist of two or more adjacent copies of an underlying pattern sequence. The copies may be identical or approximate. Variable number of tandem repeats or VNTRs are polymorphic TR loci in which the number of pattern copies is variable. In this paper we describe VNTRseek, our software for discovery of minisatellite VNTRs (pattern size ≥ 7 nucleotides) using whole genome sequencing data. VNTRseek maps sequencing reads to a set of reference TRs and then identifies putative VNTRs based on a discrepancy between the copy number of a reference and its mapped reads. VNTRseek was used to analyze the Watson and Khoisan genomes (454 technology) and two 1000 Genomes family trios (Illumina). In the Watson genome, we identified 752 VNTRs with pattern sizes ranging from 7 to 84 nt. In the Khoisan genome, we identified 2572 VNTRs with pattern sizes ranging from 7 to 105 nt. In the trios, we identified between 2660 and 3822 VNTRs per individual and found nearly 100% consistency with Mendelian inheritance. VNTRseek is, to the best of our knowledge, the first software for genome-wide detection of minisatellite VNTRs. It is available at http://orca.bu.edu/vntrseek/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Cite

CITATION STYLE

APA

Gelfand, Y., Hernandez, Y., Loving, J., & Benson, G. (2014). VNTRseek - A computational tool to detect tandem repeat variants in high-throughput sequencing data. Nucleic Acids Research, 42(14), 8884–8894. https://doi.org/10.1093/nar/gku642

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free