KvarQ: Targeted and direct variant calling from fastq reads of bacterial genomes

114Citations
Citations of this article
174Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: High-throughput DNA sequencing produces vast amounts of data, with millions of short reads that usually have to be mapped to a reference genome or newly assembled. Both reference-based mapping and de novo assembly are computationally intensive, generating large intermediary data files, and thus require bioinformatics skills that are often lacking in the laboratories producing the data. Moreover, many research and practical applications in microbiology require only a small fraction of the whole genome data.Results: We developed KvarQ, a new tool that directly scans fastq files of bacterial genome sequences for known variants, such as single nucleotide polymorphisms (SNP), bypassing the need of mapping all sequencing reads to a reference genome and de novo assembly. Instead, KvarQ loads " testsuites" that define specific SNPs or short regions of interest in a reference genome, and directly synthesizes the relevant results based on the occurrence of these markers in the fastq files. KvarQ has a versatile command line interface and a graphical user interface. KvarQ currently ships with two " testsuites" for Mycobacterium tuberculosis, but new " testsuites" for other organisms can easily be created and distributed. In this article, we demonstrate how KvarQ can be used to successfully detect all main drug resistance mutations and phylogenetic markers in 880 bacterial whole genome sequences. The average scanning time per genome sequence was two minutes. The variant calls of a subset of these genomes were validated with a standard bioinformatics pipeline and revealed >99% congruency.Conclusion: KvarQ is a user-friendly tool that directly extracts relevant information from fastq files. This enables researchers and laboratory technicians with limited bioinformatics expertise to scan and analyze raw sequencing data in a matter of minutes. KvarQ is open-source, and pre-compiled packages with a graphical user interface are available at http://www.swisstph.ch/kvarq.

References Powered by Scopus

Fast and accurate short read alignment with Burrows-Wheeler transform

33934Citations
N/AReaders
Get full text

Deciphering the biology of mycobacterium tuberculosis from the complete genome sequence

6804Citations
N/AReaders
Get full text

Artemis: Sequence visualization and annotation

2573Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Whole-genome sequencing for prediction of Mycobacterium tuberculosis drug susceptibility and resistance: A retrospective cohort study

468Citations
N/AReaders
Get full text

Rapid antibiotic-resistance predictions from genome sequence data for Staphylococcus aureus and Mycobacterium tuberculosis

408Citations
N/AReaders
Get full text

The role of whole genome sequencing in antimicrobial susceptibility testing of bacteria: report from the EUCAST Subcommittee

372Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Steiner, A., Stucki, D., Coscolla, M., Borrell, S., & Gagneux, S. (2014). KvarQ: Targeted and direct variant calling from fastq reads of bacterial genomes. BMC Genomics, 15(1). https://doi.org/10.1186/1471-2164-15-881

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 70

57%

Researcher 40

33%

Professor / Associate Prof. 8

7%

Lecturer / Post doc 5

4%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 54

47%

Biochemistry, Genetics and Molecular Bi... 26

23%

Medicine and Dentistry 20

17%

Immunology and Microbiology 15

13%

Article Metrics

Tooltip
Mentions
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free