The identification of genetic variations in large cohorts is a critical issue to identify patient cohorts, disease risks, and to develop more effective treatments. To help this analysis, we improved a variant calling pipeline for the human genome using state-of-the-art tools, including GATK (Hard Filter/VQSR) and DeepVariant. The pipeline was tested in a computing cluster where it was possible to compare Illumina Platinum genomes using different approaches. Moreover, by using a secure data space we provide a solution to privacy and security concerns in genomics research. Overall, this variant calling pipeline has the potential to advance the field of genomics research significantly, improve healthcare outcomes, and simplify the analysis process. Therefore, it is critical to rigorously evaluate these pipelines’ performance before implementing them in clinical settings.
CITATION STYLE
Pinheiro, M., Silva, J. M., & Oliveira, J. L. (2023). Optimizing Variant Calling for Human Genome Analysis: A Comprehensive Pipeline Approach. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13920 LNBI, pp. 72–85). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-34960-7_6
Mendeley helps you to discover research relevant for your work.