In proteomics, de novo sequencing is the process of deriving peptide sequences from tandem mass spectra without the assistance of a sequence database. Such analyses have traditionally been performed manually by human experts, and more recently by computer programs that have been developed because of the need for higher throughput. Although powerful, de novo sequencing often can only determine partially correct sequence tags because of imperfect tandem mass spectra. However, these sequence tags can then be searched in a sequence database to identify the exact or a homologous peptide. Homology searches are particularly useful for the study of organisms whose genomes have not been sequenced. This tutorial will present background important to understanding de novo sequencing, suggestions on how to do this manually, plus descriptions of computer algorithms used to automate this process and to subsequently carryout homology-based database searches. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP 1). © 2012 by The American Society for Biochemistry and Molecular Biology, Inc.
CITATION STYLE
Ma, B., & Johnson, R. (2012, February). De novo sequencing and homology searching. Molecular and Cellular Proteomics. https://doi.org/10.1074/mcp.O111.014902
Mendeley helps you to discover research relevant for your work.