De novo sequencing and homology searching

Bin Ma; Richard Johnson

ArticleOPEN ACCESS

De novo sequencing and homology searching

Molecular and Cellular Proteomics

DOI: 10.1074/mcp.O111.014902

142Citations

352Readers

Abstract

In proteomics, de novo sequencing is the process of deriving peptide sequences from tandem mass spectra without the assistance of a sequence database. Such analyses have traditionally been performed manually by human experts, and more recently by computer programs that have been developed because of the need for higher throughput. Although powerful, de novo sequencing often can only determine partially correct sequence tags because of imperfect tandem mass spectra. However, these sequence tags can then be searched in a sequence database to identify the exact or a homologous peptide. Homology searches are particularly useful for the study of organisms whose genomes have not been sequenced. This tutorial will present background important to understanding de novo sequencing, suggestions on how to do this manually, plus descriptions of computer algorithms used to automate this process and to subsequently carryout homology-based database searches. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP 1). © 2012 by The American Society for Biochemistry and Molecular Biology, Inc.

Cite

CITATION STYLE

APA

Ma, B., & Johnson, R. (2012, February). De novo sequencing and homology searching. Molecular and Cellular Proteomics. https://doi.org/10.1074/mcp.O111.014902

De novo sequencing and homology searching

Abstract

Cite

Register to see more suggestions