Evaluation of bottom-up and top-down mass spectrum identifications with different customized protein sequences databases

5Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Motivation: Generally, bottom-up and top-down are two complementary approaches for proteoforms identification. The inference of proteoforms relies on searching mass spectra against an accurate proteoform sequence database. A customized protein sequence database derived by RNA-Seq data can be used to better identify the proteoform existed in a studied species. However, the quality of sequences in customized databases which constructed by different strategies affect the performances of mass spectrometry (MS) identification. Additionally, performances of identifications between bottom-up and top-down using customized databases are also needed to be evaluated Results: Three customized databases were constructed with different strategies separately. Two of them were based on translating assembled transcripts with or without genomic annotation, and the third one is a variant-extending protein database. By testing with bottom-up and top-down MS data separately, a variant-extending protein database could identify not only the most number of spectra but also the alleles expressed at the same time in diploid cells. An assembled database could identify the spectrum missed in reference database and amino acid (AA) alterations existed in studied species.

Cite

CITATION STYLE

APA

Li, Z., He, B., & Feng, W. (2020). Evaluation of bottom-up and top-down mass spectrum identifications with different customized protein sequences databases. Bioinformatics, 36(4), 1030–1036. https://doi.org/10.1093/bioinformatics/btz733

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free