Filling a protein scaffold with a reference

0Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In mass spectrometry-based de novo protein sequencing, it is hard to complete the sequence of the whole protein. Motivated by this we study the (one-sided) problem of filling a protein scaffold S with some missing amino acids, given a sequence of contigs none of which is allowed to be altered, with respect to a complete reference protein P of length n, such that the BLOSUM62 score between P and the filled sequence S′ is maximized. We show that this problem is polynomial-time solvable in O(n26) time. We also consider the case when the contigs are not of high quality and they are concatenated into an (incomplete) sequence I, where the missing amino acids can be inserted anywhere in I to obtain I′, such that the BLOSUM62 score between P and I′ is maximized. We show that this problem is polynomial-time solvable in O(n22) time. Due to the high running time, both of these algorithms are impractical, we hence present several algorithms based on greedy and local search, trying to solve the problems practically. The empirical results show that the algorithms can fill protein scaffolds almost perfectly, provided that a good pair of scaffold and reference are given.

Cite

CITATION STYLE

APA

Qingge, L., Liu, X., Zhong, F., & Zhu, B. (2016). Filling a protein scaffold with a reference. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9683, pp. 175–186). Springer Verlag. https://doi.org/10.1007/978-3-319-38782-6_15

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free