Algorithmic complexity of protein identification: Searching in weighted strings

Mark Cieliebak; Zsuzsanna Lipták; Emo Welzl; Thomas Erlebach; Jens Stoye

Journal ArticleOPEN ACCESS

Algorithmic complexity of protein identification: Searching in weighted strings

IFIP Advances in Information and Communication Technology (2002) 96 143-156

DOI: 10.1007/978-0-387-35608-2_13

0Citations

2Readers

Abstract

We investigate a problem which arises in computational biology: Given a constant-size alphabet A with a weight function μ: A → ℕ, find an efficient data structure and query algorithm solving the following problem: For a string σ over A and a weight M ∈ ℕ, decide whether σ contains a substring with weight M (One-String Mass Finding Problem). If the answer is yes, then we may in addition require a witness, i.e., indices i ≤ j such that the substring beginning at position i and ending at position j has weight M. We allow preprocessing of the string, and measure efficiency in two parameters: storage space required for the preprocessed data, and running time of the query algorithm for given M. We are interested in data structures and algorithms requiring subquadratic storage space and sublinear query time, where we measure the input size as the length of the input string. Among others, we present two non-trivial efficient algorithms: Lookup solves the problem with O(n) space and O(n/log · log log n) time; Interval solves the problem for binary alphabets with O(n) storage space in O(log n) query time. Finally, we introduce other variants of the problem and sketch how our algorithms may be extended for these variants.

Author supplied keywords

Cite

CITATION STYLE

APA

Cieliebak, M., Lipták, Z., Welzl, E., Erlebach, T., & Stoye, J. (2002). Algorithmic complexity of protein identification: Searching in weighted strings. IFIP Advances in Information and Communication Technology, 96, 143–156. https://doi.org/10.1007/978-0-387-35608-2_13

Algorithmic complexity of protein identification: Searching in weighted strings

Abstract

Author supplied keywords

Cite

Register to see more suggestions