Sequence alignment as a database technology challenge

Hans Philippi

Conference ProceedingsOPEN ACCESS

Sequence alignment as a database technology challenge

Philippi H

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4653 LNCS 459-468

DOI: 10.1007/978-3-540-74469-6_45

2Citations

5Readers

Get full text

Abstract

Sequence alignment is an important task for molecular biologists. Because alignment basically deals with approximate string matching on large biological sequence collections, it is both data intensive and computationally complex. There exist several tools for the variety of problems related to sequence alignment. Our first observation is that the term 'sequence database' is used in general for textually formatted string collections. A second observation is that the search tools are specifically dedicated to a single problem. They have limited capabilities to serve as a solution for related problems that require minor adaptations. Our aim is to show the possibilities and advantages of a DBMS-based approach toward sequence alignment. For this purpose, we will adopt techniques from single sequence alignment to speed up multiple sequence alignment. We will show how the problem of matching a protein string family against a large protein string database can be tackled with q-gram indexing techniques based on relational database technology. The use of Monet, a main-memory DBMS, allows us to realize a flexible environment for developing searching heuristics that outperform classical dynamic programming, while keeping up satisfying sensitivity figures. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Philippi, H. (2007). Sequence alignment as a database technology challenge. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4653 LNCS, pp. 459–468). Springer Verlag. https://doi.org/10.1007/978-3-540-74469-6_45

Sequence alignment as a database technology challenge

Abstract

Cite

Register to see more suggestions