Maximal words in sequence comparisons based on subword composition

Abstract

Measures of sequence similarity and distance based, more or less explicitly, on subword composition are attracting increasing interest, driven by intensive applications such as massive document classification and genome-wide molecular taxonomy. A unifying trait of such measures is some underlying notion of relative compressibility, whereby two similar sequences are expected to share a larger number of common substrings than two distant ones. This paper reviews some of the approaches to sequence comparison based on subword composition and suggests that their common denominator may ultimately reside in special classes of subwords, the nature of which resonates in interesting ways with the structure of popular subword trees and graphs. © Springer-Verlag Berlin Heidelberg 2010.
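
The intuition that similar sequences share more common substrings than distant ones can be illustrated with a minimal sketch. This is not the method of the paper: it is a generic Jaccard index over fixed-length subwords (k-mers), and the function names `subword_counts` and `shared_subword_similarity`, as well as the default `k = 3`, are assumptions made for illustration only.

```python
from collections import Counter


def subword_counts(seq: str, k: int) -> Counter:
    """Count every length-k substring (k-mer) of seq.

    Sequences shorter than k yield an empty Counter.
    """
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def shared_subword_similarity(a: str, b: str, k: int = 3) -> float:
    """Jaccard index over the sets of distinct k-mers of a and b.

    Illustrative choice only: returns 0.0 when neither sequence
    contains a k-mer, so the ratio is always defined.
    """
    kmers_a = set(subword_counts(a, k))
    kmers_b = set(subword_counts(b, k))
    union = kmers_a | kmers_b
    if not union:
        return 0.0
    return len(kmers_a & kmers_b) / len(union)


if __name__ == "__main__":
    reference = "ACGTACGTAC"
    close = "ACGTACGTTC"    # differs from reference in one position
    distant = "GGGGCCCCGG"  # shares no 3-mer with reference
    print(shared_subword_similarity(reference, close))
    print(shared_subword_similarity(reference, distant))
```

Under these assumptions, the near-identical pair scores higher than the unrelated pair. Fixed-length k-mers are only one simple choice of subword class; the abstract points instead to special classes of subwords tied to subword trees and graphs.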

Cite

Apostolico, A. (2010). Maximal words in sequence comparisons based on subword composition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6060 LNCS, pp. 34–44). https://doi.org/10.1007/978-3-642-12476-1_2
