A general algorithm for subtree similarity-search

Sara Cohen; Nerya Or

Conference Proceedings

A general algorithm for subtree similarity-search

Proceedings - International Conference on Data Engineering (2014) 928-939

DOI: 10.1109/ICDE.2014.6816712

10Citations

34Readers

Get full text

Abstract

Determining similarity between trees is an important problem in a variety of areas. The subtree similarity-search problem is that of finding, given a tree Q and a large set of trees Γ = (T1;...; Tn), the subtrees of trees among Γ that are most similar to Q. Similarity is defined using some tree distance function. While subtree similarity-search has been studied in the past, solutions mostly focused on specific tree distance functions, and were usually applicable only to ordered trees. This paper presents an efficient new algorithm that solves the subtree similarity-search problem, and is compatible with a wide family of tree distance functions (for both ordered and unordered trees). Extensive experimentation confirms the efficiency and scalability of the algorithm, which displays consistently good runtime even for large queries and datasets. © 2014 IEEE.

Cite

CITATION STYLE

APA

Cohen, S., & Or, N. (2014). A general algorithm for subtree similarity-search. In Proceedings - International Conference on Data Engineering (pp. 928–939). IEEE Computer Society. https://doi.org/10.1109/ICDE.2014.6816712

A general algorithm for subtree similarity-search

Abstract

Cite

Register to see more suggestions