Towards order-preserving submatrix search and indexing

Tao Jiang; Zhanhuai Li; Qun Chen; Kaiwen Li; Zhong Wang; Wei Pan

Conference Proceedings

Towards order-preserving submatrix search and indexing

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9050 309-326

DOI: 10.1007/978-3-319-18123-3_19

5Citations

5Readers

Get full text

Abstract

Order-Preserving SubMatrix (OPSM) has been proved to be important in modelling biologically meaningful subspace cluster, capturing the general tendency of gene expressions across a subset of conditions. Given an OPSM query based on row or column keywords, it is desirable to retrieve OPSMs quickly from a large gene expression dataset or OPSM data via indices. However, the time of OPSM mining from gene expression dataset is long and the volume of OPSM data is huge. In this paper, we investigate the issues of indexing two datasets above and first present a naive solution pfTree by applying prefix-Tree. Due to it is not efficient to search the tree, we give an optimization indexing method pIndex. Different from pfTree, pIndex employs row and column header tables to traverse related branches in a bottom-up manner. Further, two pruning rules based on number and order of keywords are introduced. To reduce the number of column keyword candidates on fuzzy queries, we introduce a First Item of keywords roTation method FIT, which reduces it from n! to n. We conduct extensive experiments with real datasets on a single machine, Hadoop and Hama, and the experimental results show the efficiency and scalability of the proposed techniques.

Author supplied keywords

Cite

CITATION STYLE

APA

Jiang, T., Li, Z., Chen, Q., Li, K., Wang, Z., & Pan, W. (2015). Towards order-preserving submatrix search and indexing. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9050, pp. 309–326). Springer Verlag. https://doi.org/10.1007/978-3-319-18123-3_19

Towards order-preserving submatrix search and indexing

Abstract

Author supplied keywords

Cite

Register to see more suggestions