Continuation methods for approximate large scale object sequencing

Xenophon Evangelopoulos; Austin J. Brockmeier; Tingting Mu; John Y. Goulermas

Journal Article (Open Access)

Machine Learning (2019) 108(4) 595-626

DOI: 10.1007/s10994-018-5764-7

Abstract

We propose a set of highly scalable algorithms for the combinatorial data analysis problem of seriating similarity matrices. Seriation consists of finding a permutation of data instances, such that similar instances are nearby in the ordering. Applications of the seriation problem can be found in various disciplines such as in bioinformatics for genome sequencing, data visualization and exploratory data analysis. Our algorithms attempt to minimize certain p-SUM objectives, which also arise in the problem of envelope reduction of sparse matrices. In particular, we present a set of graduated non-convexity algorithms for vector-based relaxations of the general p-SUM problem for p ∈ {2, 1, 1/2} that can scale to very large problem sizes. Different choices of p emphasize global versus local similarity pattern structure. We conduct a number of experiments to compare our algorithms to various state-of-the-art combinatorial optimization methods on real and synthetic datasets. The experimental results demonstrate that compared to other approaches, the proposed algorithms are very competitive and scale well with large problem sizes.
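
To make the seriation objective concrete, here is a minimal sketch in Python. It assumes the commonly used form of the p-SUM cost, the sum over all pairs (i, j) of A[i][j] * |pos[i] - pos[j]|^p, where A is the similarity matrix and pos[i] is the position assigned to object i; the abstract does not spell out this formula, so treat it as an assumption. The function names `p_sum` and `brute_force_seriation` are hypothetical, and the exhaustive search below is not the paper's graduated non-convexity method. It only illustrates what is being minimized, and it is feasible only for a handful of objects.

```python
from itertools import permutations


def p_sum(similarity, positions, p):
    """Assumed p-SUM cost: sum of similarity[i][j] * |positions[i] - positions[j]|**p."""
    n = len(similarity)
    return sum(
        similarity[i][j] * abs(positions[i] - positions[j]) ** p
        for i in range(n)
        for j in range(n)
    )


def brute_force_seriation(similarity, p):
    """Try every assignment of positions and return one with the lowest cost.

    Exhaustive search is factorial in the number of objects, so this is
    purely illustrative and unrelated to the scalable algorithms in the paper.
    """
    n = len(similarity)
    return min(permutations(range(n)), key=lambda pos: p_sum(similarity, pos, p))


# Toy data: four objects whose hidden order is the chain 2 - 0 - 3 - 1.
# Only neighbours in that chain have non-zero similarity.
S = [
    [0, 0, 1, 1],
    [0, 0, 0, 1],
    [1, 0, 0, 0],
    [1, 1, 0, 0],
]

for p in (2, 1, 0.5):
    best = brute_force_seriation(S, p)
    # Turn "position of each object" into "object at each position".
    ordering = sorted(range(len(S)), key=lambda obj: best[obj])
    print(p, ordering, p_sum(S, best, p))
```

On this toy matrix every value of p recovers the hidden chain (or its mirror image, which has the same cost), because a layout that places each similar pair in adjacent positions is optimal for any positive p. On noisier data the exponent changes how strongly long-range misplacements are penalized, which is the global versus local trade-off the abstract mentions.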

Cite

Evangelopoulos, X., Brockmeier, A. J., Mu, T., & Goulermas, J. Y. (2019). Continuation methods for approximate large scale object sequencing. Machine Learning, 108(4), 595–626. https://doi.org/10.1007/s10994-018-5764-7
