Robust and efficient single-cell Hi-C clustering with approximate k-nearest neighbor graphs

Joachim Wolff; Rolf Backofen; Björn Grüning

Journal Article, Open Access

Bioinformatics (2021) 37(22) 4006-4013

DOI: 10.1093/bioinformatics/btab394

Abstract

Motivation: Hi-C technology provides insights into the 3D organization of chromatin, and the single-cell Hi-C method enables researchers to study the chromatin state at the level of individual cells. Single-cell Hi-C interaction matrices are high-dimensional and very sparse. To cluster thousands of single-cell Hi-C interaction matrices, each matrix is flattened and the resulting vectors are compiled into one matrix. Depending on the resolution, this matrix can have from a few million up to billions of features, so computations can be memory-intensive. We present a single-cell Hi-C clustering approach that uses an approximate nearest neighbors method based on locality-sensitive hashing to reduce the dimensionality and the computational resources required.

Results: The presented method processes a 10 kb single-cell Hi-C dataset with 2600 cells using 40 GB of memory, while competing approaches cannot be computed even with 1 TB of memory. The differentiation of cells by their chromatin folding properties, and therefore the quality of the clustering of single-cell Hi-C data, compares favorably with that of competing algorithms.
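
The flattening step and the idea of comparing cells through short hash signatures, rather than through their full feature vectors, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the hashing scheme, so MinHash is assumed here as one common locality-sensitive hashing family for sparse data, and the function names (`flatten_contacts`, `minhash_signature`, `approximate_knn`) and parameters (`num_hashes`, `seed`) are hypothetical. For brevity, the sketch compares all signature pairs directly instead of bucketing them, which a full LSH index would do.

```python
import random

# Assumption: each cell is given as a list of (bin_i, bin_j) pairs with a nonzero contact.
# Assumption: MinHash stands in for the locality-sensitive hashing scheme.

PRIME = 2_147_483_647  # 2^31 - 1; feature indices must be smaller than this


def flatten_contacts(contacts, n_bins):
    """Map the (i, j) bin pairs of one cell to indices of a flattened n_bins * n_bins vector."""
    return {min(i, j) * n_bins + max(i, j) for i, j in contacts}


def minhash_signature(features, hash_params):
    """Keep one minimum per hash function h(x) = (a * x + b) mod PRIME."""
    if not features:
        # An empty cell gets a sentinel signature that no real feature can produce.
        return [PRIME] * len(hash_params)
    return [min((a * x + b) % PRIME for x in features) for a, b in hash_params]


def approximate_knn(cells, n_bins, k, num_hashes=64, seed=0):
    """Return, for every cell, the k cells whose signatures agree most often.

    The fraction of agreeing signature entries estimates the Jaccard similarity
    of the two cells' nonzero feature sets.
    """
    rng = random.Random(seed)
    hash_params = [
        (rng.randrange(1, PRIME), rng.randrange(0, PRIME)) for _ in range(num_hashes)
    ]
    signatures = [
        minhash_signature(flatten_contacts(cell, n_bins), hash_params) for cell in cells
    ]

    graph = []
    for query, sig_query in enumerate(signatures):
        scores = []
        for other, sig_other in enumerate(signatures):
            if other == query:
                continue
            matches = sum(x == y for x, y in zip(sig_query, sig_other))
            scores.append((matches / num_hashes, other))
        # Highest estimated similarity first; ties are broken by cell index.
        scores.sort(key=lambda item: (-item[0], item[1]))
        graph.append([other for _, other in scores[:k]])
    return graph


# Toy data: cells 0 and 1 share all contacts, as do cells 2 and 3.
cells = [
    [(0, 1), (2, 3), (4, 5)],
    [(0, 1), (2, 3), (4, 5)],
    [(6, 7), (8, 9), (1, 9)],
    [(6, 7), (8, 9), (1, 9)],
]
graph = approximate_knn(cells, n_bins=10, k=1)
```

Each cell is reduced from up to `n_bins * n_bins` features to `num_hashes` integers, which is where the memory saving in such a scheme would come from. The resulting neighbor lists form a k-nearest neighbor graph that a clustering algorithm could then take as input.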

Citation (APA)

Wolff, J., Backofen, R., & Grüning, B. (2021). Robust and efficient single-cell Hi-C clustering with approximate k-nearest neighbor graphs. Bioinformatics, 37(22), 4006–4013. https://doi.org/10.1093/bioinformatics/btab394
