Abstract
Submegabase-size topologically associating domains (TAD) have been observed in high-throughput chromatin interaction data (Hi-C). However, accurate detection of TADs depends on ultra-deep sequencing and sophisticated normalization procedures. Here we propose a fast and normalization-free method to decode the domains of chromosomes (deDoc) that utilizes structural information theory. By treating Hi-C contact matrix as a representation of a graph, deDoc partitions the graph into segments with minimal structural entropy. We show that structural entropy can also be used to determine the proper bin size of the Hi-C data. By applying deDoc to pooled Hi-C data from 10 single cells, we detect megabase-size TAD-like domains. This result implies that the modular structure of the genome spatial organization may be fundamental to even a small cohort of single cells. Our algorithms may facilitate systematic investigations of chromosomal domains on a larger scale than hitherto have been possible.
Cite
CITATION STYLE
Li, A., Yin, X., Xu, B., Wang, D., Han, J., Wei, Y., … Zhang, Z. (2018). Decoding topologically associating domains with ultra-low resolution Hi-C data by graph structural entropy. Nature Communications, 9(1). https://doi.org/10.1038/s41467-018-05691-7
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.