EagleC: A deep-learning framework for detecting a full range of structural variations from bulk and single-cell contact maps

53Citations
Citations of this article
57Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The Hi-C technique has been shown to be a promising method to detect structural variations (SVs) in human genomes. However, algorithms that can use Hi-C data for a full-range SV detection have been severely lacking. Current methods can only identify interchromosomal translocations and long-range intrachromosomal SVs (>1 Mb) at less-than-optimal resolution. Therefore, we develop EagleC, a framework that combines deep-learning and ensemble-learning strategies to predict a full range of SVs at high resolution. We show that EagleC can uniquely capture a set of fusion genes that are missed by whole-genome sequencing or nanopore. Furthermore, EagleC also effectively captures SVs in other chromatin interaction platforms, such as HiChIP, Chromatin interaction analysis with paired-end tag sequencing (ChIA-PET), and capture Hi-C. We apply EagleC in more than 100 cancer cell lines and primary tumors and identify a valuable set of high-quality SVs. Last, we demonstrate that EagleC can be applied to single-cell Hi-C and used to study the SV heterogeneity in primary tumors.

Cite

CITATION STYLE

APA

Wang, X., Luan, Y., & Yue, F. (2022). EagleC: A deep-learning framework for detecting a full range of structural variations from bulk and single-cell contact maps. Science Advances, 8(24). https://doi.org/10.1126/sciadv.abn9215

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free