Background: The DNase I hypersensitive sites (DHSs) are associated with the cis-regulatory DNA elements. An efficient method of identifying DHSs can enhance the understanding on the accessibility of chromatin. Despite a multitude of resources available on line including experimental datasets and computational tools, the complex language of DHSs remains incompletely understood. Methods: Here, we address this challenge using an approach based on a state-of-the-art machine learning method. We present a novel convolutional neural network (CNN) which combined Inception like networks with a gating mechanism for the response of multiple patterns and longterm association in DNA sequences to predict multi-scale DHSs in Arabidopsis, rice and Homo sapiens. Results: Our method obtains 0.961 area under curve (AUC) on Arabidopsis, 0.969 AUC on rice and 0.918 AUC on Homo sapiens. Conclusions: Our method provides an efficient and accurate way to identify multi-scale DHSs sequences by deep learning.
CITATION STYLE
Lyu, C., Wang, L., & Zhang, J. (2018). Deep learning for DNase i hypersensitive sites identification. BMC Genomics, 19. https://doi.org/10.1186/s12864-018-5283-8
Mendeley helps you to discover research relevant for your work.