Motivation: Functional enrichment testing facilitates the interpretation of Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) data in terms of pathways and other biological contexts. Previous methods developed and used to test for key gene sets affected in ChIP-seq experiments treat peaks as points, and are based on the number of peaks associated with a gene or a binary score for each gene. These approaches work well for transcription factors, but histone modifications often occur over broad domains, and across multiple genes. Results: To incorporate the unique properties of broad domains into functional enrichment testing, we developed Broad-Enrich, a method that uses the proportion of each gene's locus covered by a peak. We show that our method has a well-calibrated false-positive rate, performing well with ChIP-seq data having broad domains compared with alternative approaches. We illustrate Broad-Enrich with 55 ENCODE ChIP-seq datasets using different methods to define gene loci. Broad- Enrich can also be applied to other datasets consisting of broad genomic domains such as copy number variations. © The Author 2014. Published by Oxford University Press. All rights reserved.
CITATION STYLE
Cavalcante, R. G., Lee, C., Welch, R. P., Patil, S., Weymouth, T., Scott, L. J., & Sartor, M. A. (2014). Broad-Enrich: Functional interpretation of large sets of broad genomic regions. In Bioinformatics (Vol. 30). Oxford University Press. https://doi.org/10.1093/bioinformatics/btu444
Mendeley helps you to discover research relevant for your work.