EXTREME: An online em algorithm for motif discovery

40Citations
Citations of this article
86Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation: Identifying regulatory elements is a fundamental problem in the field of gene transcription. Motif discovery - the task of identifying the sequence preference of transcription factor proteins, which bind to these elements - is an important step in this challenge. MEME is a popular motif discovery algorithm. Unfortunately, MEME's running time scales poorly with the size of the dataset. Experiments such as ChIP-Seq and DNase-Seq are providing a rich amount of information on the binding preference of transcription factors. MEME cannot discover motifs in data from these experiments in a practical amount of time without a compromising strategy such as discarding a majority of the sequences. Results: We present EXTREME, a motif discovery algorithm designed to find DNA-binding motifs in ChIP-Seq and DNase-Seq data. Unlike MEME, which uses the expectation-maximization algorithm for motif discovery, EXTREME uses the online expectation-maximization algorithm to discover motifs. EXTREME can discover motifs in large datasets in a practical amount of time without discarding any sequences. Using EXTREME on ChIP-Seq and DNase-Seq data, we discover many motifs, including some novel and infrequent motifs that can only be discovered by using the entire dataset. Conservation analysis of one of these novel infrequent motifs confirms that it is evolutionarily conserved and possibly functional. © 2014 The Author. Published by Oxford University Press. All rights reserved.

Cite

CITATION STYLE

APA

Quang, D., & Xie, X. (2014). EXTREME: An online em algorithm for motif discovery. Bioinformatics, 30(12), 1667–1673. https://doi.org/10.1093/bioinformatics/btu093

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free