Learning protein-DNA interaction landscapes by integrating experimental data through computational models

8Citations
Citations of this article
26Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation: Transcriptional regulation is directly enacted by the interactions between DNA and many proteins, including transcription factors (TFs), nucleosomes and polymerases. A critical step in deciphering transcriptional regulation is to infer, and eventually predict, the precise locations of these interactions, along with their strength and frequency. While recent datasets yield great insight into these interactions, individual data sources often provide only partial information regarding one aspect of the complete interaction landscape. For example, chromatin immunoprecipitation (ChIP) reveals the binding positions of a protein, but only for one protein at a time. In contrast, nucleases like MNase and DNase can be used to reveal binding positions for many different proteins at once, but cannot easily determine the identities of those proteins. Currently, few statistical frameworks jointly model these different data sources to reveal an accurate, holistic view of the in vivo protein-DNA interaction landscape. Results: Here, we develop a novel statistical framework that integrates different sources of experimental information within a thermodynamic model of competitive binding to jointly learn a holistic view of the in vivo protein-DNA interaction landscape. We show that our framework learns an interaction landscape with increased accuracy, explaining multiple sets of data in accordance with thermodynamic principles of competitive DNA binding. The resulting model of genomic occupancy provides a precise mechanistic vantage point from which to explore the role of protein-DNA interactions in transcriptional regulation. Availability and implementation: The C source code for COMPETE and Python source code for MCMC-based inference are available at http://www.cs.duke.edu/~amink.

References Powered by Scopus

A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition

17165Citations
N/AReaders
Get full text

Global analysis of protein expression in yeast

3132Citations
N/AReaders
Get full text

Transcriptional regulatory code of a eukaryotic genome

1774Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Implications of Big Data for cell biology

40Citations
N/AReaders
Get full text

Mapping nucleosome positions using DNase-seq

38Citations
N/AReaders
Get full text

Protein-DNA binding in high-resolution

35Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Zhong, J., Wasson, T., & Hartemink, A. J. (2014). Learning protein-DNA interaction landscapes by integrating experimental data through computational models. Bioinformatics, 30(20), 2868–2874. https://doi.org/10.1093/bioinformatics/btu408

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 9

41%

Researcher 7

32%

Professor / Associate Prof. 4

18%

Lecturer / Post doc 2

9%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 11

50%

Computer Science 4

18%

Biochemistry, Genetics and Molecular Bi... 4

18%

Medicine and Dentistry 3

14%

Save time finding and organizing research with Mendeley

Sign up for free