Genome annotation test with validation on transcription start site and ChIP-Seq for Pol-II binding data

Justin Bedo; Adam Kowalczyk

Journal ArticleOPEN ACCESS

Genome annotation test with validation on transcription start site and ChIP-Seq for Pol-II binding data

Bioinformatics (2011) 27(12) 1610-1617

DOI: 10.1093/bioinformatics/btr263

0Citations

46Readers

Abstract

Motivation: Many ChIP-Seq experiments are aimed at developing gold standards for determining the locations of various genomic features such as transcription start or transcription factor binding sites on the whole genome. Many such pioneering experiments lack rigorous testing methods and adequate 'gold standard' annotations to compare against as they themselves are the most reliable source of empirical data available. To overcome this problem, we propose a self-consistency test whereby a dataset is tested against itself. It relies on a supervised machine learning style protocol for in silico annotation of a genome and accuracy estimation to guarantee, at least, self-consistency. Results: The main results use a novel performance metric (a calibrated precision) in order to assess and compare the robustness of the proposed supervised learning method across different test sets. As a proof of principle, we applied the whole protocol to two recent ChIP-Seq ENCODE datasets of STAT1 and Pol-II binding sites. STAT1 is benchmarked against in silicodetection of binding sites using available position weight matrices. Pol-II, the main focus of this paper, is benchmarked against 17 algorithms for the closely related and well-studied problem of in silico transcription start site (TSS) prediction. Our results also demonstrate the feasibility of in silico genome annotation extension with encouraging results from a small portion of annotated genome to the remainder. © The Author 2011. Published by Oxford University Press. All rights reserved.

Cite

CITATION STYLE

APA

Bedo, J., & Kowalczyk, A. (2011). Genome annotation test with validation on transcription start site and ChIP-Seq for Pol-II binding data. Bioinformatics, 27(12), 1610–1617. https://doi.org/10.1093/bioinformatics/btr263

Genome annotation test with validation on transcription start site and ChIP-Seq for Pol-II binding data

Abstract

Cite

Register to see more suggestions