STARRPeaker: uniform processing and accurate identification of STARR-seq active regions

Donghoon Lee; Manman Shi; Jennifer Moran; Martha Wall; Jing Zhang; Jason Liu; Dominic Fitzgerald; Yasuhiro Kyono; Lijia Ma; Kevin P. White; Mark Gerstein

Journal ArticleOPEN ACCESS

STARRPeaker: uniform processing and accurate identification of STARR-seq active regions

Genome Biology (2020) 21(1)

DOI: 10.1186/s13059-020-02194-x

39Citations

60Readers

Abstract

STARR-seq technology has employed progressively more complex genomic libraries and increased sequencing depths. An issue with the increased complexity and depth is that the coverage in STARR-seq experiments is non-uniform, overdispersed, and often confounded by sequencing biases, such as GC content. Furthermore, STARR-seq readout is confounded by RNA secondary structure and thermodynamic stability. To address these potential confounders, we developed a negative binomial regression framework for uniformly processing STARR-seq data, called STARRPeaker. Moreover, to aid our effort, we generated whole-genome STARR-seq data from the HepG2 and K562 human cell lines and applied STARRPeaker to comprehensively and unbiasedly call enhancers in them.

Cite

CITATION STYLE

APA

Lee, D., Shi, M., Moran, J., Wall, M., Zhang, J., Liu, J., … Gerstein, M. (2020). STARRPeaker: uniform processing and accurate identification of STARR-seq active regions. Genome Biology, 21(1). https://doi.org/10.1186/s13059-020-02194-x

STARRPeaker: uniform processing and accurate identification of STARR-seq active regions

Abstract

Cite

Register to see more suggestions