Quantitative analysis and prediction of G-quadruplex forming sequences in double-stranded DNA


This article is free to access.

Abstract

A G-quadruplex (GQ) is a four-stranded DNA structure that can form in guanine-rich sequences. GQ structures have been proposed to regulate diverse biological processes including transcription, replication, translation and telomere maintenance. Recent studies have demonstrated the existence of GQ DNA in live mammalian cells and a significant number of potential GQ forming sequences in the human genome. We present a systematic and quantitative analysis of GQ folding propensity on a large set of 438 GQ forming sequences in double-stranded DNA by integrating fluorescence measurement, single-molecule imaging and computational modeling. We find that a short minimum loop length and the thymine base are the two main factors that lead to high GQ folding propensity. Linear and Gaussian process regression models further validate that the GQ folding potential can be predicted with high accuracy based on the loop length distribution and the nucleotide content of the loop sequences. Our study provides important new parameters that can inform the evaluation and classification of putative GQ sequences in the human genome.
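
As a rough illustration of the kind of sequence features the abstract names (loop lengths, minimum loop length, thymine content of the loops), the sketch below extracts them from a putative GQ sequence. It assumes a simple search motif of four runs of three guanines separated by three loops of 1 to 7 nucleotides; that motif, the name `loop_features`, and the returned keys are illustrative assumptions, not the definitions or the regression models used in the paper.

```python
import re

# Assumed motif (not taken from the paper): four runs of three guanines
# separated by three loops of 1-7 nucleotides, matched lazily so that
# each loop is as short as possible.
GQ_PATTERN = re.compile(
    r"G{3}([ACGT]{1,7}?)G{3}([ACGT]{1,7}?)G{3}([ACGT]{1,7}?)G{3}"
)


def loop_features(seq):
    """Return loop-based features for the first motif match, or None.

    Hypothetical helper: the feature names are chosen for illustration only.
    """
    match = GQ_PATTERN.search(seq.upper())
    if match is None:
        return None
    loops = list(match.groups())
    lengths = [len(loop) for loop in loops]
    joined = "".join(loops)  # every loop has at least one base, so never empty
    return {
        "loop_lengths": lengths,
        "min_loop_length": min(lengths),
        "thymine_fraction": joined.count("T") / len(joined),
    }


if __name__ == "__main__":
    print(loop_features("GGGTGGGTTGGGTTTGGG"))
```

Features of this kind could then serve as inputs to a regression model; fitting such a model would require measured folding data, which is not reproduced here.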

Citation

Kim, M., Kreig, A., Lee, C. Y., Rube, H. T., Calvert, J., Song, J. S., & Myong, S. (2016). Quantitative analysis and prediction of G-quadruplex forming sequences in double-stranded DNA. Nucleic Acids Research, 44(10), 4807–4817. https://doi.org/10.1093/nar/gkw272
