Data mining as a discovery tool for imprinted genes

0Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This chapter serves as an introduction to the collection of genome-wide sequence and epigenomic data, as well as the use of these data in training generalized linear models (glm) to predicted imprinted status. This is meant to be an introduction to the method, so only the most straightforward examples will be covered. For instance, the examples given below refer to 11 classes of genomic regions (the entire gene body, introns, exons, 5′ UTR, 3′ UTR, and 1, 10, and 100 kb upstream and downstream of each gene). One could also build models based on combinations of these regions. Likewise, models could be built on combinations of epigenetic features, or on combinations of both genomic regions and epigenetic features. This chapter relies heavily on computational methods, including basic programming. However, this chapter is not meant to be an introduction to programming. Throughout the chapter, the reader will be provided with example code in the Perl programming language. © 2012 Springer Science+Business Media, LLC.

Cite

CITATION STYLE

APA

Brideau, C., & Soloway, P. (2012). Data mining as a discovery tool for imprinted genes. Methods in Molecular Biology, 925, 89–134. https://doi.org/10.1007/978-1-62703-011-3_7

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free