Identification of regulatory elements using a feature selection method

Sündüz Keleş; Mark Van Der Laan; Michael B. Eisen

Journal ArticleOPEN ACCESS

Identification of regulatory elements using a feature selection method

Bioinformatics (2002) 18(9) 1167-1175

DOI: 10.1093/bioinformatics/18.9.1167

86Citations

87Readers

Abstract

Motivation: Many methods have been described to identify regulatory motifs in the transcription control regions of genes that exhibit similar patterns of gene expression across a variety of experimental conditions. Here we focus on a single experimental condition, and utilize gene expression data to identify sequence motifs associated with genes that are activated under this experimental condition. We use a linear model with two-way interactions to model gene expression as a function of sequence features (words) present in presumptive transcription control regions. The most relevant features are selected by a feature selection method called stepwise selection with monte carlo cross validation. We apply this method to a publicly available dataset of the yeast Saccharomyces cerevisiae, focussing on the 800 basepairs immediately upstream of each gene's translation start site (the upstream control region (UCR). Result: We successfully identify regulatory motifs that are known to be active under the experimental conditions analyzed, and find additional significant sequences that may represent novel regulatory motifs. We also discuss a complementary method that utilizes gene expression data from a single microarray experiment and allows averaging over variety of experimental conditions as an alternative to motif finding methods that act on clusters of co-expressed genes.

Cite

CITATION STYLE

APA

Keleş, S., Van Der Laan, M., & Eisen, M. B. (2002). Identification of regulatory elements using a feature selection method. Bioinformatics, 18(9), 1167–1175. https://doi.org/10.1093/bioinformatics/18.9.1167

Identification of regulatory elements using a feature selection method

Abstract

Cite

Register to see more suggestions