PCA-HPR: A principle component analysis model for human promoter recognition

  • Li X
  • Zeng J
  • Yan H
N/ACitations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

We describe a promoter recognition method named PCA-HPR to locate eukaryotic promoter regions and predict transcription start sites (TSSs). We computed codon (3-mer) and pentamer (5-mer) frequencies and created codon and pentamer frequency feature matrices to extract informative and discriminative features for effective classification. Principal component analysis (PCA) is applied to the feature matrices and a subset of principal components (PCs) are selected for classification. Our system uses three neural network classifiers to distinguish promoters versus exons, promoters versus introns, and promoters versus 3' un-translated region (3'UTR). We compared PCA-HPR with three well-known existing promoter prediction systems such as DragonGSF, Eponine and FirstEF. Validation shows that PCA-HPR achieves the best performance with three test sets for all the four predictive systems.

Cite

CITATION STYLE

APA

Li, X., Zeng, J., & Yan, H. (2008). PCA-HPR: A principle component analysis model for human promoter recognition. Bioinformation, 2(9), 373–378. https://doi.org/10.6026/97320630002373

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free