Abstract
Motivation: Protein dispensability is fundamental to the understanding of gene function and evolution. Recent advances in generating high-throughput data such as genomic sequence data, protein-protein interaction data, gene-expression data and growth-rate data of mutants allow us to investigate protein dispensability systematically at the genome scale. Results: In our studies, protein dispensability is represented as a fitness score that is measured by the growth rate of gene-deletion mutants. By the analyses of high-throughput data in yeast Saccharomyces cerevisiae, we found that a protein's dispensability had significant correlations with its evolutionary rate and duplication rate, as well as its connectivity in protein-protein interaction network and gene-expression correlation network. Neural network and support vector machine were applied to predict protein dispensability through high-throughput data. Our studies shed some lights on global characteristics of protein dispensability and evolution. © The Author 2004. Published by Oxford University Press. All rights reserved.
Cite
CITATION STYLE
Chen, Y., & Xu, D. (2005). Understanding protein dispensability through machine-learning analysis of high-throughput data. Bioinformatics, 21(5), 575–581. https://doi.org/10.1093/bioinformatics/bti058
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.