Abstract
Nearly 2200 genomes that encode around 6 million proteins have now been sequenced. Around 40% of these proteins are of unknown function, even when function is loosely and minimally defined as 'belonging to a superfamily'. In addition to in silico methods, the swelling stream of high-throughput experimental data can give valuable clues for linking these unknowns with precise biological roles. The goal is to develop integrative data-mining platforms that allow the scientific community at large to access and utilize this rich source of experimental knowledge. To this end, we review recent advances in generating whole-genome experimental datasets, where this data can be accessed, and how it can be used to drive prediction of gene function.
Cite
CITATION STYLE
Kuri-Morales, A., & Aldana-Bobadill, E. (2011). The Search for Irregularly Shaped Clusters in Data Mining. In New Fundamental Technologies in Data Mining. InTech. https://doi.org/10.5772/13655
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.