Inter-functional analysis of high-throughput phenotype data by non-parametric clustering and its application to photosynthesis

Qiaozi Gao; Elisabeth Ostendorf; Jeffrey A. Cruz; Rong Jin; David M. Kramer; Jin Chen

Journal Article, Open Access

Bioinformatics (2016) 32(1) 67-76

DOI: 10.1093/bioinformatics/btv515

5 Citations

21 Readers

Abstract

Motivation: Phenomics is the study of the properties and behaviors of organisms (i.e. their phenotypes) on a high-throughput scale. New computational tools are needed to analyze complex phenomics data, which consists of multiple traits/behaviors that interact with each other and are dependent on external factors, such as genotype and environmental conditions, in a way that has not been well studied.

Results: We deployed an efficient framework for partitioning complex and high dimensional phenotype data into distinct functional groups. To achieve this, we represented measured phenotype data from each genotype as a cloud-of-points, and developed a novel non-parametric clustering algorithm to cluster all the genotypes. When compared with conventional clustering approaches, the new method is advantageous in that it makes no assumption about the parametric form of the underlying data distribution and is thus particularly suitable for phenotype data analysis. We demonstrated the utility of the new clustering technique by distinguishing novel phenotypic patterns in both synthetic data and a high-throughput plant photosynthetic phenotype dataset. We biologically verified the clustering results using four Arabidopsis chloroplast mutant lines.

Availability and implementation: Software is available at www.msu.edu/~jinchen/NPM.
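The cloud-of-points idea in the Results can be illustrated with a small sketch. This is not the authors' NPM algorithm, whose details are not given in the abstract. It assumes a swapped-in technique: each genotype's measurements are compared with the energy distance (a distribution-free dissimilarity between two samples), and genotypes are then grouped by simple single-linkage thresholding. The genotype names, the synthetic 2-D data, and the threshold value are all hypothetical.

```python
import math
import random
from itertools import combinations


def mean_pairwise_distance(a, b):
    """Average Euclidean distance between every point in a and every point in b."""
    return sum(math.dist(p, q) for p in a for q in b) / (len(a) * len(b))


def energy_distance(a, b):
    """Distribution-free dissimilarity between two point clouds.

    No parametric form (e.g. Gaussian) is assumed for either cloud.
    """
    return (2 * mean_pairwise_distance(a, b)
            - mean_pairwise_distance(a, a)
            - mean_pairwise_distance(b, b))


def cluster_clouds(clouds, threshold):
    """Single-linkage grouping: merge genotypes whose clouds are closer than threshold."""
    parent = {name: name for name in clouds}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for g1, g2 in combinations(clouds, 2):
        if energy_distance(clouds[g1], clouds[g2]) < threshold:
            parent[find(g1)] = find(g2)

    groups = {}
    for name in clouds:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(g) for g in groups.values())


def make_cloud(rng, center, n=40, spread=0.3):
    """Hypothetical synthetic data: n noisy 2-D measurements around a center."""
    return [(rng.gauss(center[0], spread), rng.gauss(center[1], spread))
            for _ in range(n)]


rng = random.Random(0)
clouds = {
    "wt_a": make_cloud(rng, (0.0, 0.0)),
    "wt_b": make_cloud(rng, (0.0, 0.0)),
    "mut_a": make_cloud(rng, (3.0, 3.0)),
    "mut_b": make_cloud(rng, (3.0, 3.0)),
}
groups = cluster_clouds(clouds, threshold=1.0)  # threshold chosen by hand for this toy data
print(groups)
```

Comparing whole clouds rather than per-genotype means keeps the spread and shape of each genotype's measurements in play, which is the point of treating each genotype as a cloud of points. In practice the threshold would need to be chosen from the data rather than fixed by hand.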

Citation (APA)

Gao, Q., Ostendorf, E., Cruz, J. A., Jin, R., Kramer, D. M., & Chen, J. (2016). Inter-functional analysis of high-throughput phenotype data by non-parametric clustering and its application to photosynthesis. Bioinformatics, 32(1), 67–76. https://doi.org/10.1093/bioinformatics/btv515
