Hierarchical clustering, languages and cancer

Pritha Mahata; Wagner Costa; Carlos Cotta; Pablo Moscato

Conference Proceedings

Hierarchical clustering, languages and cancer

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 3907 LNCS 67-78

DOI: 10.1007/11732242_7

9Citations

6Readers

Get full text

Abstract

In this paper, we introduce a novel objective function for the hierarchical clustering of data from distance matrices, a very relevant task in Bioinformatics. To test the robustness of the method, we test it in two areas: (a) the problem of deriving a phytogeny of languages and (b) subtype cancer classification from microarray data. For comparison purposes, we also consider both the use of ultrametric trees (generated via a two-phase evolutionary approach that creates a large number of hypothesis trees, and then takes a consensus), and the best-known results from the literature. We used a dataset of measured 'separation time' among 84 Indo-European languages. The hierarchy we produce agrees very well with existing data about these languages across a wide range of levels, and it helps to clarify and raise new hypothesis about the evolution of these languages. Our method also generated a classification tree for the different cancers in the NCI60 microarray dataset (comprising gene expression data for 60 cancer cell lines). In this case, the method seems to support the current belief about the heterogeneous nature of the ovarian, breast and non-small-lung cancer, as opposed to the relative homogeneity of other types of cancer. However, our method reveals a close relationship of the melanoma and CNS cell-lines. This is in correspondence with the fact that metastatic melanoma first appears in central nervous system (CNS). © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Mahata, P., Costa, W., Cotta, C., & Moscato, P. (2006). Hierarchical clustering, languages and cancer. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3907 LNCS, pp. 67–78). https://doi.org/10.1007/11732242_7

Hierarchical clustering, languages and cancer

Abstract

Cite

Register to see more suggestions