A parallel hierarchical agglomerative clustering technique for billingual corpora based on reduced terms with automatic weight optimization

2Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Multilingual corpora are becoming an essential resource for work in multilingual natural language processing. The aim of this paper is to investigate the effects of applying a clustering technique to parallel multilingual texts. It is interesting to look at the differences of the cluster mappings and the tree structures of the clusters. The effect of reducing the set of terms considered in clustering parallel corpora is also studied. After that, a genetic-based algorithm is applied to optimize the weights of terms considered in clustering the texts to classify unseen examples of documents. Specifically, the aim of this work is to introduce the tools necessary for this task and display a set of experimental results and issues which have become apparent. © 2009 Springer.

Cite

CITATION STYLE

APA

Alfred, R. (2009). A parallel hierarchical agglomerative clustering technique for billingual corpora based on reduced terms with automatic weight optimization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5678 LNAI, pp. 19–30). https://doi.org/10.1007/978-3-642-03348-3_6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free