Production of Large Analogical Clusters from Smaller Example Seed Clusters Using Word Embeddings

Yuzhong Hong; Yves Lepage

Conference Proceedings

Production of Large Analogical Clusters from Smaller Example Seed Clusters Using Word Embeddings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11156 LNAI 548-562

DOI: 10.1007/978-3-030-01081-2_36

3Citations

5Readers

Get full text

Abstract

We introduce a method to automatically produce large analogical clusters from smaller seed clusters of representative examples. The method is based on techniques of processing and solving analogical equations in word vector space models, i.e., word embeddings. In our experiments, we use standard data sets in English which cover different relations extending from derivational morphology (like adjective–adverb, positive–comparative forms of adjectives) or inflectional morphology (like present–past forms) to encyclopedic semantics (like country–capital relations). The analogical clusters produced by our method are shown to be of reasonably good quality, as shown by comparing human judgment against automatic NDCG@n scores. In total, they contain 8.5 times as many relevant word pairs as the seed clusters.

Author supplied keywords

Cite

CITATION STYLE

APA

Hong, Y., & Lepage, Y. (2018). Production of Large Analogical Clusters from Smaller Example Seed Clusters Using Word Embeddings. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11156 LNAI, pp. 548–562). Springer Verlag. https://doi.org/10.1007/978-3-030-01081-2_36

Production of Large Analogical Clusters from Smaller Example Seed Clusters Using Word Embeddings

Abstract

Author supplied keywords

Cite

Register to see more suggestions