LX-DSemvectors: Distributional semantics models for Portuguese

João Rodrigues; António Branco; Steven Neale; João Silva

Conference Proceedings

LX-DSemvectors: Distributional semantics models for Portuguese

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9727 259-270

DOI: 10.1007/978-3-319-41552-9_27

26Citations

14Readers

Get full text

Abstract

In this article we describe the creation and distribution of the first publicly available word embeddings for Portuguese. Our embeddings are evaluated on their own and also compared with the original English models on a well-known analogy task. We gathered a large Portuguese corpus of 1.7 billion tokens, developed the first distributional semantic analogies test set for Portuguese, and proceeded with the first parametrization and evaluation of Portuguese word embeddings models.

Author supplied keywords

Cite

CITATION STYLE

APA

Rodrigues, J., Branco, A., Neale, S., & Silva, J. (2016). LX-DSemvectors: Distributional semantics models for Portuguese. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9727, pp. 259–270). Springer Verlag. https://doi.org/10.1007/978-3-319-41552-9_27

LX-DSemvectors: Distributional semantics models for Portuguese

Abstract

Author supplied keywords

Cite

Register to see more suggestions