Abstract
Word embedding is a technique for understanding the relationship among words by mapping words to numbers. Several kinds of research have been carried out in this field in different languages such as English, Hindi, Bengali etc. but very few works are available in the Nepali language domain. In this work, the word embedding technique using Word2Vec is implemented for Nepali news data. The methodology involved in this work includes Dataset preparation and Word2Vec modelling. Gensim package is used for implementing the Word2Vec model and its output shows the similarity between Nepali words. The work mainly focuses on developing word embedding on Nepali words generated by scraping the health section of Nepali news portals and has shown promising results.
Author supplied keywords
Cite
CITATION STYLE
Subedi, B., & Poudyal, P. (2022). Word Embedding in Nepali Language using Word2Vec. In ACM International Conference Proceeding Series (pp. 152–156). Association for Computing Machinery. https://doi.org/10.1145/3582768.3582799
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.