Princeton WordNet is one of the most important resources for natural language processing, but has not been updated for over ten years and is not suitable for analyzing the fast moving language as used on social media. We propose an extension to WordNet, with new terms that have been found from Twitter and Reddit, and cover language usage that is emergent or vulgar. In addition to our methodology for extraction, we analyze new terms to provide information about how new words are entering the English language. Finally, we discuss publishing this resource both as linguistic linked open data and as part of the Global WordNet Association’s Interlingual Index.
CITATION STYLE
McCrae, J. P., Wood, I., & Hicks, A. (2017). The colloquial WordNet: Extending Princeton WordNet with neologisms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10318 LNAI, pp. 194–202). Springer Verlag. https://doi.org/10.1007/978-3-319-59888-8_17
Mendeley helps you to discover research relevant for your work.