Semantic visualization for short texts with word embeddings

4Citations
Citations of this article
25Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Semantic visualization integrates topic modeling and visualization, such that every document is associated with a topic distribution as well as visualization coordinates on a low-dimensional Euclidean space. We address the problem of semantic visualization for short texts. Such documents are increasingly common, including tweets, search snippets, news headlines, or status updates. Due to their short lengths, it is difficult to model semantics as the word co-occurrences in such a corpus are very sparse. Our approach is to incorporate auxiliary information, such as word embeddings from a larger corpus, to supplement the lack of co-occurrences. This requires the development of a novel semantic visualization model that seamlessly integrates visualization coordinates, topic distributions, and word vectors. We propose a model called GaussianSV, which outperforms pipelined baselines that derive topic models and visualization coordinates as disjoint steps, as well as semantic visualization baselines that do not consider word embeddings.

Cite

CITATION STYLE

APA

Le, T. M. V., & Lauw, H. W. (2017). Semantic visualization for short texts with word embeddings. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 0, pp. 2074–2080). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2017/288

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free