Utilizing word embeddings for result diversification in tweet search

7Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The performance of result diversification for tweet search suffers from the well-known vocabulary mismatch problem, as tweets are too short and usually informal. As a remedy, we propose to adopt a query and tweet expansion strategy that utilizes automatically-generated word embeddings. Our experiments using state-of-the-art diversification methods on the Tweets2013 corpus reveal encouraging results for expanding queries and/or tweets based on the word embeddings to improve the diversification performance in tweet search. We further show that the expansions based on the word embeddings may serve as useful as those based on a manually constructed knowledge base, namely, ConceptNet.

Cite

CITATION STYLE

APA

Onal, K. D., Altingovde, I. S., & Karagoz, P. (2015). Utilizing word embeddings for result diversification in tweet search. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9460, pp. 366–378). Springer Verlag. https://doi.org/10.1007/978-3-319-28940-3_29

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free