Utilizing word embeddings for result diversification in tweet search

Kezban Dilek Onal; Ismail Sengor Altingovde; Pinar Karagoz

Conference Proceedings

Utilizing word embeddings for result diversification in tweet search

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9460 366-378

DOI: 10.1007/978-3-319-28940-3_29

7Citations

6Readers

Get full text

Abstract

The performance of result diversification for tweet search suffers from the well-known vocabulary mismatch problem, as tweets are too short and usually informal. As a remedy, we propose to adopt a query and tweet expansion strategy that utilizes automatically-generated word embeddings. Our experiments using state-of-the-art diversification methods on the Tweets2013 corpus reveal encouraging results for expanding queries and/or tweets based on the word embeddings to improve the diversification performance in tweet search. We further show that the expansions based on the word embeddings may serve as useful as those based on a manually constructed knowledge base, namely, ConceptNet.

Author supplied keywords

Cite

CITATION STYLE

APA

Onal, K. D., Altingovde, I. S., & Karagoz, P. (2015). Utilizing word embeddings for result diversification in tweet search. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9460, pp. 366–378). Springer Verlag. https://doi.org/10.1007/978-3-319-28940-3_29

Utilizing word embeddings for result diversification in tweet search

Abstract

Author supplied keywords

Cite

Register to see more suggestions