Open entity extraction from web search query logs

Alpa Jain; Marco Pennacchiotti

Conference Proceedings

Open entity extraction from web search query logs

Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference (2010) 2 510-518

53Citations

109Readers

Abstract

In this paper we propose a completely unsupervised method for open-domain entity extraction and clustering over query logs. The underlying hypothesis is that classes defined by mining search user activity may significantly differ from those typically considered over web documents, in that they better model the user space, i.e. users'perception and interests. We show that our method outperforms state of the art (semi-)supervised systems based either on web documents or on query logs (16% gain on the clustering task). We also report evidence that our method successfully supports a real world application, namely keyword generation for sponsored search.

Cite

CITATION STYLE

APA

Jain, A., & Pennacchiotti, M. (2010). Open entity extraction from web search query logs. In Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference (Vol. 2, pp. 510–518).

Open entity extraction from web search query logs

Abstract

Cite

Register to see more suggestions