Learning to find comparable entities on the web

Xiaojiang Huang; Xiaojun Wan; Jianguo Xiao

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7651 LNCS 16-29

DOI: 10.1007/978-3-642-35063-4_2

Abstract

Comparison is a popular way for people to discover the commonalities and differences between two entities (e.g. products, people, companies, or events). It would be very useful to provide comparison results to the user automatically. The prerequisite step of this task is to find comparable entities. In this paper, we propose a novel Web mining system to address the task of finding comparable entities for a given single entity. First, the system uses a bootstrapping method to find candidate entities for the given entity through natural language analysis of the snippets in search engine results. Then, the system uses set expansion techniques to find more candidate entities through semi-structured HTML analysis of the downloaded web pages. Finally, the system uses a supervised learning method to classify the candidate entities as either comparable or incomparable by incorporating linguistic, statistical and semantic features. Experimental results demonstrate that our proposed framework can outperform the baseline systems. © 2012 Springer-Verlag.
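
The first step described in the abstract, finding candidates in search snippets, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the patterns or the bootstrapping procedure, so the two "vs"/"versus" patterns, the capitalized-phrase heuristic `CAND`, and the function name `find_candidates` are all assumptions made for illustration. The sketch performs a single round of pattern matching only; a bootstrapping method would presumably repeat it, learning new patterns from the pairs already found.

```python
import re
from collections import Counter

# Assumption: a candidate is a run of capitalized or numeric tokens,
# e.g. "Galaxy S3". This is a crude stand-in for real entity recognition.
CAND = r"(?P<cand>[A-Z][\w-]*(?: [A-Z0-9][\w-]*)*)"


def find_candidates(entity, snippets):
    """Count phrases that appear in a 'X vs Y' pattern with `entity`."""
    e = re.escape(entity)
    # Hypothetical seed patterns; the paper's actual patterns are not known.
    patterns = [
        re.compile(rf"\b{e} (?:vs\.?|versus) {CAND}"),
        re.compile(rf"{CAND} (?:vs\.?|versus) {e}\b"),
    ]
    counts = Counter()
    for snippet in snippets:
        for pattern in patterns:
            for match in pattern.finditer(snippet):
                cand = match.group("cand")
                if cand != entity:
                    counts[cand] += 1
    return counts


# Made-up snippets, used only to exercise the function.
snippets = [
    "iPhone vs Galaxy S3: which one should you buy?",
    "Lumia 920 versus iPhone camera test",
    "Compare iPhone vs. Galaxy S3 prices",
]
candidates = find_candidates("iPhone", snippets)
```

Here `candidates` maps each extracted phrase to the number of snippets patterns that matched it; such counts could serve as one of the statistical features mentioned in the abstract, although the paper's actual feature set is not described here.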

Cite

Huang, X., Wan, X., & Xiao, J. (2012). Learning to find comparable entities on the web. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7651 LNCS, pp. 16–29). https://doi.org/10.1007/978-3-642-35063-4_2
