Improving relevance of keyword extraction from the web utilizing visual style information

Abstract

Information is growing faster than ever before, which creates a need for advanced services that facilitate information "consumption" (e.g., recommendation, personalized navigation). Such services require at least lightweight semantics. The keyword paradigm is now widely used and appears to achieve satisfactory results in fields such as social bookmarking and ontology learning. In this paper we explore the impact of a web site's visual style on the extraction of relevant keywords. We propose a method for extracting relevant keywords from web pages that combines traditional automatic term recognition algorithms with processing of the web site's visual style, focusing in particular on Cascading Style Sheets (CSS). An evaluation conducted on 200 "wild" Web documents from 12 different web sites showed that our method increases the relevance of the extracted keywords. © 2013 Springer-Verlag Berlin Heidelberg.
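
The general idea of combining term statistics with visual style can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not specify the term recognition algorithm or how style is weighted, so this example substitutes plain term frequency and a hypothetical weight table (`TAG_WEIGHTS`), reads only inline `style` attributes rather than full style sheets, and assumes reasonably well-formed HTML. All names below are illustrative.

```python
from collections import Counter
from html.parser import HTMLParser
import re

# Hypothetical emphasis weights; the abstract does not give the paper's values.
TAG_WEIGHTS = {"h1": 3.0, "h2": 2.0, "strong": 1.5, "b": 1.5, "em": 1.2}
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "for", "on"}


def style_weight(tag, attrs):
    """Return an emphasis weight from the tag name and an inline style, if any."""
    weight = TAG_WEIGHTS.get(tag, 1.0)
    style = dict(attrs).get("style") or ""
    # Treat bold inline text like a <b> element (an assumption of this sketch).
    if re.search(r"font-weight\s*:\s*(bold|[6-9]00)", style):
        weight = max(weight, 1.5)
    return weight


class StyledTermCounter(HTMLParser):
    """Accumulates term frequency, scaled by the emphasis of the enclosing element."""

    def __init__(self):
        super().__init__()
        self.stack = [1.0]       # weight in effect at the current nesting depth
        self.scores = Counter()

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        # Nested elements inherit the strongest weight seen so far.
        self.stack.append(max(self.stack[-1], style_weight(tag, attrs)))

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and len(self.stack) > 1:
            self.stack.pop()

    def handle_data(self, data):
        for term in re.findall(r"[a-z]+", data.lower()):
            if term not in STOPWORDS:
                self.scores[term] += self.stack[-1]


def extract_keywords(html, top_n=5):
    """Return the top_n terms ranked by style-weighted frequency."""
    parser = StyledTermCounter()
    parser.feed(html)
    parser.close()
    return [term for term, _ in parser.scores.most_common(top_n)]
```

For a page such as `<h1>Keyword extraction</h1><p>The style of a page and the page layout. <b>style</b></p>`, "style" and "page" both occur twice, but the bold occurrence lifts "style" above "page" in the ranking, which is the kind of effect visual style information is meant to contribute.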

Cite

Lučanský, M., & Šimko, M. (2013). Improving relevance of keyword extraction from the web utilizing visual style information. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7741 LNCS, pp. 445–456). https://doi.org/10.1007/978-3-642-35843-2_38
