Assessing the Quality of Web Content

  • Lex E
  • Khan I
  • Bischof H
  • et al.
ArXiv: 1406.3188
N/ACitations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

This paper describes our approach towards the ECML/PKDD Discovery Challenge 2010. The challenge consists of three tasks: (1) a Web genre and facet classification task for English hosts, (2) an English quality task, and (3) a multilingual quality task (German and French). In our approach, we create an ensemble of three classifiers to predict unseen Web hosts whereas each classifier is trained on a different feature set. Our final NDCG on the whole test set is 0:575 for Task 1, 0:852 for Task 2, and 0:81 (French) and 0:77 (German) for Task 3, which ranks second place in the ECML/PKDD Discovery Challenge 2010.

Cite

CITATION STYLE

APA

Lex, E., Khan, I., Bischof, H., & Granitzer, M. (2014). Assessing the Quality of Web Content. Retrieved from http://arxiv.org/abs/1406.3188

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free