A comparison of stylometric and lexical features for web genre classification and emotion classification in blogs

4Citations
Citations of this article
23Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In the blogosphere, the amount of digital content is expanding and for search engines, new challenges have been imposed. Due to the changing information need, automatic methods are needed to support blog search users to filter information by different facets. In our work, we aim to support blog search with genre and facet information. Since we focus on the news genre, our approach is to classify blogs into news versus rest. Also, we assess the emotionality facet in news related blogs to enable users to identify people's feelings towards specific events. Our approach is to evaluate the performance of text classifiers with lexical and stylometric features to determine the best performing combination for our tasks. Our experiments on a subset of the TREC Blogs08 dataset reveal that classifiers trained on lexical features perform consistently better than classifiers trained on the best stylometric features. © 2010 IEEE.

Cite

CITATION STYLE

APA

Lex, E., Juffinger, A., & Granitzer, M. (2010). A comparison of stylometric and lexical features for web genre classification and emotion classification in blogs. In Proceedings - 21st International Workshop on Database and Expert Systems Applications, DEXA 2010 (pp. 10–14). https://doi.org/10.1109/DEXA.2010.24

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free