Searching for ground truth: A stepping stone in automating genre classification

Yunhyong Kim; Seamus Ross

Conference Proceedings

Searching for ground truth: A stepping stone in automating genre classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4877 LNCS 248-261

DOI: 10.1007/978-3-540-77088-6_24

4Citations

10Readers

Get full text

Abstract

This paper examines genre classification of documents and its role in enabling the effective automated management of digital documents by digital libraries and other repositories. We have previously presented genre classification as a valuable step toward achieving automated extraction of descriptive metadata for digital material. Here, we present results from experiments using human labellers, conducted to assist in genre characterisation and the prediction of obstacles which need to be overcome by an automated system, and to contribute to the process of creating a solid testbed corpus for extending automated genre classification and testing metadata extraction tools across genres. We also describe the performance of two classifiers based on image and stylistic modeling features in labelling the data resulting from the agreement of three human labellers across fifteen genre classes. © Springer-Verlag Berlin Heidelberg 2007.

Author supplied keywords

Cite

CITATION STYLE

APA

Kim, Y., & Ross, S. (2007). Searching for ground truth: A stepping stone in automating genre classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4877 LNCS, pp. 248–261). Springer Verlag. https://doi.org/10.1007/978-3-540-77088-6_24

Searching for ground truth: A stepping stone in automating genre classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions