We focus on automatic coreference resolution for blogs and news articles with user comments as part of a project on opinion mining. We aim to study the effect of the genre shift from edited, structured newspaper text to unedited, unstructured blog data. We compare our coreference resolution system on three data sets: newspaper articles, mixed newspaper articles and reader comments, and blog data. As can be expected the performance of the automatic coreference resolution system drops drastically when tested on unedited text. We describe the characteristics of the different data sets and we examine the typical errors made by the resolution system. © 2009 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Hendrickx, I., & Hoste, V. (2009). Coreference resolution on blogs and commented news. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5847 LNAI, pp. 43–53). https://doi.org/10.1007/978-3-642-04975-0_4
Mendeley helps you to discover research relevant for your work.