All automated coreference resolution systems rely on a number of features, such as head noun, NP type, gender, or number. Although the choice of features is one of the key factors determining performance, it has received little attention, especially for languages other than English. This paper examines a large set of pairwise comparison features for coreference, both established and novel, with a special focus on Spanish. We consider the contribution of each feature as well as the interactions among them. In addition, given the problem of class imbalance in coreference resolution, we analyze the effect of sample selection. Experiments with TiMBL (Tilburg Memory-Based Learner) on the AnCora corpus yield conclusions from both linguistic and computational perspectives.
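To make the terms "pairwise comparison features" and "sample selection" concrete, here is a minimal Python sketch. It is illustrative only and is not the paper's feature set or selection method: the `Mention` class, the four features, the toy Spanish mentions, and the random downsampling of negative pairs are all assumptions chosen for brevity, and TiMBL itself is not invoked.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Mention:
    """Hypothetical mention record; the fields mirror the features named in the abstract."""
    head: str       # head noun (or pronoun form)
    np_type: str    # e.g. "definite", "pronoun"
    gender: str     # "fem" or "masc"
    number: str     # "sg" or "pl"
    entity_id: int  # gold coreference chain, used only to label pairs


def pair_features(antecedent, anaphor):
    """Compare two mentions and return a small pairwise feature dictionary."""
    return {
        "head_match": antecedent.head.lower() == anaphor.head.lower(),
        "gender_agree": antecedent.gender == anaphor.gender,
        "number_agree": antecedent.number == anaphor.number,
        "np_types": (antecedent.np_type, anaphor.np_type),
    }


def make_pairs(mentions):
    """Pair every mention with each preceding mention; label True if coreferent."""
    pairs = []
    for j in range(1, len(mentions)):
        for i in range(j):
            a, b = mentions[i], mentions[j]
            pairs.append((pair_features(a, b), a.entity_id == b.entity_id))
    return pairs


def downsample_negatives(pairs, ratio, seed=0):
    """Keep all positive pairs and at most `ratio` negatives per positive.

    Random downsampling is an assumed stand-in for sample selection in general.
    """
    rng = random.Random(seed)
    positives = [p for p in pairs if p[1]]
    negatives = [p for p in pairs if not p[1]]
    keep = min(len(negatives), int(ratio * len(positives)))
    return positives + rng.sample(negatives, keep)


# Toy Spanish example (invented data): "la presidenta ... el gobierno ... ella ... los ministros"
mentions = [
    Mention("presidenta", "definite", "fem", "sg", 1),
    Mention("gobierno", "definite", "masc", "sg", 2),
    Mention("ella", "pronoun", "fem", "sg", 1),
    Mention("ministros", "definite", "masc", "pl", 3),
]
pairs = make_pairs(mentions)
balanced = downsample_negatives(pairs, ratio=2)
```

Even in this four-mention toy, only one of the six candidate pairs is coreferent, which illustrates why class imbalance arises as soon as every mention is paired with all preceding ones.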
Recasens, M., & Hovy, E. (2009). Anaphora Processing and Applications. Anaphora Processing and Applications, 5847, 29–42. Retrieved from http://www.springerlink.com/index/10.1007/978-3-642-04975-0