Multilingual Extension of a Temporal Expression Normalizer using Annotated Corpora

6Citations
Citations of this article
67Readers
Mendeley users who have this article in their library.

Abstract

This paper presents the automatic extension to other languages of TERSEO, a knowledge-based system for the recognition and normalization of temporal expressions originally developed for Spanish1. TERSEO was first extended to English through the automatic translation of the temporal expressions. Then, an improved porting process was applied to Italian, where the automatic translation of the temporal expressions from English and from Spanish was combined with the extraction of new expressions from an Italian annotated corpus. Experimental results demonstrate how, while still adhering to the rule-based paradigm, the development of automatic rule translation procedures allowed us to minimize the effort required for porting to new languages. Relying on such procedures, and without any manual effort or previous knowledge of the target language, TERSEO recognizes and normalizes temporal expressions in Italian with good results (72% precision and 83% recall for recognition).

Cite

CITATION STYLE

APA

Saquete, E., Martínez-Barco, P., Muñoz, R., Negri, M., Speranza, M., & Sprugnoli, R. (2006). Multilingual Extension of a Temporal Expression Normalizer using Annotated Corpora. In Cross-Language Knowledge Induction Workshop - International Workshop held as part of EACL 2006: 11th Conference of the European Chapter of the Association for Computational Linguistics (pp. 1–8). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1608842.1608843

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free