The languages that are most commonly subject to linguistic annotation on a large scale tend to be those with the largest populations or with recent histories of linguistic scholarship. In this paper we discuss the problems associated with lower-density languages in the context of the development of linguistically annotated resources. We frame our work with three key questions regarding the definition of lower-density languages, increasing available resources, and reducing data requirements. A number of steps forward are identified for increasing the number of lower-density language corpora with linguistic annotations.
CITATION STYLE
Maxwell, M., & Hughes, B. (2006). Frontiers in Linguistic Annotation for Lower-Density Languages. In COLING ACL 2006 - Frontiers in Linguistically Annotated Corpora 2006, A Merged Workshop with 7th International Workshop on Linguistically Interpreted Corpora, LINC 2006 and Frontiers in Corpus Annotation III, Proceedings of the Workshop (pp. 29–37). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1641991.1641996