Tagging sentence boundaries

Abstract

In this paper we tackle sentence boundary disambiguation through a part-of-speech (POS) tagging framework. We describe the necessary changes in text tokenization and the implementation of a POS tagger, and we report the results of evaluating this system on two corpora. We also describe an extension of traditional POS tagging that combines it with the document-centered approach to proper name identification and abbreviation handling. This extension made the resulting system robust to domain and topic shifts.
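
To make the framing in the abstract concrete, the following is a minimal sketch of what treating a period as a token to be tagged could look like. It is not the paper's tagger: the tag names `SENT_END` and `ABBR`, the helper functions, and the toy decision rules are all assumptions made for illustration. The only ideas borrowed from the abstract are that tokenization keeps the period separate and that evidence from elsewhere in the same document (here, whether a word also occurs in lowercase) helps decide ambiguous cases.

```python
import re

# Hypothetical tag names for period tokens (assumed for this sketch).
SENT_END = "SENT_END"
ABBR = "ABBR"

def tokenize(text):
    """Split text into tokens, keeping every period as a separate token."""
    return re.findall(r"[A-Za-z0-9]+|\.|[^\sA-Za-z0-9.]", text)

def lowercase_vocabulary(tokens):
    """Collect the words that occur in lowercase somewhere in the document."""
    return {tok for tok in tokens if tok.islower()}

def tag_periods(tokens):
    """Return (index, tag) pairs for each period token, using toy rules."""
    seen_lower = lowercase_vocabulary(tokens)
    tagged = []
    for i, tok in enumerate(tokens):
        if tok != ".":
            continue
        nxt = tokens[i + 1] if i + 1 < len(tokens) else None
        if nxt is None:
            # A period at the very end of the text closes a sentence.
            tagged.append((i, SENT_END))
        elif nxt[0].islower():
            # A lowercase continuation suggests the period is not a boundary.
            tagged.append((i, ABBR))
        elif nxt.lower() in seen_lower:
            # The capitalized word is also used in lowercase in this document,
            # so it is probably a common word that starts a new sentence.
            tagged.append((i, SENT_END))
        else:
            # Otherwise assume a proper name following an abbreviation.
            tagged.append((i, ABBR))
    return tagged

text = "He met Dr. Smith today. The doctor said the test was fine."
tokens = tokenize(text)
print(tag_periods(tokens))
# prints [(3, 'ABBR'), (6, 'SENT_END'), (14, 'SENT_END')]
```

These rules fail on many real cases, for example a sentence that begins with a proper name. A trained tagger would replace the hand-written rules with learned decisions over the surrounding tags.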

Cite

Mikheev, A. (2000). Tagging sentence boundaries. In 1st Meeting of the North American Chapter of the Association for Computational Linguistics, NAACL 2000 - co-located with 6th Applied Natural Language Processing Conference, ANLP 2000 - Proceedings (pp. 264–271). Association for Computational Linguistics (ACL).
