New statistical methods for phrase break prediction

Helmut Schmid; Michaela Atterer

Conference Proceedings

New statistical methods for phrase break prediction

COLING 2004 - Proceedings of the 20th International Conference on Computational Linguistics (2004)

DOI: 10.3115/1220355.1220450

17Citations

80Readers

Get full text

Abstract

The paper presents two methods for the prediction of phrase breaks. The first method uses a standard HMM part-of-speech tagger with variable context length. The second method directly encodes the distance from the last phrase break in its states. It combines the probability of a phrase break given the distance from the last phrase break with the probability of a break given the local context consisting of the surrounding words and part of speech tags. The accuracy of the new tagger is 2 percentage points higher than that of Taylor and Black (1998) on similar data.

Cite

CITATION STYLE

APA

Schmid, H., & Atterer, M. (2004). New statistical methods for phrase break prediction. In COLING 2004 - Proceedings of the 20th International Conference on Computational Linguistics. Association for Computational Linguistics (ACL). https://doi.org/10.3115/1220355.1220450

New statistical methods for phrase break prediction

Abstract

Cite

Register to see more suggestions