Capitalization and punctuation restoration: a survey

18Citations
Citations of this article
56Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Ensuring proper punctuation and letter casing is a key pre-processing step towards applying complex natural language processing algorithms. This is especially significant for textual sources where punctuation and casing are missing, such as the raw output of automatic speech recognition systems. Additionally, short text messages and micro-blogging platforms offer unreliable and often wrong punctuation and casing. This survey offers an overview of both historical and state-of-the-art techniques for restoring punctuation and correcting word casing. Furthermore, current challenges and research directions are highlighted.

Cite

CITATION STYLE

APA

Păiş, V., & Tufiş, D. (2022). Capitalization and punctuation restoration: a survey. Artificial Intelligence Review, 55(3), 1681–1722. https://doi.org/10.1007/s10462-021-10051-x

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free