Translation from narrative text to standard codes variables with Stata

Federico Belotti; Domenico Depalo

Journal ArticleOPEN ACCESS

Translation from narrative text to standard codes variables with Stata

Stata Journal (2010) 10(3) 458-481

DOI: 10.1177/1536867x1001000310

6Citations

15Readers

Abstract

In this article, we describe screening, a new Stata command for data management that can be used to examine the content of complex narrative-text variables to identify one or more user-defined keywords. The command is useful when dealing with string data contaminated with abbreviations, typos, or mistakes. A rich set of options allows a direct translation from the original narrative string to a user-defined standard coding scheme. Moreover, screening is flexible enough to facilitate the merging of information from different sources and to extract or reorganize the content of string variables. Editors' note. This article refers to undocumented functions of Mata, meaning that there are no corresponding manual entries. Documentation for these functions is available only as help files; see help regex. © 2010 StataCorp LP.

Author supplied keywords

Cite

CITATION STYLE

APA

Belotti, F., & Depalo, D. (2010). Translation from narrative text to standard codes variables with Stata. Stata Journal, 10(3), 458–481. https://doi.org/10.1177/1536867x1001000310

Translation from narrative text to standard codes variables with Stata

Abstract

Author supplied keywords

Cite

Register to see more suggestions