In this article, we describe screening, a new Stata command for data management that can be used to examine the content of complex narrative-text variables to identify one or more user-defined keywords. The command is useful when dealing with string data contaminated with abbreviations, typos, or mistakes. A rich set of options allows a direct translation from the original narrative string to a user-defined standard coding scheme. Moreover, screening is flexible enough to facilitate the merging of information from different sources and to extract or reorganize the content of string variables. Editors' note. This article refers to undocumented functions of Mata, meaning that there are no corresponding manual entries. Documentation for these functions is available only as help files; see help regex. © 2010 StataCorp LP.
CITATION STYLE
Belotti, F., & Depalo, D. (2010). Translation from narrative text to standard codes variables with Stata. Stata Journal, 10(3), 458–481. https://doi.org/10.1177/1536867x1001000310
Mendeley helps you to discover research relevant for your work.