Abstract
Copyright and personal data protection are two of the most important legal aspects of collecting data for a learner corpus. The paper explains the challenges in data collection for the learner corpus of Latvian "LaVA" and describes the procedure undertaken to ensure protection of the texts' authors' rights. An agreement / metadata questionnaire form was created to inform the authors of the ways their texts are used and to receive the authors' permission to use them in the stated way. The information, permission, and the metadata questionnaire are printed on one side of an A4 size paper sheet, and the author is supposed to write the text on the other side by hand, thus eliminating the need to identify the author of the text separately. After scanning and adding to the corpus, the text originals are returned to the authors.
Cite
CITATION STYLE
Kaija, I., & Auzina, I. (2020). Data collection for learner corpus of Latvian: copyright and personal data protection. In Introduction (Vol. 172, pp. 41–47). Linköping University Electronic Press. https://doi.org/10.3384/ecp2020172006
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.