Abstract
Abstract: Despite the growing interest in the dynamics of the writing process in writing research, publicly available large-scale corpora of keystroke logs have been rare. We introduce KLiCKe, a freely available collection of keystroke logs for around 5,000 argumentative texts written by adults in the United States. The KLiCKe corpus also includes human-rated holistic scores for the essays as well as writers' demographic details, their typing skills, and vocabulary knowledge. We describe our methods for constructing the corpus and present descriptives for different components of the corpus. To illustrate the use of the KLiCKe corpus, we report a study using a subset of the corpus to investigate whether keystroke features are associated with holistic writing quality for L1 and L2 writers. The study shows that higher writing scores are related to shorter pauses in general, shorter between-word pauses, lower proportion of deletions, higher proportion of insertions, and less process variance. The KLiCKe corpus provides a robust resource for researchers to study the dynamics of text production and revision that will help spur the development of process-oriented tools and methodologies in writing assessment and instruction.
Author supplied keywords
Cite
CITATION STYLE
Tian, Y., Crossley, S., & Waes, L. V. (2025). The KLiCKe Corpus: Keystroke Logging in Compositions for Knowledge Evaluation. Journal of Writing Research, 17(1), 23–60. https://doi.org/10.17239/jowr-2025.17.01.02
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.