Data Organization in Spreadsheets

91Citations
Citations of this article
1.8kReaders
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Spreadsheets are widely used software tools for data entry, storage, analysis, and visualization. Focusing on the data entry and storage aspects, this article offers practical recommendations for organizing spreadsheet data to reduce errors and ease later analyses. The basic principles are: be consistent, write dates like YYYY-MM-DD, do not leave any cells empty, put just one thing in a cell, organize the data as a single rectangle (with subjects as rows and variables as columns, and with a single header row), create a data dictionary, do not include calculations in the raw data files, do not use font color or highlighting as data, choose good names for things, make backups, use data validation to avoid data entry errors, and save the data in plain text files.

Cite

CITATION STYLE

APA

Broman, K. W., & Woo, K. H. (2018). Data Organization in Spreadsheets. American Statistician, 72(1), 2–10. https://doi.org/10.1080/00031305.2017.1375989

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free