Integrating the use of large datasets into our teaching provides critical and unique opportunities to build students' skills and conceptual knowledge. Here, we discuss the core components needed to develop effective activities based on large datasets, which align with the 5E learning cycle. Data-based activities should be structured around a relevant question, use authentic publicly accessible data, be scaffolded to include choice, and involve discussion of the results. It is important that the software that is used to manipulate, analyze and/or visualize the data is accessible for students. There are a range of strategies to reduce the barriers of working with large datasets through pre-organizing and pre-scripting code for analyses, using online cloud-based versions of software, and reducing opportunities for error in syntax. Resources exist for learning open-source software (e.g., Data Carpentry) as well as for support and professional development in teaching with large datasets (Project EDDIE).
CITATION STYLE
O’Reilly, C. M., Josek, T., Darner, R. D., & Fortner, S. K. (2022). Pedagogy of teaching with large datasets: Designing and implementing effective data-based activities. Biochemistry and Molecular Biology Education, 50(5), 466–472. https://doi.org/10.1002/bmb.21663
Mendeley helps you to discover research relevant for your work.