OkCupid data for introductory statistics and data science courses

11Citations
Citations of this article
55Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

We present a data set consisting of user profile data for 59,946 San Francisco OkCupid users (a free online dating website) from June 2012. The data set includes typical user information, lifestyle variables, and text responses to 10 essay questions. We present four example analyses suitable for use in undergraduate introductory probability and statistics and data science courses that use R. The statistical and data science concepts covered include basic data visualization, exploratory data analysis, multivariate relationships, text analysis, and logistic regression for prediction.

Cite

CITATION STYLE

APA

Kim, A. Y., & Escobedo-Land, A. (2015). OkCupid data for introductory statistics and data science courses. Journal of Statistics Education, 23(2). https://doi.org/10.1080/10691898.2015.11889737

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free