Abstract
A barrier that prevents many social scientists from pursuing big data research is the lack of technical training required to assemble and organize big data. In an effort to address this barrier, we provide an introductory tutorial into machine learning for social scientists by demonstrating the basic steps and fundamental concepts involved in binary classification. We first describe the data and libraries required for analysis. We then demonstrate data cleaning methods, feature engineering, the model-building process, model assessment, and feature importance. Last, we discuss the ways in which social scientists can use machine learning to complement inference-based approaches and how it can contribute to a richer understanding of social science.
Cite
CITATION STYLE
Ta, V., Carrico, L., & Bousquet, A. (2021). Binary Classification: An Introductory Machine Learning Tutorial for Social Scientists. Journal of Methods and Measurement in the Social Sciences, 12(2). https://doi.org/10.2458/jmmss.5186
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.