Early identification of college dropouts can provide tremendous value for improving student success and institutional effectiveness, and predictive analytics are increasingly used for this purpose. However, ethical concerns have emerged about whether including protected attributes in these prediction models discriminates against underrepresented student groups and exacerbates existing inequities. We examine this issue in the context of a large U.S. research university with both residential and fully online degree-seeking students. Based on comprehensive institutional records for the entire student population across multiple years (N = 93,457), we build machine learning models to predict student dropout after one academic year of study and compare the overall performance and fairness of model predictions with and without four protected attributes (gender, underrepresented minority status, first-generation student status, and high financial need). We find that including protected attributes does not affect overall prediction performance and only marginally improves the algorithmic fairness of predictions. These findings suggest that including protected attributes is preferable. We offer guidance on how to evaluate the impact of including protected attributes in a local context, where institutional stakeholders seek to leverage predictive analytics to support student success.
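The comparison described above — training a dropout-prediction model with and without protected attributes, then checking both overall performance and group fairness — can be sketched as follows. This is a minimal illustration on synthetic data, not the authors' code: the features (GPA, credits), the single binary protected attribute, and the choice of true-positive-rate gap as the fairness metric are all assumptions for the example.

```python
# Hedged sketch: compare a dropout model trained with vs. without a
# protected attribute, on synthetic data, using scikit-learn.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 5000

# Illustrative academic features and one binary protected attribute.
gpa = rng.normal(3.0, 0.5, n)
credits = rng.normal(12, 3, n)
protected = rng.integers(0, 2, n)

# Synthetic dropout outcome driven mainly by academic features.
logit = -2.0 - 1.5 * (gpa - 3.0) - 0.1 * (credits - 12) + 0.2 * protected
y = (rng.random(n) < 1 / (1 + np.exp(-logit))).astype(int)

X_without = np.column_stack([gpa, credits])
X_with = np.column_stack([gpa, credits, protected])

def evaluate(X):
    """Return (AUC, TPR gap between protected groups) on a held-out set."""
    X_tr, X_te, y_tr, y_te, g_tr, g_te = train_test_split(
        X, y, protected, test_size=0.3, random_state=0)
    model = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
    scores = model.predict_proba(X_te)[:, 1]
    auc = roc_auc_score(y_te, scores)
    pred = (scores >= 0.5).astype(int)
    # True-positive-rate gap across groups: one common fairness metric.
    tprs = []
    for g in (0, 1):
        mask = (g_te == g) & (y_te == 1)
        tprs.append(pred[mask].mean() if mask.any() else 0.0)
    return auc, abs(tprs[0] - tprs[1])

auc_wo, gap_wo = evaluate(X_without)
auc_w, gap_w = evaluate(X_with)
print(f"without protected attribute: AUC={auc_wo:.3f}, TPR gap={gap_wo:.3f}")
print(f"with protected attribute:    AUC={auc_w:.3f}, TPR gap={gap_w:.3f}")
```

In a real institutional analysis, each of the four protected attributes would be evaluated separately, and multiple fairness metrics (e.g., demographic parity, equalized odds) would typically be reported alongside AUC.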
Yu, R., Lee, H., & Kizilcec, R. F. (2021). Should College Dropout Prediction Models Include Protected Attributes? In L@S 2021 - Proceedings of the 8th ACM Conference on Learning @ Scale (pp. 91–100). Association for Computing Machinery, Inc. https://doi.org/10.1145/3430895.3460139