Proposing Machine Learning Models Suitable for Predicting Open Data Utilization

Junyoung Jeong; Keuntae Cho

Journal ArticleOPEN ACCESS

Proposing Machine Learning Models Suitable for Predicting Open Data Utilization

Sustainability (Switzerland) (2024) 16(14)

DOI: 10.3390/su16145880

5Citations

38Readers

Get full text

Abstract

As the digital transformation accelerates in our society, open data are being increasingly recognized as a key resource for digital innovation in the public sector. This study explores the following two research questions: (1) Can a machine learning approach be appropriately used for measuring and evaluating open data utilization? (2) Should different machine learning models be applied for measuring open data utilization depending on open data attributes (field and usage type)? This study used single-model (random forest, XGBoost, LightGBM, CatBoost) and multi-model (stacking ensemble) machine learning methods. A key finding is that the best-performing models differed depending on open data attributes (field and type of use). The applicability of the machine learning approach for measuring and evaluating open data utilization in advance was also confirmed. This study contributes to open data utilization and to the application of its intrinsic value to society.

Author supplied keywords

Cite

CITATION STYLE

APA

Jeong, J., & Cho, K. (2024). Proposing Machine Learning Models Suitable for Predicting Open Data Utilization. Sustainability (Switzerland), 16(14). https://doi.org/10.3390/su16145880

Proposing Machine Learning Models Suitable for Predicting Open Data Utilization

Abstract

Author supplied keywords

Cite

Register to see more suggestions