A Differential Privacy Budget Allocation Algorithm Based on Out-of-Bag Estimation in Random Forest

Xin Li; Baodong Qin; Yiyuan Luo; Dong Zheng

Journal ArticleOPEN ACCESS

A Differential Privacy Budget Allocation Algorithm Based on Out-of-Bag Estimation in Random Forest

Mathematics (2022) 10(22)

DOI: 10.3390/math10224338

7Citations

11Readers

Abstract

The issue of how to improve the usability of data publishing under differential privacy has become one of the top questions in the field of machine learning privacy protection, and the key to solving this problem is to allocate a reasonable privacy protection budget. To solve this problem, we design a privacy budget allocation algorithm based on out-of-bag estimation in random forest. The algorithm firstly calculates the decision tree weights and feature weights by the out-of-bag data under differential privacy protection. Secondly, statistical methods are introduced to classify features into best feature set, pruned feature set, and removable feature set. Then, pruning is performed using the pruned feature set to avoid decision trees over-fitting when constructing an (Formula presented.) -differential privacy random forest. Finally, the privacy budget is allocated proportionally based on the decision tree weights and feature weights in the random forest. We conducted experimental comparisons with real data sets from Adult and Mushroom to demonstrate that this algorithm not only protects data security and privacy, but also improves model classification accuracy and data availability.

Author supplied keywords

Cite

CITATION STYLE

APA

Li, X., Qin, B., Luo, Y., & Zheng, D. (2022). A Differential Privacy Budget Allocation Algorithm Based on Out-of-Bag Estimation in Random Forest. Mathematics, 10(22). https://doi.org/10.3390/math10224338

A Differential Privacy Budget Allocation Algorithm Based on Out-of-Bag Estimation in Random Forest

Abstract

Author supplied keywords

Cite

Register to see more suggestions