Tabular datasets can be viewed as logic functions that can be simplified using two-level logic minimization to produce minimal logic formulas in disjunctive normal form, which in turn can be readily viewed as an explainable decision rule set for binary classification. However, there are two problems with using logic minimization for tabular machine learning. First, tabular datasets often contain overlapping examples with different class labels, which must be resolved before logic minimization can be applied, since logic minimization assumes a consistent logic function. Second, even without inconsistencies, logic minimization alone generally produces complex models with poor generalization because it exactly fits all data points, leading to detrimental overfitting. How best to remove training instances to eliminate inconsistencies and overfitting is highly nontrivial. In this article, we propose a novel statistical framework for removing these training samples so that logic minimization can become an effective approach to tabular machine learning. Using the proposed approach, we obtain performance comparable to gradient-boosted and ensemble decision trees, which have been the winning hypothesis classes in tabular learning competitions, while providing human-understandable explanations in the form of decision rules. To the best of the authors' knowledge, neither logic minimization nor explainable decision rule methods have previously achieved state-of-the-art performance on tabular learning problems.
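To make the underlying idea concrete, the sketch below (not the paper's statistical framework, just a minimal illustration of the first sentence) treats a tiny, already-consistent binary dataset as an incompletely specified Boolean function and runs off-the-shelf two-level minimization (SymPy's Quine-McCluskey-based SOPform) to obtain a DNF formula that reads as a decision rule set. The feature names and rows are hypothetical.

```python
# Minimal sketch, assuming a consistent binary dataset with three features.
# Positive rows become minterms; unobserved feature combinations are treated
# as don't-cares that the minimizer may use to simplify the formula.
from sympy import symbols
from sympy.logic import SOPform

a, b, c = symbols('a b c')  # hypothetical binary features

# Hypothetical training rows: (feature vector, binary class label).
rows = [
    ((0, 0, 1), 1),
    ((0, 1, 1), 1),
    ((1, 1, 1), 1),
    ((1, 0, 0), 0),
    ((0, 1, 0), 0),
]

# Minterms of the target function are the positively labeled rows.
minterms = [list(x) for x, y in rows if y == 1]

# Feature combinations never seen in training are don't-cares.
seen = {x for x, _ in rows}
dontcares = [
    [i, j, k]
    for i in (0, 1) for j in (0, 1) for k in (0, 1)
    if (i, j, k) not in seen
]

# Two-level minimization yields a minimal DNF formula; each product term
# can be read directly as a decision rule for predicting class 1.
rules = SOPform([a, b, c], minterms, dontcares)
print(rules)  # for these rows: c  ->  "predict class 1 when feature c is 1"
```

On real data this direct translation runs into exactly the two problems the abstract names: contradictory rows make the function inconsistent, and exact fitting of every remaining row yields complex, overfit rule sets; the proposed framework decides which training samples to remove before minimization.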