Abstract
Deep learning models such as CNNs have surpassed human performance in computer vision tasks such as image classification. However, despite their sophistication, these models lack interpretability which can lead to biased outcomes reflecting existing prejudices in the data. We aim to make predictions made by a CNN interpretable. Hence, we present a novel framework called NeSyFOLD to create a neurosymbolic (NeSy) model for image classification tasks. The model is a CNN with all layers following the last convolutional layer replaced by a stratified answer set program (ASP) derived from the last layer kernels. The answer set program can be viewed as a rule-set, wherein the truth value of each predicate depends on the activation of the corresponding kernel in the CNN. The rule-set serves as a global explanation for the model and is interpretable. We also use our NeSyFOLD framework with a CNN that is trained using a sparse kernel learning technique called Elite BackProp (EBP). This leads to a significant reduction in rule-set size without compromising accuracy or fidelity thus improving scalability of the NeSy model and interpretability of its rule-set. Evaluation is done on datasets with varied complexity and sizes. We also propose a novel algorithm for labeling the predicates in the ruleset with meaningful semantic concept (s) learnt by the CNN. We evaluate the performance of our "semantic labeling algorithm"to quantify the efficacy of the semantic labeling for both the NeSy model and the NeSy-EBP model.
Cite
CITATION STYLE
Padalkar, P., Wang, H., & Gupta, G. (2024). NeSyFOLD: A Framework for Interpretable Image Classification. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 38, pp. 4378–4387). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v38i5.28235
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.