Exact and Approximate Rule Extraction from Neural Networks with Boolean Features

1Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Rule extraction from classifiers treated as black boxes is an important topic in explainable artificial intelligence (XAI). It is concerned with finding rules that describe classifiers and that are understandable to humans, having the form of (I f…T hen…Else). Neural network classifiers are one type of classifier where it is difficult to know how the inputs map to the decision. This paper presents a technique to extract rules from a neural network where the feature space is Boolean, without looking at the inner structure of the network. For such a network with a small feature space, a Boolean function describing it can be directly calculated, whilst for a network with a larger feature space, a sampling method is described to produce rule-based approximations to the behaviour of the network with varying granularity, leading to XAI. The technique is experimentally assessed on a dataset of cross-site scripting (XSS) attacks, and proves to give very high accuracy and precision, comparable to that given by the neural network being approximated.

Cite

CITATION STYLE

APA

Mereani, F. A., & Howe, J. M. (2019). Exact and Approximate Rule Extraction from Neural Networks with Boolean Features. In International Joint Conference on Computational Intelligence (Vol. 1, pp. 424–433). Science and Technology Publications, Lda. https://doi.org/10.5220/0008362904240433

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free