A coverage-based utility model for identifying unknown unknowns

Gagan Bansal; Daniel S. Weld

Conference ProceedingsOPEN ACCESS

A coverage-based utility model for identifying unknown unknowns

32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (2018) 1463-1470

DOI: 10.1609/aaai.v32i1.11493

24Citations

53Readers

Abstract

A classifier's low confidence in prediction is often indicative of whether its prediction will be wrong; in this case, inputs are called known unknowns. In contrast, unknown unknowns (UUs) are inputs on which a classifier makes a high confidence mistake. Identifying UUs is especially important in safety-critical domains like medicine (diagnosis) and law (recidivism prediction). Previous work by Lakkaraju et al. (2017) on identifying unknown unknowns assumes that the utility of each revealed UU is independent of the others, rather than considering the set holistically. While this assumption yields an efficient discovery algorithm, we argue that it produces an incomplete understanding of the classifier's limitations. In response, this paper proposes a new class of utility models that rewards how well the discovered UUs cover (or “explain”) a sample distribution of expected queries. Although choosing an optimal cover is intractable, even if the UUs were known, our utility model is monotone submodular, affording a greedy discovery strategy. Experimental results on four datasets show that our method outperforms bandit-based approaches and achieves within 60.9% utility of an omniscient, tractable upper bound.

Cite

CITATION STYLE

APA

Bansal, G., & Weld, D. S. (2018). A coverage-based utility model for identifying unknown unknowns. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 1463–1470). AAAI press. https://doi.org/10.1609/aaai.v32i1.11493

A coverage-based utility model for identifying unknown unknowns

Abstract

Cite

Register to see more suggestions