Comparing the Interpretability of Deep Networks via Network Dissection

Bolei Zhou; David Bau; Aude Oliva; Antonio Torralba

Book Chapter

Comparing the Interpretability of Deep Networks via Network Dissection

Springer Verlag, (2019), 243-252

DOI: 10.1007/978-3-030-28954-6_12

16Citations

17Readers

Get full text

Abstract

In this chapter, we introduce Network Dissection (The complete paper and code are available at http://netdissect.csail.mit.edu ), a general framework to quantify the interpretability of the units inside a deep convolutional neural networks (CNNs). We compare the different vocabularies of interpretable units as concept detectors emerged from the networks trained to solve different supervised learning tasks such as object recognition on ImageNet and scene classification on Places. The network dissection is further applied to analyze how the units acting as semantic detectors grow and evolve over the training iterations both in the scenario of the train-from-scratch and in the stage of the fine-tuning between data sources. Our results highlight that interpretability is an important property of deep neural networks that provides new insights into their hierarchical structure.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhou, B., Bau, D., Oliva, A., & Torralba, A. (2019). Comparing the Interpretability of Deep Networks via Network Dissection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11700 LNCS, pp. 243–252). Springer Verlag. https://doi.org/10.1007/978-3-030-28954-6_12

Comparing the Interpretability of Deep Networks via Network Dissection

Abstract

Author supplied keywords

Cite

Register to see more suggestions