ConceptX: A Framework for Latent Concept Analysis

3Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

The opacity of deep neural networks remains a challenge in deploying solutions where explanation is as important as precision. We present ConceptX, a human-in-the-loop framework for interpreting and annotating latent representational space in pre-trained Language Models (pLMs). We use an unsupervised method to discover concepts learned in these models and enable a graphical interface for humans to generate explanations for the concepts. To facilitate the process, we provide auto-annotations of the concepts (based on traditional linguistic ontologies). Such annotations enable development of a linguistic resource that directly represents latent concepts learned within deep NLP models. These include not just traditional linguistic concepts, but also task-specific or sensitive concepts (words grouped based on gender or religious connotation) that helps the annotators to mark bias in the model. The framework consists of two parts (i) concept discovery and (ii) annotation platform.

References Powered by Scopus

Agglomerative clustering using the concept of mutual nearest neighbourhood

336Citations
N/AReaders
Get full text

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

326Citations
N/AReaders
Get full text

What do neural machine translation models learn about morphology?

233Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Alam, F., Dalvi, F., Durrani, N., Sajjad, H., Khan, A. R., & Xu, J. (2023). ConceptX: A Framework for Latent Concept Analysis. In Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023 (Vol. 37, pp. 16395–16397). AAAI Press. https://doi.org/10.1609/aaai.v37i13.27057

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 3

60%

Researcher 2

40%

Readers' Discipline

Tooltip

Computer Science 4

80%

Physics and Astronomy 1

20%

Save time finding and organizing research with Mendeley

Sign up for free