Vector-quantization-based topic modeling

7Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.

Abstract

With the purpose of learning and utilizing explicit and dense topic embeddings, we propose three variations of novel vector-quantization-based topic models (VQ-TMs): (1) Hard VQ-TM, (2) Soft VQ-TM, and (3) Multi-View Soft VQ-TM. The model family capitalize on vector quantization techniques, embedded input documents, and viewing words as mixtures of topics. Guided by a comprehensive set of evaluation metrics, we conduct systematic quantitative and qualitative empirical studies, and demonstrate the superior performance of VQ-TMs compared to important baseline models. Through a unique case study on code generation from natural language descriptions, we further illustrate the power of VQ-TMs in downstream tasks.

Cite

CITATION STYLE

APA

Gupta, A., & Zhang, Z. (2021). Vector-quantization-based topic modeling. ACM Transactions on Intelligent Systems and Technology, 12(3). https://doi.org/10.1145/3450946

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free