Selection of the optimal number of topics for LDA topic model—Taking patent policy analysis as an example

60Citations
Citations of this article
82Readers
Mendeley users who have this article in their library.

Abstract

This study constructs a comprehensive index to effectively judge the optimal number of topics in the LDA topic model. Based on the requirements for selecting the number of topics, a comprehensive judgment index of perplexity, isolation, stability, and coincidence is constructed to select the number of topics. This method provides four advantages to selecting the optimal number of topics: (1) good predictive ability, (2) high isolation between topics, (3) no duplicate topics, and (4) repeatability. First, we use three general datasets to compare our proposed method with existing methods, and the results show that the optimal topic number selection method has better selection results. Then, we collected the patent policies of various provinces and cities in China (excluding Hong Kong, Macao, and Taiwan) as datasets. By using the optimal topic number selection method proposed in this study, we can classify patent policies well.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Gan, J., & Qi, Y. (2021). Selection of the optimal number of topics for LDA topic model—Taking patent policy analysis as an example. Entropy, 23(10). https://doi.org/10.3390/e23101301

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 16

50%

Professor / Associate Prof. 7

22%

Researcher 5

16%

Lecturer / Post doc 4

13%

Readers' Discipline

Tooltip

Computer Science 17

55%

Business, Management and Accounting 7

23%

Engineering 5

16%

Social Sciences 2

6%

Save time finding and organizing research with Mendeley

Sign up for free