Machine reading tea leaves: Automatically evaluating topic coherence and topic model quality

488Citations
Citations of this article
437Readers
Mendeley users who have this article in their library.

Abstract

Topic models based on latent Dirichlet allocation and related methods are used in a range of user-focused tasks including document navigation and trend analysis, but evaluation of the intrinsic quality of the topic model and topics remains an open research area. In this work, we explore the two tasks of automatic evaluation of single topics and automatic evaluation of whole topic models, and provide recommendations on the best strategy for performing the two tasks, in addition to providing an open-source toolkit for topic and topic model evaluation. © 2014 Association for Computational Linguistics.

References Powered by Scopus

Probabilistic latent semantic indexing

4296Citations
N/AReaders
Get full text

Training linear SVMs in linear time

1551Citations
N/AReaders
Get full text

Exploring content models for multi-document summarization

429Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Exploring the space of topic coherence measures

1509Citations
N/AReaders
Get full text

Topic modeling in embedding spaces

503Citations
N/AReaders
Get full text

An analysis of the coherence of descriptors in topic modeling

266Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Lau, J. H., Newman, D., & Baldwin, T. (2014). Machine reading tea leaves: Automatically evaluating topic coherence and topic model quality. In 14th Conference of the European Chapter of the Association for Computational Linguistics 2014, EACL 2014 (pp. 530–539). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/e14-1056

Readers over time

‘14‘15‘16‘17‘18‘19‘20‘21‘22‘23‘24‘25015304560

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 189

72%

Researcher 49

19%

Lecturer / Post doc 18

7%

Professor / Associate Prof. 8

3%

Readers' Discipline

Tooltip

Computer Science 204

78%

Business, Management and Accounting 22

8%

Engineering 19

7%

Social Sciences 18

7%

Article Metrics

Tooltip
Mentions
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free
0