A study on text modelling via Dirichlet compound multinomial

Concetto Elvio Bonafede; Paola Cerchiello

Conference Proceedings

A study on text modelling via Dirichlet compound multinomial

Studies in Classification, Data Analysis, and Knowledge Organization (2011) 115-123

DOI: 10.1007/978-3-642-13312-1_11

1Citations

4Readers

Get full text

Abstract

This contributions deals with a generative approach for the analysis of textual data. Instead of creating heuristic rules for the representation of documents and word counts, we employ a distribution able to model words along text considering different topics. In this regard, following Minka proposal [5], we implement a Dirichlet compound Multinomial distribution that is a mixture of random variables over words and topics. On the basis of this model we evaluate the predictive performance of the distribution by using seven different classifiers and taking into account the count of words in common between text document and reference class. © Springer-Verlag Berlin Heidelberg 2011.

Cite

CITATION STYLE

APA

Bonafede, C. E., & Cerchiello, P. (2011). A study on text modelling via Dirichlet compound multinomial. In Studies in Classification, Data Analysis, and Knowledge Organization (pp. 115–123). Kluwer Academic Publishers. https://doi.org/10.1007/978-3-642-13312-1_11

A study on text modelling via Dirichlet compound multinomial

Abstract

Cite

Register to see more suggestions