The paper describes the results of experiments on the development of a statistical model of the Russian text corpus on musicology. We construct a topic model based on Latent Dirichlet Allocation and process corpus data with the help of the GenSim statistical toolkit. Results achieved in course of experiments allow us to distinguish general and special topics which describe conceptual structure of the corpus in question and to analyze paradigmatic and syntagmatic relations between lemmata within topics.
CITATION STYLE
Mitrofanova, O. (2015). Probabilistic topic modeling of the russian text corpus on musicology. In Communications in Computer and Information Science (Vol. 561, pp. 69–76). Springer Verlag. https://doi.org/10.1007/978-3-319-27498-0_6
Mendeley helps you to discover research relevant for your work.