The chapter introduces a representation of a textual event as a mixture of semantic stereotypes and factual information. We also present a method to distinguish semantic prototypes that are specific for a given event from generic elements that might provide cause and result information. Moreover, this chapter discusses the results of experiments of unsupervised topic extraction performed on documents from a large-scale corpus with an additional temporal structure. These experiments were realized as a comparison of the nature of information provided by Latent Dirichlet Allocation based on Log-Entropy weights and Vector Space modelling. The impact of different corpus time windows on this information is discussed. Finally, we try to answer if the unsupervised topic modelling may reflect deeper semantic information, such as elements describing given event or its causes and results, and discern it from factual data.
CITATION STYLE
Korzycki, M., & Korczyński, W. (2015). Does topic modelling reflect semantic prototypes? Advances in Intelligent Systems and Computing, 314, 113–122. https://doi.org/10.1007/978-3-319-10383-9_11
Mendeley helps you to discover research relevant for your work.