Abstract
Topics in 0-1 datasets are sets of variables whose occurrences are positively connected. Earlier, we described a simple generative topic model. In this paper we show that, given data produced by this model, the lift statistics of attributes can be described in matrix form. We use this result to obtain a simple algorithm for finding topics in 0-1 data. We also show that a problem related to the identification of topics is NP-hard. We give experimental results on the topic identification problem, on both generated and real data.
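To make the notion of lift statistics concrete, the following minimal sketch computes an empirical pairwise lift matrix for a small 0-1 dataset. It assumes the standard definition lift(a, b) = P(a=1, b=1) / (P(a=1) P(b=1)), with probabilities estimated as frequencies. The function name `pairwise_lift` and the toy data are illustrative assumptions. This is not the topic identification algorithm of the paper, only the statistic the abstract refers to.

```python
def pairwise_lift(rows):
    """Return the empirical lift matrix for a list of equal-length 0-1 rows.

    Assumed definition: lift(a, b) = P(a=1, b=1) / (P(a=1) * P(b=1)),
    with probabilities estimated as relative frequencies over the rows.
    An entry is None when either attribute never occurs.
    """
    n = len(rows)
    m = len(rows[0])
    # Marginal frequency of each attribute (column).
    freq = [sum(r[j] for r in rows) / n for j in range(m)]
    lift = [[None] * m for _ in range(m)]
    for a in range(m):
        for b in range(m):
            if freq[a] == 0 or freq[b] == 0:
                continue  # lift is undefined for an attribute that never occurs
            # Joint frequency of both attributes being 1 in the same row.
            joint = sum(r[a] * r[b] for r in rows) / n
            lift[a][b] = joint / (freq[a] * freq[b])
    return lift


# Hypothetical toy data: attributes 0 and 1 co-occur, as do attributes 2 and 3.
data = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
lift = pairwise_lift(data)
```

In this toy example, attribute pairs that always co-occur have lift above 1, while pairs that never co-occur have lift 0, which is the kind of block structure a topic would be expected to produce.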
Cite
Seppänen, J. K., Bingham, E., & Mannila, H. (2003). A simple algorithm for topic identification in 0-1 data. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 2838, pp. 423–434). Springer Verlag. https://doi.org/10.1007/978-3-540-39804-2_38