In this article, we study the scale-dependent dimensionality properties and overall structure of text data with a method that measures correlation dimension in different scales. As experimental results, we present the analysis of text data sets with the Reuters and Europarl corpora, which are also compared to artificially generated point sets. A comparison is also made with speech data. The results reflect some of the typical properties of the data and the use of our method in improving various data analysis applications is discussed. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Kivimäki, I., Lagus, K., Nieminen, I. T., Väyrynen, J. J., & Honkela, T. (2010). Using correlation dimension for analysing text data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6352 LNCS, pp. 368–373). https://doi.org/10.1007/978-3-642-15819-3_49
Mendeley helps you to discover research relevant for your work.