Multivariate stream data classification using simple text classifiers

Sungbo Seo; Jaewoo Kang; Dongwon Lee; Keun Ho Ryu

Conference Proceedings

Multivariate stream data classification using simple text classifiers

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4080 LNCS 420-429

DOI: 10.1007/11827405_41

1Citations

13Readers

Get full text

Abstract

We introduce a classification framework for continuous multivariate stream data. The proposed approach works in two steps. In the preprocessing step, it takes as input a sliding window of multivariate stream data and discretizes the data in the window into a string of symbols that characterize the signal changes. In the classification step, it uses a simple text classification algorithm to classify the discretized data in the window. We evaluated both supervised and unsupervised classification algorithms. For supervised, we tested Naïve Bayes Model and SVM, and for unsupervised, we tested Jaccard, TFIDF, Jaro and Jaro Winkler. In our experiments, SVM and TFIDF outperformed the other classification methods. In particular, we observed that classification accuracy is improved when the correlation of attributes is also considered along with the n-gram tokens of symbols. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Seo, S., Kang, J., Lee, D., & Ryu, K. H. (2006). Multivariate stream data classification using simple text classifiers. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4080 LNCS, pp. 420–429). Springer Verlag. https://doi.org/10.1007/11827405_41

Multivariate stream data classification using simple text classifiers

Abstract

Cite

Register to see more suggestions