Constructing feature set by using temporal clustering of term usages in document categorization

Abstract

To discover chances in documents with temporal context, it is important to handle their contents, represented as words and phrases called keywords. Conventional methods, however, select keywords by their frequency and/or an importance index such as tf-idf computed over the whole observed period. In this chapter, we describe a method for characterizing a large number of documents that takes the temporal features of the terms appearing in them into account. The method characterizes each document by the temporal patterns of an importance index, which capture temporal differences in term usage, and obtains document clusters from the similarities between documents characterized in this way. As an experiment, we clustered four sets of bibliographical documents using two feature sets: the appearances of popular feature terms and the appearances of temporal patterns in each document. We then compared the time dependencies of the two clustering results. Our feature construction method succeeded in representing the time differences among the documents using features based on temporal patterns.
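The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that the importance index is a tf-idf value summed per period (with idf taken over the whole collection), and it replaces the temporal clustering step with a much cruder stand-in: each term is labeled by the sign of its linear trend. All function names, the pattern labels, and the toy data are hypothetical.

```python
import math
from collections import Counter

# Hypothetical pattern labels; the chapter's actual temporal clusters may differ.
PATTERNS = ("rising", "falling", "flat")


def period_index(docs_by_period):
    """Return {term: [summed tf-idf per period]}.

    docs_by_period is a list of periods, each a list of tokenized documents.
    Assumption: idf is computed once over the whole collection.
    """
    all_docs = [doc for period in docs_by_period for doc in period]
    df = Counter(term for doc in all_docs for term in set(doc))
    # The +1 keeps terms that occur in every document from being zeroed out.
    idf = {t: math.log(len(all_docs) / df[t]) + 1.0 for t in df}

    series = {t: [0.0] * len(docs_by_period) for t in df}
    for p, period in enumerate(docs_by_period):
        for doc in period:
            for term, count in Counter(doc).items():
                series[term][p] += (count / len(doc)) * idf[term]
    return series


def trend_label(values, eps=1e-9):
    """Label a series by the sign of its least-squares slope.

    Stand-in for temporal clustering: only the slope's sign is used.
    """
    n = len(values)
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    # The slope's denominator is positive, so the numerator decides the sign.
    num = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(values))
    if num > eps:
        return "rising"
    if num < -eps:
        return "falling"
    return "flat"


def pattern_features(doc, term_pattern):
    """Count how many tokens of a document fall into each temporal pattern."""
    counts = Counter(term_pattern[t] for t in doc if t in term_pattern)
    return [counts[p] for p in PATTERNS]


# Toy data: three periods with two tokenized documents each.
docs_by_period = [
    [["rule", "mining", "rule"], ["rule", "graph"]],
    [["rule", "graph", "kernel"], ["graph", "kernel"]],
    [["kernel", "graph"], ["kernel", "deep"]],
]

term_pattern = {t: trend_label(v) for t, v in period_index(docs_by_period).items()}

for period in docs_by_period:
    for doc in period:
        print(doc, pattern_features(doc, term_pattern))
```

The resulting vectors have one dimension per temporal pattern rather than one per term, so any standard clustering algorithm applied to them groups documents by when their vocabulary was prominent rather than by which words they share.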

Cite

Abe, H., & Tsumoto, S. (2013). Constructing feature set by using temporal clustering of term usages in document categorization. In Studies in Computational Intelligence (Vol. 423, pp. 215–229). Springer Verlag. https://doi.org/10.1007/978-3-642-30114-8_14
