Improved Keyword Extraction by Separation into Multiple Document Sets According to Time Series

Ryosuke Saga; Hiroshi Tsuji

Conference ProceedingsOPEN ACCESS

Improved Keyword Extraction by Separation into Multiple Document Sets According to Time Series

Communications in Computer and Information Science (2013) 374(PART II) 450-453

DOI: 10.1007/978-3-642-39476-8_91

4Citations

1Readers

Abstract

This study proposes a method of extracting keywords including those that appear locally. Useful keyword extraction methods are available for text mining, such as TF-IDF and support vector machine. However, when keywords are extracted on the basis of time series, the local keywords are not often extracted. We propose a method of extracting the local keywords by separating a document set, which we call the document separation approach. The approach splits a document set into multiple sets according to time series, extracts the keywords for each document set, and integrates them. Using 1812 newspaper articles, we experimentally demonstrate that we can extract the local feature keywords using the document separation approach. © Springer-Verlag Berlin Heidelberg 2013.

Author supplied keywords

Cite

CITATION STYLE

APA

Saga, R., & Tsuji, H. (2013). Improved Keyword Extraction by Separation into Multiple Document Sets According to Time Series. In Communications in Computer and Information Science (Vol. 374, pp. 450–453). Springer Verlag. https://doi.org/10.1007/978-3-642-39476-8_91

Improved Keyword Extraction by Separation into Multiple Document Sets According to Time Series

Abstract

Author supplied keywords

Cite

Register to see more suggestions