In this paper we propose including collocations in the text-mining preprocessing step that we use for fast news article recommendation, and we report experiments on real data from the biggest Slovak newspaper. The section of a news article can be predicted from several of its characteristics, such as its title, content, and keywords. Our experiments compare several approaches and algorithms, including an expressive vector representation that takes into account the most popular word collocations obtained from the Slovak National Corpus.
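To make the idea of a vector representation that includes collocations concrete, the following is a minimal sketch, not the authors' implementation. It assumes a tiny hypothetical collocation list (the paper takes its collocations from the Slovak National Corpus), merges each matching word pair into a single token before counting, and predicts a section with a simple nearest-neighbour cosine comparison, which stands in for whichever classifiers the paper actually evaluates. The names `COLLOCATIONS`, `tokenize`, `vectorize`, `cosine`, and `predict_section` are illustrative only.

```python
import math
from collections import Counter

# Hypothetical collocation list for illustration; the paper obtains its
# collocations from the Slovak National Corpus.
COLLOCATIONS = {("prime", "minister"), ("ice", "hockey")}


def tokenize(text, collocations=COLLOCATIONS):
    """Lowercase, split on whitespace, and merge known word pairs into one token."""
    words = text.lower().split()
    tokens = []
    i = 0
    while i < len(words):
        if i + 1 < len(words) and (words[i], words[i + 1]) in collocations:
            tokens.append(words[i] + "_" + words[i + 1])
            i += 2  # skip both words of the collocation
        else:
            tokens.append(words[i])
            i += 1
    return tokens


def vectorize(text):
    """Sparse term-count vector in which a collocation is a single dimension."""
    return Counter(tokenize(text))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def predict_section(text, labelled_articles):
    """Return the section of the most similar labelled article.

    A nearest-neighbour stand-in, assumed here only for illustration.
    """
    vec = vectorize(text)
    best = max(labelled_articles, key=lambda item: cosine(vec, vectorize(item[0])))
    return best[1]


# Toy labelled data (invented for this example, not taken from the paper).
labelled = [
    ("the prime minister met parliament", "politics"),
    ("ice hockey team won the match", "sport"),
]
section = predict_section("prime minister speech", labelled)
```

Because "prime minister" is counted as one dimension rather than two unrelated words, the query shares a term with the politics article and none with the sport article, so the sketch assigns it to the politics section.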
CITATION STYLE
Kompan, M., & Bieliková, M. (2011). News article classification based on a vector representation including words’ collocations. In Advances in Intelligent and Soft Computing (Vol. 101, pp. 1–8). Springer Verlag. https://doi.org/10.1007/978-3-642-23163-6_1