Orthogonal Matching Pursuit for Text Classification

2Citations
Citations of this article
79Readers
Mendeley users who have this article in their library.

Abstract

In text classification, the problem of overfitting arises due to the high dimensionality, making regularization essential. Although classic regularizers provide sparsity, they fail to return highly accurate models. On the contrary, state-of-the-art group-lasso regularizers provide better results at the expense of low sparsity. In this paper, we apply a greedy variable selection algorithm, called Orthogonal Matching Pursuit, for the text classification task. We also extend standard group OMP by introducing overlapping Group OMP to handle overlapping groups of features. Empirical analysis verifies that both OMP and overlapping GOMP constitute powerful regularizers, able to produce effective and very sparse models. Code and data are available online1

Cite

CITATION STYLE

APA

Skianis, K., Tziortziotis, N., & Vazirgiannis, M. (2018). Orthogonal Matching Pursuit for Text Classification. In 4th Workshop on Noisy User-Generated Text, W-NUT 2018 - Proceedings of the Workshop (pp. 93–103). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-6113

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free