A Short Text Classification Algorithm Based on Semantic Extension

8Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

A semantic-extension-based algorithm for short texts is proposed, by involving the Word2vec and the LDA model, to improve the performance of classification, which is frequently deteriorated by semantic dependencies and scarcity of features. For every keyword within a short text, weighted synonyms and related words can be generated by the Word2Vec and LDA model, respectively, and subsequently be inserted to extend the short text to a reasonable length. We not only have established a criterion by means of similarity estimation to determine whether a sentence should be extended, we designed a scheme to choose the number of extended words. The extended text will be classified. Experimental results show that, the classification performance of the proposed algorithm, in terms of the precision rate, is approximately 5% higher than that of the TF-IDF model and approximately 10% higher than that of the VSM method.

References Powered by Scopus

A Vector Space Model for Automatic Indexing

5610Citations
N/AReaders
Get full text

Probabilistic latent semantic indexing

4284Citations
N/AReaders
Get full text

Latent Semantic Analysis

817Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Improving Arabic Cognitive Distortion Classification in Twitter using BERTopic

36Citations
N/AReaders
Get full text

Deep learning model with multi-feature fusion and label association for suicide detection

7Citations
N/AReaders
Get full text

A Novel Ensemble Learning Framework Based on News Sentiment Enhancement and Multi-objective Optimizer for Carbon Price Forecasting

2Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Yajian, Z., Dingpeng, D., & Junhui, C. (2021). A Short Text Classification Algorithm Based on Semantic Extension. Chinese Journal of Electronics, 30(1), 153–159. https://doi.org/10.1049/cje.2020.11.014

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

100%

Readers' Discipline

Tooltip

Computer Science 4

80%

Social Sciences 1

20%

Save time finding and organizing research with Mendeley

Sign up for free