Short Text Classification Based on Hierarchical Heterogeneous Graph and LDA Fusion

5Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

The proliferation of short texts resulting from the rapid advancements of social networks, online communication, and e-commerce has created a pressing need for short text classification in various applications. This paper presents a novel approach for short text classification, which combines a hierarchical heterogeneous graph with latent Dirichlet allocation (LDA) fusion. Our method first models the short text dataset as a hierarchical heterogeneous graph, which incorporates more syntactic and semantic information through a word graph, parts-of-speech (POS) tag graph, and entity graph. We then connected the representation of these three feature maps to derive a comprehensive feature vector for the text. Finally, we used the LDA topic model to adjust the feature weight, enhancing the effectiveness of short text extension. Our experiments demonstrated that our proposed approach has a promising performance in English short text classification, while in Chinese short text classification, although slightly inferior to the LDA + TF-IDF method, it still achieved promising results.

References Powered by Scopus

A re-examination of text categorization methods

2146Citations
N/AReaders
Get full text

Graph convolutional networks for text classification

1699Citations
N/AReaders
Get full text

A survey of text classification algorithms

1369Citations
N/AReaders
Get full text

Cited by Powered by Scopus

EHR-HGCN: An Enhanced Hybrid Approach for Text Classification Using Heterogeneous Graph Convolutional Networks in Electronic Health Records

4Citations
N/AReaders
Get full text

Data Sorting Influence on Short Text Manual Labeling Quality for Hierarchical Classification

3Citations
N/AReaders
Get full text

CoGraphNet for enhanced text classification using word-sentence heterogeneous graph representations and improved interpretability

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Xu, X., Li, B., Shen, Y., Luo, B., Zhang, C., & Hao, F. (2023). Short Text Classification Based on Hierarchical Heterogeneous Graph and LDA Fusion. Electronics (Switzerland), 12(12). https://doi.org/10.3390/electronics12122560

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

80%

Lecturer / Post doc 1

20%

Readers' Discipline

Tooltip

Computer Science 3

60%

Engineering 2

40%

Article Metrics

Tooltip
Mentions
Blog Mentions: 1
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free