A Topic Recognition Method of News Text Based on Word Embedding Enhancement

14Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

Topic recognition technology has been commonly applied to identify different categories of news topics from the vast amount of web information, which has a wide application prospect in the field of online public opinion monitoring, news recommendation, and so on. However, it is very challenging to effectively utilize key feature information such as syntax and semantics in the text to improve topic recognition accuracy. Some researchers proposed to combine the topic model with the word embedding model, whose results had shown that this approach could enrich text representation and benefit natural language processing downstream tasks. However, for the topic recognition problem of news texts, there is currently no standard way of combining topic model and word embedding model. Besides, some existing similar approaches were more complex and did not consider the fusion between topic distribution of different granularity and word embedding information. Therefore, this paper proposes a novel text representation method based on word embedding enhancement and further forms a full-process topic recognition framework for news text. In contrast to traditional topic recognition methods, this framework is designed to use the probabilistic topic model LDA, the word embedding models Word2vec and Glove to fully extract and integrate the topic distribution, semantic knowledge, and syntactic relationship of the text, and then use popular classifiers to automatically recognize the topic categories of news based on the obtained text representation vectors. As a result, the proposed framework can take advantage of the relationship between document and topic and the context information, which improves the expressive ability and reduces the dimensionality. Based on the two benchmark datasets of 20NewsGroup and BBC News, the experimental results verify the effectiveness and superiority of the proposed method based on word embedding enhancement for the news topic recognition problem.

References Powered by Scopus

GloVe: Global vectors for word representation

26891Citations
N/AReaders
Get full text

Indexing by latent semantic analysis

9514Citations
N/AReaders
Get full text

Probabilistic topic models

3958Citations
N/AReaders
Get full text

Cited by Powered by Scopus

A survey on cross-media search based on user intention understanding in social networks

23Citations
N/AReaders
Get full text

Ensemble Deep Learning Framework for Situational Aspects-Based Annotation and Classification of International Student’s Tweets during COVID-19

5Citations
N/AReaders
Get full text

Strengthening Sentence Similarity Identification Through OpenAI Embeddings and Deep Learning

3Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Du, Q., Li, N., Liu, W., Sun, D., Yang, S., & Yue, F. (2022). A Topic Recognition Method of News Text Based on Word Embedding Enhancement. Computational Intelligence and Neuroscience, 2022. https://doi.org/10.1155/2022/4582480

Readers over time

‘22‘23‘2401234

Readers' Seniority

Tooltip

Professor / Associate Prof. 1

50%

Lecturer / Post doc 1

50%

Readers' Discipline

Tooltip

Computer Science 2

67%

Social Sciences 1

33%

Save time finding and organizing research with Mendeley

Sign up for free
0