Identification High Influential Articles by Considering the Topic Characteristics of Articles

8Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The topic of one article reflects its main semantic content, which is also the main guidance for researchers to choose reference literature. In order to explore whether the topic of an article will affect its citation trend in future, this paper establishes a machine learning framework to study the role of topic characteristics in the prediction of future high influential articles. Articles from four different disciplines are collected as experimental samples to verify whether the framework proposed in this paper can be applied to the prediction task in different disciplines. The Latent Dirichlet Allocation (LDA) is used to determine the topic characteristics of sample articles. LDA can map sample articles to current hot topics and generate the mapping probability of sample articles under different hot topics. The maximum mapping probability of the sample article under the hot topics is extracted as the topic feature of the article. Then the feature space for the prediction task is constructed by combining the topic feature and some bibliometrics indices of articles. Three feature selection algorithms, Fisher Score, Relief-F and Spectral Feature Selection (SPEC), are taken to select the important features in the feature space. The prediction performance of these features is finally tested by three classifiers, SVM, KNN and Bagging. The experimental results show that the topic characteristics of article, the early citation characteristics of article, and the reputation of the author are the key factors that determine whether an article can grow into a highly influential one. The important value of topic characteristics in articles' citation activities shows that the content of the article is an important factor in attracting more citations.

Cite

CITATION STYLE

APA

Wang, M., Jiao, S., Zhang, J., Zhang, X., & Zhu, N. (2020). Identification High Influential Articles by Considering the Topic Characteristics of Articles. IEEE Access, 8, 107887–107899. https://doi.org/10.1109/ACCESS.2020.3001190

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free