Modelling Citation Networks for Improving Scientific Paper Classification Performance

Mengjie Zhang; Xiaoying Gao; Minh Duc Cao; Yuejin Ma

Book Chapter

Modelling Citation Networks for Improving Scientific Paper Classification Performance

Zhang M
Gao X
Cao M
et al.

DOI: 10.1007/978-3-540-36668-3_45

N/ACitations

5Readers

Get full text

Abstract

This paper describes an approach to the use of citation links to improve the scientific paper classification performance. In this approach, we develop two refinement functions, a linear label refinement (LLR) and a probabilistic label refinement (PLR), to model the citation link structures of the scientific papers for refining the class labels of the documents obtained by the content-based Naive Bayes classification method. The approach with the two new refinement models is examined and compared with the content-based Naive Bayes method on a standard paper classification data set with increasing training set sizes. The results suggest that both refinement models can significantly improve the system performance over the content-based method for all the training set sizes and that PLR is better than LLR when the training examples are sufficient.

Cite

CITATION STYLE

APA

Zhang, M., Gao, X., Cao, M. D., & Ma, Y. (2006). Modelling Citation Networks for Improving Scientific Paper Classification Performance (pp. 413–422). https://doi.org/10.1007/978-3-540-36668-3_45

Modelling Citation Networks for Improving Scientific Paper Classification Performance

Abstract

Cite

Register to see more suggestions