Incidental or influential? - Challenges in automatically detecting citation importance using publication full texts

David Pride; Petr Knoth

Conference Proceedings

Incidental or influential? - Challenges in automatically detecting citation importance using publication full texts

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10450 LNCS 572-578

DOI: 10.1007/978-3-319-67008-9_48

16Citations

9Readers

Get full text

Abstract

This work looks in depth at several studies that have attempted to automate the process of citation importance classification based on the publications’ full text. We analyse a range of features that have been previously used in this task. Our experimental results confirm that the number of in-text references are highly predictive of influence. Contrary to the work of Valenzuela et al. (2015) [1], we find abstract similarity one of the most predictive features. Overall, we show that many of the features previously described in literature are not particularly predictive. Consequently, we discuss challenges and potential improvements in the classification pipeline, provide a critical review of the performance of individual features and address the importance of constructing a large scale gold-standard reference dataset.

Cite

CITATION STYLE

APA

Pride, D., & Knoth, P. (2017). Incidental or influential? - Challenges in automatically detecting citation importance using publication full texts. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10450 LNCS, pp. 572–578). Springer Verlag. https://doi.org/10.1007/978-3-319-67008-9_48

Incidental or influential? - Challenges in automatically detecting citation importance using publication full texts

Abstract

Cite

Register to see more suggestions