Evaluating semantometrics from computer science publications

8Citations
Citations of this article
18Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Identification of important works and assessment of importance of publications in vast scientific corpora are challenging yet common tasks subjected by many research projects. While the influence of citations in finding seminal papers has been analysed thoroughly, citation-based approaches come with several problems. Their impracticality when confronted with new publications which did not yet receive any citations, area-dependent citation practices and different reasons for citing are only a few drawbacks of them. Methods relying on more than citations, for example semantic features such as words or topics contained in publications of citation networks, are regarded with less vigour while providing promising preliminary results. In this work we tackle the issue of classifying publications with their respective referenced and citing papers as either seminal, survey or uninfluential by utilising semantometrics. We use distance measures over words, semantics, topics and publication years of papers in their citation network to engineer features on which we predict the class of a publication. We present the SUSdblp dataset consisting of 1980 labelled entries to provide a means of evaluating this approach. A classification accuracy of up to.9247 was achieved when combining multiple types of features using semantometrics. This is +.1232 compared to the current state of the art (SOTA) which uses binary classification to identify papers from classes seminal and survey. The utilisation of one-vector representations for the ternary classification task resulted in an accuracy of.949 which is +.1475 compared to the binary SOTA. Classification based on information available at publication time derived with semantometrics resulted in an accuracy of.8152 while an accuracy of.9323 could be achieved when using one-vector representations.

References Powered by Scopus

GloVe: Global vectors for word representation

26881Citations
N/AReaders
Get full text

An index to quantify an individual's scientific research output

8588Citations
N/AReaders
Get full text

Radiomics: Images are more than pictures, they are data

6127Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Value creation in emerging technologies through text mining: the case of blockchain

12Citations
N/AReaders
Get full text

Identifying Promising Technologies of Electric Vehicles from the Perspective of Market and Technical Attributes

9Citations
N/AReaders
Get full text

RevASIDE: Assignment of Suitable Reviewer Sets for Publications from Fixed Candidate Pools

4Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Kreutz, C. K., Sahitaj, P., & Schenkel, R. (2020). Evaluating semantometrics from computer science publications. Scientometrics, 125(3), 2915–2954. https://doi.org/10.1007/s11192-020-03409-5

Readers over time

‘20‘21‘23‘24‘25036912

Readers' Seniority

Tooltip

Professor / Associate Prof. 3

33%

Researcher 3

33%

PhD / Post grad / Masters / Doc 2

22%

Lecturer / Post doc 1

11%

Readers' Discipline

Tooltip

Computer Science 8

57%

Social Sciences 3

21%

Engineering 2

14%

Medicine and Dentistry 1

7%

Save time finding and organizing research with Mendeley

Sign up for free
0