SmartPub: A Platform for Long-Tail Entity Extraction from Scientific Publications

5Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This demo presents SmartPub, a novel web-based platform that supports the exploration and visualization of shallow meta-data (e.g., author list, keywords) and deep meta-data - long tail named entities which are rare, and often relevant only in specific knowledge domain - from scientific publications. The platform collects documents from different sources (e.g. DBLP and Arxiv), and extracts the domain-specific named entities from the text of the publications using Named Entity Recognizers (NERs) which we can train with minimal human supervision even for rare entity types. The platform further enables the interaction with the Crowd for filtering purposes or training data generation, and provides extended visualization and exploration capabilities. SmartPub will be demonstrated using sample collection of scientific publications focusing on the computer science domain and will address the entity types Dataset (i.e. dataset presented or used in a publication), and Methods (i.e. algorithms used to create/enrich/analyse a data set).

Cite

CITATION STYLE

APA

Mesbah, S., Bozzon, A., Lofi, C., & Houben, G. J. (2018). SmartPub: A Platform for Long-Tail Entity Extraction from Scientific Publications. In The Web Conference 2018 - Companion of the World Wide Web Conference, WWW 2018 (pp. 191–194). Association for Computing Machinery, Inc. https://doi.org/10.1145/3184558.3186976

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free