R ultimate multilabel dataset repository

Francisco Charte; David Charte; Antonio Rivera; María José del Jesus; Francisco Herrera

Conference Proceedings

R ultimate multilabel dataset repository

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9648 487-499

DOI: 10.1007/978-3-319-32034-2_41

19Citations

18Readers

Get full text

Abstract

Multilabeled data is everywhere on the Internet. From news on digital media and entries published in blogs, to videos hosted in Youtube, every object is usually tagged with a set of labels. This way they can be categorized into several non-exclusive groups. However, publicly available multilabel datasets (MLDs) are not so common. There is a handful of websites providing a few of them, using disparate file formats. Finding proper MLDs, converting them into the correct format and locating the appropriate bibliographic data to cite them are some of the difficulties usually confronted by researchers and practitioners. In this paper RUMDR (R Ultimate Multilabel Dataset Repository), a new multilabel dataset repository aimed to fuse all public MLDs, is introduced, along with mldr.datasets, an R package which eases the process of retrieving MLDs and their bibliographic information, exporting them to the desired file formats and partitioning them.

Author supplied keywords

Cite

CITATION STYLE

APA

Charte, F., Charte, D., Rivera, A., del Jesus, M. J., & Herrera, F. (2016). R ultimate multilabel dataset repository. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9648, pp. 487–499). Springer Verlag. https://doi.org/10.1007/978-3-319-32034-2_41

R ultimate multilabel dataset repository

Abstract

Author supplied keywords

Cite

Register to see more suggestions