Identifying Novel Features from Specimen Data for the Prediction of Valuable Collection Trips

3Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Primary biodiversity data provide “what, where, and when” data points: the assertion that a species occurred at a particular point in space and time. These are most valuable when associated with specimens stored in natural history museums and herbaria, which evidence the assertions with reference to a physical specimen. The research presented uses novel data-mining techniques to uncover two hidden dimensions in specimen data - who collected the specimens and how they were collected. A combination of unsupervised and supervised learning techniques are used, which establish two new entities: collector and collection trip. Features are defined against these higher order representations of the data, which support the use of the data to answer novel questions such as which collection trips discover the most new species? We explore the features by building classifiers to predict species discovery, and compare these with a baseline model grouped using collector team transcriptions derived from the raw specimen data. Preliminary results are promising and whilst the particular focus of this research was botanical specimens, the technique is equally applicable to datasets of field-collected specimens from other scientific domains.

Cite

CITATION STYLE

APA

Nicolson, N., & Tucker, A. (2017). Identifying Novel Features from Specimen Data for the Prediction of Valuable Collection Trips. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10584 LNCS, pp. 235–246). Springer Verlag. https://doi.org/10.1007/978-3-319-68765-0_20

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free