A latent variable model for discovering bird species commonly misidentified by citizen scientists

6Citations
Citations of this article
35Readers
Mendeley users who have this article in their library.

Abstract

Data quality is a common source of concern for large-scale citizen science projects like cBird. In the case of eBird, a major cause of poor quality data is the misidentification of bird species by inexperienced contributors. A proactive approach for improving data quality is to discover commonly misidentified bird species and to teach inexperienced birders the differences between these species. To accomplish this goal, we develop a latent variable graphical model that can identify groups of bird species that are often confused for each other by eBird participants. Our model is a multi-species extension of the classic occupancy-detection model in the ecology literature. This multi-species extension requires a structure learning step as well as a computationally expensive parameter learning stage which we make efficient through a variational approximation. We show that our model can not only discover groups of misidentified species, but by including these misidentifications in the model, it can also achieve more accurate predictions of both species occupancy and detection.

Cite

CITATION STYLE

APA

Yu, J., Hutchinson, R. A., & Wong, W. K. (2014). A latent variable model for discovering bird species commonly misidentified by citizen scientists. In Proceedings of the National Conference on Artificial Intelligence (Vol. 1, pp. 500–506). AI Access Foundation. https://doi.org/10.1609/aaai.v28i1.8763

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free