Improving the Precise Identification of Animal Species through the use of Vision Transformers

0Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

When it comes to the precise recognition of animal species, fine-grained classification proves to be particularly difficult since there are often minute differences between species as in substantial variation in the same species. Convolutional Neural Networks (CNNs), which are predominantly used in image classification, have an edge over these networks because image details are compressed into high dimensional embeddings and thus fine-grained classification becomes a problem. An innovative approach can be found in the use of Vision Transformers (ViTs), which employ self-attention mechanisms and model long-range dependencies. In this study, we put ViTs to a more complex task of identifying closely related animal species, in which the conveyance of the minute differences and context specifics is of utmost importance. The benchmarks datasets are employed along with a series of comparative research predictive of the classification performance competing with the best-in-class CNNs. The results indicate that in fine-grained classification tasks, ViTs performance is more accurate and stable than alternatives. In addition, the research demonstrates that ViTs can transfer well in the classification of datasets containing different animal species, which suggests their capability for use in practical ecology. The present method enhances the efficiency of species identification but more importantly offers a revolutionary device for biodiversity conservation, ecological data processing and research planning. This contribution to knowledge places Vision Transformers at the forefront of technology advancement in fine-grain visual recognition and enables the automation of ecological research.

Cite

CITATION STYLE

APA

Soundarya, M., Kumar, L. K., & Prasath, P. S. M. (2024). Improving the Precise Identification of Animal Species through the use of Vision Transformers. In 3rd International Conference on Automation, Computing and Renewable Systems, ICACRS 2024 - Proceedings (pp. 1287–1293). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICACRS62842.2024.10841612

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free