UniAP: Towards Universal Animal Perception in Vision via Few-Shot Learning

3Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

Animal visual perception is an important technique for automatically monitoring animal health, understanding animal behaviors, and assisting animal-related research. However, it is challenging to design a deep learning-based perception model that can freely adapt to different animals across various perception tasks, due to the varying poses of a large diversity of animals, lacking data on rare species, and the semantic inconsistency of different tasks. We introduce UniAP, a novel Universal Animal Perception model that leverages fewshot learning to enable cross-species perception among various visual tasks. Our proposed model takes support images and labels as prompt guidance for a query image. Images and labels are processed through a Transformer-based encoder and a lightweight label encoder, respectively. Then a matching module is designed for aggregating information between prompt guidance and the query image, followed by a multihead label decoder to generate outputs for various tasks. By capitalizing on the shared visual characteristics among different animals and tasks, UniAP enables the transfer of knowledge from well-studied species to those with limited labeled data or even unseen species. We demonstrate the effectiveness of UniAP through comprehensive experiments in pose estimation, segmentation, and classification tasks on diverse animal species, showcasing its ability to generalize and adapt to new classes with minimal labeled examples.

References Powered by Scopus

ImageNet: A Large-Scale Hierarchical Image Database

51043Citations
N/AReaders
Get full text

ImageNet Large Scale Visual Recognition Challenge

30433Citations
N/AReaders
Get full text

Microsoft COCO: Common objects in context

28871Citations
N/AReaders
Get full text

Cited by Powered by Scopus

From Vision to Vocabulary: A Multimodal Approach to Detect and Track Black Cattle Behaviors

0Citations
N/AReaders
Get full text

Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation

0Citations
N/AReaders
Get full text

UniFS: Universal Few-Shot Instance Perception with Point Representations

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Sun, M., Zhao, Z., Chai, W., Luo, H., Cao, S., Zhang, Y., … Wang, G. (2024). UniAP: Towards Universal Animal Perception in Vision via Few-Shot Learning. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 38, pp. 5008–5016). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v38i5.28305

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

100%

Readers' Discipline

Tooltip

Computer Science 2

67%

Biochemistry, Genetics and Molecular Bi... 1

33%

Save time finding and organizing research with Mendeley

Sign up for free