SGAP-Net: Semantic-guided attentive prototypes network for few-shot human-object interaction recognition

Abstract

Extreme instance imbalance among categories and combinatorial explosion make the recognition of Human-Object Interaction (HOI) a challenging task. Few studies have addressed both challenges directly. Motivated by the success of few-shot learning, which learns a robust model from a few instances, we formulate HOI recognition as a few-shot task in a meta-learning framework to alleviate these challenges. Because HOI is intrinsically diverse and interactive, we propose a Semantic-Guided Attentive Prototypes Network (SGAP-Net) to learn a semantic-guided metric space in which HOI recognition is performed by computing distances to the attentive prototype of each class. Specifically, the model generates attentive prototypes guided by the category names of actions and objects, which highlight the commonalities of images from the same HOI class. In addition, we design a novel decision method to alleviate the biases produced by different patterns of the same action in HOI. Finally, to realize the task of few-shot HOI, we reorganize two HOI benchmark datasets into HICO-FS and TUHOI-FS. Extensive experimental results on both datasets demonstrate the effectiveness of the proposed SGAP-Net approach.
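The core idea of classifying by distance to attention-weighted class prototypes can be illustrated with a minimal NumPy sketch. This is not the authors' architecture: the function names, the dot-product attention, the squared Euclidean distance, and the assumption that class-name embeddings share the image-feature dimension are all hypothetical simplifications chosen for illustration, and the novel decision method mentioned in the abstract is not modeled.

```python
import numpy as np


def softmax(x):
    # Numerically stable softmax over a 1-D array.
    z = np.exp(x - np.max(x))
    return z / z.sum()


def attentive_prototypes(support, labels, semantics):
    """Build one prototype per class as an attention-weighted mean.

    support:   (n_support, d) image features
    labels:    (n_support,) integer class ids in [0, n_classes)
    semantics: (n_classes, d) class-name embeddings; sharing dimension d
               with the image features is an assumption of this sketch
    Every class is assumed to have at least one support example.
    """
    prototypes = np.zeros_like(semantics, dtype=float)
    for c in range(semantics.shape[0]):
        feats = support[labels == c]              # (k, d) shots of class c
        weights = softmax(feats @ semantics[c])   # (k,) attention over shots
        prototypes[c] = weights @ feats           # weighted mean, shape (d,)
    return prototypes


def classify(queries, prototypes):
    # Squared Euclidean distance from each query to each prototype,
    # then pick the nearest prototype.
    diffs = queries[:, None, :] - prototypes[None, :, :]
    dists = (diffs ** 2).sum(axis=-1)             # (n_query, n_classes)
    return dists.argmin(axis=1)


# Toy 2-way 2-shot episode with made-up 2-D features.
support = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
labels = np.array([0, 0, 1, 1])
semantics = np.array([[1.0, 0.0], [0.0, 1.0]])

protos = attentive_prototypes(support, labels, semantics)
preds = classify(np.array([[0.8, 0.2], [0.2, 0.8]]), protos)
```

In this toy episode, support examples that align more closely with their class-name embedding receive larger attention weights, so each prototype is pulled toward the shots most typical of its class.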

Cite

Ji, Z., Liu, X., Pang, Y., & Li, X. (2020). SGAP-Net: Semantic-guided attentive prototypes network for few-shot human-object interaction recognition. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 11085–11092). AAAI Press. https://doi.org/10.1609/aaai.v34i07.6764
