Human-Object Interaction (HOI) detection plays a core role in activity understanding. As a compositional learning problem (human-verb-object), studying its generalization matters. However, widely-used metric mean average precision (mAP) maybe not enough to model the compositional generalization well. Here, we propose a novel metric, mPD (mean Performance Degradation), as a complementary of mAP to evaluate the performance gap among compositions of different objects and the same verb. Surprisingly, mPD reveals that previous state-of-the-arts usually do not generalize well. With mPD as a cue, we propose Object Category (OC) Immunity to advance HOI generalization. Concretely, our core idea is to prevent model from learning spurious object-verb correlations as a short-cut to over-fit the train set. To achieve OC-immunity, we propose an OC-immune network that decouples the inputs from OC, extracts OC-immune representations and leverages uncertainty quantification to generalize to unseen objects. In both conventional and zero-shot experiments, our method achieves decent improvements. To fully evaluate the generalization, we design a new and more difficult benchmark, on which we present significant advantage. The code is available at https://github.com/Foruck/OC-Immunity.
CITATION STYLE
Liu, X., Li, Y. L., & Lu, C. (2022). Highlighting Object Category Immunity for the Generalization of Human-Object Interaction Detection. In Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 (Vol. 36, pp. 1819–1827). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v36i2.20075
Mendeley helps you to discover research relevant for your work.