ASTA-Net: Adaptive Spatio-Temporal Attention Network for Person Re-Identification in Videos

Xierong Zhu; Jiawei Liu; Haoze Wu; Meng Wang; Zheng Jun Zha

Conference Proceedings, Open Access

MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (2020) 1706-1715

DOI: 10.1145/3394171.3413843

14 Citations

15 Readers

Abstract

The attention mechanism has been widely applied to enhance pedestrian representation for person re-identification in videos. However, most existing methods learn spatial and temporal attention separately and thus ignore the correlation between them. In this work, we propose a novel Adaptive Spatio-Temporal Attention Network (ASTA-Net) that adaptively aggregates spatial and temporal attention features into a discriminative pedestrian representation for person re-identification in videos. Specifically, multiple Adaptive Spatio-Temporal Fusion modules within ASTA-Net are designed to explore precise spatio-temporal attention on multi-level feature maps. They first obtain preliminary spatial and temporal attention features via the spatial semantic relations within each frame and the temporal dependencies among non-consecutive frames, then adaptively aggregate the preliminary attention features on the basis of their correlation. Moreover, an Adjacent-Frame Motion module is designed to explicitly extract motion patterns according to the feature-level variation among adjacent frames. Extensive experiments on three widely used datasets, i.e., MARS, iLIDS-VID and PRID2011, demonstrate the effectiveness of the proposed approach.
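
The abstract describes two ideas that a toy example can make concrete: extracting motion cues from feature-level variation between adjacent frames, and fusing spatial and temporal features according to their correlation. The sketch below is only an illustration under strong assumptions and is not the authors' implementation. Per-frame features are plain Python lists, and the function names (`adjacent_frame_variation`, `fuse_by_correlation`), the cosine-similarity measure, and the sigmoid gate are all hypothetical choices made here for illustration; the paper's actual modules may differ substantially.

```python
import math


def adjacent_frame_variation(frames):
    """Element-wise difference between each pair of adjacent frame features.

    `frames` is a list of T equal-length feature vectors; the result has
    T - 1 vectors. This stands in for "feature-level variation among
    adjacent frames" (an assumption, not the paper's exact operator).
    """
    return [
        [curr_value - prev_value for prev_value, curr_value in zip(prev, curr)]
        for prev, curr in zip(frames, frames[1:])
    ]


def cosine_similarity(u, v):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def fuse_by_correlation(spatial, temporal):
    """Blend two feature vectors with a gate derived from their correlation.

    Hypothetical rule: gate = sigmoid(cosine similarity), then a convex
    combination of the spatial and temporal features.
    """
    gate = 1.0 / (1.0 + math.exp(-cosine_similarity(spatial, temporal)))
    return [gate * s + (1.0 - gate) * t for s, t in zip(spatial, temporal)]


# Three toy frame features of dimension 2.
frames = [[1.0, 2.0], [1.5, 2.0], [3.0, 1.0]]
motion = adjacent_frame_variation(frames)

# Orthogonal spatial and temporal features give a gate of exactly 0.5.
fused = fuse_by_correlation([1.0, 0.0], [0.0, 1.0])
```

In a real network the features would be multi-level convolutional feature maps and the fusion weights would be learned; the fixed gate here only shows the shape of a correlation-driven aggregation.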

Cite

Zhu, X., Liu, J., Wu, H., Wang, M., & Zha, Z. J. (2020). ASTA-Net: Adaptive Spatio-Temporal Attention Network for Person Re-Identification in Videos. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 1706–1715). Association for Computing Machinery, Inc. https://doi.org/10.1145/3394171.3413843
