Integration of Multi-Head Self-Attention and Convolution for Person Re-Identification


Abstract

Person re-identification is essential to intelligent video analytics; its results affect downstream tasks such as behavior and event analysis. However, most existing models consider only accuracy and ignore computational complexity, which also matters in practical deployment. Self-attention is a powerful technique for representation learning, and it can be combined with convolution to learn more discriminative feature representations for re-identification. We propose an improved multi-scale feature learning structure, DM-OSNet, with better performance than the original OSNet. Our DM-OSNet replaces the (Formula presented.) convolutional stream in OSNet with multi-head self-attention. To maintain model efficiency, we use double-layer multi-head self-attention to reduce the computational complexity of the original multi-head self-attention, from the original (Formula presented.) to (Formula presented.). To further improve model performance, we use SpCL to perform unsupervised pre-training on the large-scale unlabeled pedestrian dataset LUPerson. Finally, our DM-OSNet achieves an mAP of 87.36%, 78.26%, 72.96%, and 57.13% on the Market1501, DukeMTMC-reID, CUHK03, and MSMT17 datasets, respectively.
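For readers unfamiliar with the mechanism the abstract builds on, the sketch below shows plain multi-head self-attention over a flattened H×W feature map in NumPy. This is an illustrative sketch only, not the authors' DM-OSNet code: the function name, weight matrices, and shapes are all assumptions, and it implements standard (single-layer) multi-head self-attention, whose quadratic cost in the sequence length is exactly what the paper's double-layer variant is designed to reduce.

```python
import numpy as np

def multi_head_self_attention(x, w_q, w_k, w_v, w_o, num_heads):
    """Standard multi-head self-attention (illustrative sketch).

    x: (seq_len, d_model) -- e.g. an H*W feature map flattened to seq_len = H*W.
    w_q, w_k, w_v, w_o: (d_model, d_model) projection matrices.
    """
    seq_len, d_model = x.shape
    d_head = d_model // num_heads

    # Linear projections to queries, keys, and values.
    q, k, v = x @ w_q, x @ w_k, x @ w_v

    # Split into heads: (num_heads, seq_len, d_head).
    def split(t):
        return t.reshape(seq_len, num_heads, d_head).transpose(1, 0, 2)
    q, k, v = split(q), split(k), split(v)

    # Scaled dot-product scores: (num_heads, seq_len, seq_len).
    # This seq_len x seq_len matrix is the quadratic-cost term.
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)

    # Numerically stable softmax over the last axis.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)

    # Weighted sum of values, then merge heads and apply output projection.
    out = weights @ v                                  # (num_heads, seq_len, d_head)
    out = out.transpose(1, 0, 2).reshape(seq_len, d_model)
    return out @ w_o
```

Because the attention matrix has one entry per pair of spatial positions, its cost grows quadratically with H×W, which motivates the paper's cheaper double-layer formulation.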

Cite


APA

Zhou, Y., Liu, P., Cui, Y., Liu, C., & Duan, W. (2022). Integration of Multi-Head Self-Attention and Convolution for Person Re-Identification. Sensors, 22(16). https://doi.org/10.3390/s22166293
