CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds

Zhiyang Guo; Yunyao Mao; Wengang Zhou; Min Wang; Houqiang Li

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2022) 13682 LNCS 95-111

DOI: 10.1007/978-3-031-20047-2_6

Abstract

Effectively matching the target template features with the search area is the core problem in point-cloud-based 3D single object tracking. Most existing methods, however, focus on devising sophisticated point-level matching modules while overlooking the rich spatial context of the points. To this end, we propose the Context-Matching-Guided Transformer (CMT), a Siamese tracking paradigm for 3D single object tracking. We first leverage the local distribution of points to construct a horizontally rotation-invariant contextual descriptor for both the template and the search area. Then, a novel matching strategy based on shifted windows is designed for such descriptors to effectively measure the template-search contextual similarity. Furthermore, we introduce a target-specific transformer and a spatial-aware orientation encoder to exploit the target-aware information in the most contextually relevant template points, thereby enhancing the search feature for a better target proposal. We conduct extensive experiments to verify the merits of CMT and report new state-of-the-art results on three widely adopted datasets.
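
To make the idea of a horizontally rotation-invariant contextual descriptor concrete, the following is a minimal illustrative sketch, not the descriptor or matching strategy used in CMT. It assumes a simple stand-in: each point is described by the horizontal distances and vertical offsets to its k nearest neighbours, quantities that do not change when the cloud is rotated about the vertical (z) axis. The function names `yaw_invariant_descriptor` and `match_context`, the choice of k, and the plain nearest-descriptor matching (in place of the shifted-window strategy) are all assumptions made for illustration.

```python
import numpy as np


def yaw_invariant_descriptor(points: np.ndarray, k: int = 8) -> np.ndarray:
    """Hypothetical per-point descriptor, unchanged by rotation about z.

    points: (N, 3) array of xyz coordinates, with N > k.
    Returns an (N, 2k) array holding, for each point, the horizontal
    distances and vertical offsets to its k nearest neighbours,
    ordered by 3D distance.
    """
    n = points.shape[0]
    # diff[i, j] = p_j - p_i, the offset from point i to point j.
    diff = points[None, :, :] - points[:, None, :]
    dist = np.linalg.norm(diff, axis=-1)
    # Column 0 of the sorted order is the point itself (distance 0); skip it.
    idx = np.argsort(dist, axis=1)[:, 1:k + 1]
    nbr = diff[np.arange(n)[:, None], idx]          # (N, k, 3)
    horiz = np.linalg.norm(nbr[..., :2], axis=-1)   # invariant to yaw
    vert = nbr[..., 2]                              # invariant to yaw
    return np.concatenate([horiz, vert], axis=1)


def match_context(template_desc: np.ndarray, search_desc: np.ndarray) -> np.ndarray:
    """For each search point, index of the closest template descriptor.

    Assumption: plain Euclidean nearest-descriptor matching, used here
    only as a stand-in for the paper's shifted-window matching.
    """
    d = np.linalg.norm(
        search_desc[:, None, :] - template_desc[None, :, :], axis=-1
    )
    return d.argmin(axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    template = rng.normal(size=(32, 3))
    theta = 0.7  # arbitrary yaw angle in radians
    rot_z = np.array([
        [np.cos(theta), -np.sin(theta), 0.0],
        [np.sin(theta), np.cos(theta), 0.0],
        [0.0, 0.0, 1.0],
    ])
    search = template @ rot_z.T  # same cloud, rotated about the z-axis
    matches = match_context(
        yaw_invariant_descriptor(template),
        yaw_invariant_descriptor(search),
    )
    print(matches)
```

In this toy setting the search cloud is just a yawed copy of the template, so each search point should be matched back to its own template point. A real tracker would face partial, noisy clouds, which is where a more robust matching strategy becomes necessary.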

Citation (APA)

Guo, Z., Mao, Y., Zhou, W., Wang, M., & Li, H. (2022). CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13682 LNCS, pp. 95–111). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-20047-2_6
