SPT: Single Pedestrian Tracking Framework with Re-Identification-Based Learning Using the Siamese Model

4Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

Pedestrian tracking is a challenging task in the area of visual object tracking research and it is a vital component of various vision-based applications such as surveillance systems, human-following robots, and autonomous vehicles. In this paper, we proposed a single pedestrian tracking (SPT) framework for identifying each instance of a person across all video frames through a tracking-by-detection paradigm that combines deep learning and metric learning-based approaches. The SPT framework comprises three main modules: detection, re-identification, and tracking. Our contribution is a significant improvement in the results by designing two compact metric learning-based models using Siamese architecture in the pedestrian re-identification module and combining one of the most robust re-identification models for data associated with the pedestrian detector in the tracking module. We carried out several analyses to evaluate the performance of our SPT framework for single pedestrian tracking in the videos. The results of the re-identification module validate that our two proposed re-identification models surpass existing state-of-the-art models with increased accuracies of 79.2% and 83.9% on the large dataset and 92% and 96% on the small dataset. Moreover, the proposed SPT tracker, along with six state-of-the-art (SOTA) tracking models, has been tested on various indoor and outdoor video sequences. A qualitative analysis considering six major environmental factors verifies the effectiveness of our SPT tracker under illumination changes, appearance variations due to pose changes, changes in target position, and partial occlusions. In addition, quantitative analysis based on experimental results also demonstrates that our proposed SPT tracker outperforms the GOTURN, CSRT, KCF, and SiamFC trackers with a success rate of 79.7% while beating the DiamSiamRPN, SiamFC, CSRT, GOTURN, and SiamMask trackers with an average of 18 tracking frames per second.

References Powered by Scopus

Deep residual learning for image recognition

174052Citations
N/AReaders
Get full text

Histograms of oriented gradients for human detection

30454Citations
N/AReaders
Get full text

Rethinking the Inception Architecture for Computer Vision

24021Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Edge Deployment of Vision-Based Model for Human Following Robot

3Citations
N/AReaders
Get full text

Enhancing real human detection and people counting using YOLOv8

2Citations
N/AReaders
Get full text

Multi-target detection and tracking based on CRF network and spatio-temporal attention for sports videos

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Manzoor, S., An, Y. C., In, G. G., Zhang, Y., Kim, S., & Kuc, T. Y. (2023). SPT: Single Pedestrian Tracking Framework with Re-Identification-Based Learning Using the Siamese Model. Sensors, 23(10). https://doi.org/10.3390/s23104906

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 2

50%

Professor / Associate Prof. 1

25%

Lecturer / Post doc 1

25%

Readers' Discipline

Tooltip

Computer Science 4

67%

Engineering 1

17%

Earth and Planetary Sciences 1

17%

Article Metrics

Tooltip
Mentions
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free