Online Multi-Object Tracking Based on Feature Representation and Bayesian Filtering Within a Deep Learning Architecture

39Citations
Citations of this article
44Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

In detection-based multi-object tracking (MOT), one challenging problem is to design a robust affinity model for data association. Moreover, since these approaches entirely rely on detection responses to locate targets, a strategy should be taken to deal with a detector's defect. In this paper, we propose a robust online MOT tracking method that can handle these two issues effectively. We first present a novel affinity model by jointly learning more powerful feature representation and distance metric within a deep architecture. Specifically, we design a convolutional neural network to extract appearance cue tailored toward person Re-ID and a long short-term memory network to extract motion cue to encode dynamics of targets. Both the cues are then combined with a triplet loss function, which performs end-to-end deep metric learning to encode dependences across both cues automatically and thus generates fused features in embedding space to distinguish targets. To overcome the detector's limitation, a trajectory estimation strategy is presented. We design a recurrent neural network-based Bayesian filtering module, which takes a hidden state of the above-mentioned LSTM network as an input and performs recursive prediction and update for explicitly estimating targets state. In this way, we can reconstruct trajectories by filling the gaps where no detections are present or adjusting the exact locations of trajectory where detections are imprecise. The experiments on the challenging MOT 2015 and 2016 datasets show very competitive results when comparing our method with the recent state-of-the-art batch and online tracking approaches. We achieve top one in terms of multiple objects tracking accuracy and multiple objects tracking precision among online methods on the MOT2016 dataset.

Cite

CITATION STYLE

APA

Xiang, J., Zhang, G., & Hou, J. (2019). Online Multi-Object Tracking Based on Feature Representation and Bayesian Filtering Within a Deep Learning Architecture. IEEE Access, 7, 27923–27935. https://doi.org/10.1109/ACCESS.2019.2901520

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free