Depth-adaptive computational policies for efficient visual tracking


Abstract

Current convolutional neural network algorithms for video object tracking spend the same amount of computation on every object and video frame [3]. However, tracking an object is harder in some frames than in others, owing to varying amounts of clutter, scene complexity, and motion, and to how distinctive the object is against its background. We propose a depth-adaptive convolutional Siamese network that performs video tracking adaptively at multiple neural network depths. Parametric gating functions are trained to control the depth of the convolutional feature extractor by minimizing a joint loss of computational cost and tracking error. Our network achieves accuracy comparable to the state of the art on the VOT2016 benchmark. Furthermore, our adaptive depth computation achieves higher accuracy for a given computational cost than traditional fixed-structure neural networks. The presented framework extends to other tasks that use convolutional neural networks and enables trading speed for accuracy at runtime.
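To make the idea concrete, below is a minimal PyTorch-style sketch of a depth-adaptive feature extractor with learned halting gates and a joint cost/error objective. The layer sizes, the gate form (a sigmoid over globally pooled features), the cost proxy, and the names `tracking_loss` and `lam` are illustrative assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn

class DepthAdaptiveExtractor(nn.Module):
    """Convolutional feature extractor whose effective depth is chosen at
    runtime by learned gating functions. All layer sizes, the gate form,
    and the cost proxy below are illustrative assumptions, not the
    paper's exact architecture."""

    def __init__(self, channels=64, num_blocks=5):
        super().__init__()
        self.stem = nn.Conv2d(3, channels, kernel_size=3, padding=1)
        self.blocks = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(channels, channels, kernel_size=3, padding=1),
                nn.ReLU(inplace=True),
            )
            for _ in range(num_blocks)
        ])
        # One scalar gate per block: probability that this depth suffices.
        self.gates = nn.ModuleList([
            nn.Linear(channels, 1) for _ in range(num_blocks)
        ])

    def forward(self, x, halt_threshold=0.5):
        x = self.stem(x)
        cost = x.new_zeros(())  # expected compute cost, accumulated per block
        for block, gate in zip(self.blocks, self.gates):
            x = block(x)
            # The gate inspects globally pooled features and decides whether
            # the current frame needs further processing.
            p_halt = torch.sigmoid(gate(x.mean(dim=(2, 3)))).mean()
            cost = cost + (1.0 - p_halt)  # penalize continuing to deeper blocks
            if not self.training and p_halt > halt_threshold:
                break  # easy frames exit early at inference time
        return x, cost

# Joint objective of the kind the abstract describes: tracking error plus a
# weighted computational-cost term (tracking_loss and lam are placeholders).
#   feats, cost = extractor(frame_crop)
#   loss = tracking_loss(feats, target) + lam * cost
```

In a sketch like this, training runs all blocks so gradients reach every gate, while inference exits early on easy frames; the weight on the cost term is what lets speed be traded for accuracy at runtime.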

Citation (APA)

Ying, C., & Fragkiadaki, K. (2018). Depth-adaptive computational policies for efficient visual tracking. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10746 LNCS, pp. 109–122). Springer Verlag. https://doi.org/10.1007/978-3-319-78199-0_8
