Online multi-modal robust non-negative dictionary learning for visual tracking

Xiang Zhang; Naiyang Guan; Dacheng Tao; Xiaogang Qiu; Zhigang Luo

Journal ArticleOPEN ACCESS

Online multi-modal robust non-negative dictionary learning for visual tracking

PLoS ONE (2015) 10(5)

DOI: 10.1371/journal.pone.0124685

7Citations

21Readers

Abstract

Dictionary learning is a method of acquiring a collection of atoms for subsequent signal representation. Due to its excellent representation ability, dictionary learning has been widely applied in multimedia and computer vision. However, conventional dictionary learning algorithms fail to deal with multi-modal datasets. In this paper, we propose an online multi-modal robust non-negative dictionary learning (OMRNDL) algorithm to overcome this deficiency. Notably, OMRNDL casts visual tracking as a dictionary learning problem under the particle filter framework and captures the intrinsic knowledge about the target from multiple visual modalities, e.g., pixel intensity and texture information. To this end, OMRNDL adaptively learns an individual dictionary, i.e., template, for each modality from available frames, and then represents new particles over all the learned dictionaries by minimizing the fitting loss of data based on M-estimation. The resultant representation coefficient can be viewed as the common semantic representation of particles across multiple modalities, and can be utilized to track the target. OMRNDL incrementally learns the dictionary and the coefficient of each particle by using multiplicative update rules to respectively guarantee their non-negativity constraints. Experimental results on a popular challenging video benchmark validate the effectiveness of OMRNDL for visual tracking in both quantity and quality.

Cite

CITATION STYLE

APA

Zhang, X., Guan, N., Tao, D., Qiu, X., & Luo, Z. (2015). Online multi-modal robust non-negative dictionary learning for visual tracking. PLoS ONE, 10(5). https://doi.org/10.1371/journal.pone.0124685

Online multi-modal robust non-negative dictionary learning for visual tracking

Abstract

Cite

Register to see more suggestions