Learning video features for multi-label classification

Shivam Garg

Conference Proceedings (Open Access)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11132 LNCS 325-337

DOI: 10.1007/978-3-030-11018-5_30

Citations: 1

Readers: 11

Abstract

This paper studies several approaches to learning video representations. The work was done as part of the YouTube-8M Video Understanding Challenge. The main focus is to analyze various approaches to modeling temporal data and to evaluate their performance on this problem. A model is also proposed that reduces the size of the feature vector by 70% without compromising accuracy. The first approach uses recurrent neural network architectures to learn a single video-level feature from frame-level features and then uses this aggregated feature for multi-label classification. The second approach uses video-level features and deep neural networks to assign the labels.
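To make the first approach concrete, the following is a minimal sketch of aggregating frame-level features into one video-level feature with a recurrent network and then scoring labels independently. It is an illustration only, not the paper's model: the plain tanh recurrence, the random untrained weights, the tiny dimensions, the 0.5 threshold, and all function names are assumptions chosen for readability.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def random_matrix(rows, cols, rng, scale=0.1):
    """Small random weights (assumption: a real model would learn these)."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def aggregate_frames(frames, w_in, w_rec):
    """Run a plain tanh RNN over the frames and return the final hidden
    state as the single video-level feature."""
    hidden = [0.0] * len(w_rec)
    for frame in frames:
        pre = [a + b for a, b in zip(matvec(w_in, frame), matvec(w_rec, hidden))]
        hidden = [math.tanh(p) for p in pre]
    return hidden


def label_probabilities(video_feature, w_out):
    """One independent sigmoid per label, since labels are not exclusive."""
    return [1.0 / (1.0 + math.exp(-z)) for z in matvec(w_out, video_feature)]


rng = random.Random(0)
# Hypothetical sizes, far smaller than any real video dataset would use.
FRAME_DIM, HIDDEN_DIM, NUM_LABELS, NUM_FRAMES = 8, 4, 3, 5

w_in = random_matrix(HIDDEN_DIM, FRAME_DIM, rng)
w_rec = random_matrix(HIDDEN_DIM, HIDDEN_DIM, rng)
w_out = random_matrix(NUM_LABELS, HIDDEN_DIM, rng)

# Stand-in frame-level features for one video.
frames = [[rng.random() for _ in range(FRAME_DIM)] for _ in range(NUM_FRAMES)]

video_feature = aggregate_frames(frames, w_in, w_rec)
probs = label_probabilities(video_feature, w_out)
predicted = [i for i, p in enumerate(probs) if p >= 0.5]  # assumed threshold
print(len(video_feature), probs, predicted)
```

Under the same assumptions, the second approach would skip `aggregate_frames` and pass a precomputed video-level feature straight to a classifier such as `label_probabilities`, typically with additional hidden layers in between.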

Citation (APA)

Garg, S. (2019). Learning video features for multi-label classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11132 LNCS, pp. 325–337). Springer Verlag. https://doi.org/10.1007/978-3-030-11018-5_30
