Herein we present the solution to the 2 nd YouTube-8M video understanding challenge which placed 1 st . Competition participants were tasked with building a size constrained video labeling model with a model size of less than 1 GB. Our final solution consists of several submodels belonging to Fisher vectors, NetVlad, Deep Bag of Frames and Recurrent neural networks model families. To make the classifier efficient under size constraints we introduced model distillation, partial weights quantization and training with exponential moving average.
CITATION STYLE
Skalic, M., & Austin, D. (2019). Building a size constrained predictive models for video classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11132 LNCS, pp. 297–305). Springer Verlag. https://doi.org/10.1007/978-3-030-11018-5_27
Mendeley helps you to discover research relevant for your work.