Semantic aware video transcription using random forest classifiers

Abstract

This paper focuses on transcription generation in the form of subject, verb, object (SVO) triplets for videos in the wild, given off-the-shelf visual concept detectors. The problem is challenging because only sentence-level annotations are available, concept detectors are unreliable, and many words have few training samples. Facing these challenges, we propose a Semantic Aware Transcription (SAT) framework based on Random Forest classifiers. It takes concept detection results as input and outputs a distribution over English words. SAT is trained on video-sentence pairs and hierarchically learns node splits by grouping semantically similar words, with similarity measured by a continuous skip-gram language model. This not only addresses the sparsity of training samples per word, but also yields semantically reasonable errors during transcription. SAT also provides a systematic way to measure how closely a concept detector relates to real words, which helps us understand the relationship between current visual detectors and words in a semantic space. Experiments on a large video dataset with 1,970 clips and 85,550 sentences demonstrate the approach. © 2014 Springer International Publishing.
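The abstract does not include an implementation, but the core idea of a semantically aware node split can be illustrated. Below is a minimal, hypothetical Python sketch: toy random vectors stand in for skip-gram word embeddings, a tiny 2-means grouping stands in for the paper's semantic word grouping, and a threshold on a single concept-detector score stands in for a Random Forest node test. All names, data, and the clustering heuristic are assumptions made for illustration, not the authors' actual training procedure.

import numpy as np

rng = np.random.default_rng(0)

# Toy word vectors standing in for continuous skip-gram embeddings
# (hypothetical; the paper trains a real skip-gram model on text).
words = ["run", "walk", "jog", "eat", "drink", "cook"]
emb = {w: rng.normal(size=8) for w in words}

def split_words(word_list, emb):
    """Group words into two semantically coherent clusters with a tiny
    2-means over their embeddings; returns (left_words, right_words)."""
    X = np.stack([emb[w] for w in word_list])
    sims = X @ X.T
    i, j = np.unravel_index(np.argmin(sims), sims.shape)  # most dissimilar pair
    centers = X[[i, j]].copy()
    for _ in range(10):  # a few Lloyd iterations
        d = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        assign = d.argmin(axis=1)
        for k in range(2):
            if (assign == k).any():
                centers[k] = X[assign == k].mean(axis=0)
    left = [w for w, a in zip(word_list, assign) if a == 0]
    right = [w for w, a in zip(word_list, assign) if a == 1]
    return left, right

def best_threshold(scores, labels):
    """Choose the concept-score threshold that best separates videos whose
    sentences use left-cluster words (label 0) from right-cluster words (1)."""
    best_t, best_acc = None, -1.0
    for t in np.unique(scores):
        acc = max(((scores >= t) == labels).mean(),
                  ((scores < t) == labels).mean())
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t, best_acc

# Toy usage: six videos, one detector score each; the labels marking which
# word cluster each video's sentence vocabulary falls into are made up.
left, right = split_words(words, emb)
scores = np.array([0.9, 0.8, 0.7, 0.2, 0.1, 0.3])
labels = np.array([0, 0, 0, 1, 1, 1])
t, acc = best_threshold(scores, labels)
print("clusters:", left, "|", right)
print("split threshold:", t, "accuracy:", acc)

In SAT proper, such splits would be learned recursively across many trees, with each leaf storing a distribution over words; the sketch only shows the semantic grouping and the single split decision that would occur at one node.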

Citation (APA)

Sun, C., & Nevatia, R. (2014). Semantic aware video transcription using random forest classifiers. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8689 LNCS, pp. 772–786). Springer Verlag. https://doi.org/10.1007/978-3-319-10590-1_50
