Double fusion for multimedia event detection

Zhen Zhong Lan; Lei Bao; Shoou I. Yu; Wei Liu; Alexander G. Hauptmann

Conference Proceedings

Double fusion for multimedia event detection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7131 LNCS 173-185

DOI: 10.1007/978-3-642-27355-1_18

45Citations

46Readers

Get full text

Abstract

Multimedia Event Detection is a multimedia retrieval task with the goal of finding videos of a particular event in an internet video archive, given example videos and descriptions. We focus here on mining features of example videos to learn the most characteristic features, which requires a combination of multiple complementary types of features. Generally, early fusion and late fusion are two popular combination strategies. The former one fuses features before performing classification and the latter one combines output of classifiers from different features. In this paper, we introduce a fusion scheme named double fusion, which combines early fusion and late fusion together to incorporate their advantages. Results are reported on TRECVID MED 2010 and 2011 data sets. For MED 2010, we get a mean minimal normalized detection cost (MNDC) of 0.49, which exceeds the state of the art performance by more than 12 percent. © 2012 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Lan, Z. Z., Bao, L., Yu, S. I., Liu, W., & Hauptmann, A. G. (2012). Double fusion for multimedia event detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7131 LNCS, pp. 173–185). https://doi.org/10.1007/978-3-642-27355-1_18

Double fusion for multimedia event detection

Abstract

Author supplied keywords

Cite

Register to see more suggestions