SVM-based audio classification for content-based multimedia retrieval

Yingying Zhu; Zhong Ming; Qiang Huang

Conference Proceedings

SVM-based audio classification for content-based multimedia retrieval

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4577 LNCS 474-482

DOI: 10.1007/978-3-540-73417-8_56

19Citations

12Readers

Get full text

Abstract

Audio classification is very important in multimedia retrieval such as audio indexing, analysis and content-based video retrieval. In this paper, we have proposed a clip-based support vector machine (SVM) approach to classify audio signals into six classes, which are pure speech, music, silence, environmental sound, speech with music and speech with environmental sound. The classification results are then used to partition a video into homogeneous audio segments, which is used to analyze and retrieve its higher-level content. The experimental results show that the proposed system not only improves classification accuracy, but also performs better than the other classification systems using the decision tree (DT), K Nearest Neighbor (K-NN) and Neural Network (NN). © Springer-Verlag Berlin Heidelberg 2007.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhu, Y., Ming, Z., & Huang, Q. (2007). SVM-based audio classification for content-based multimedia retrieval. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4577 LNCS, pp. 474–482). Springer Verlag. https://doi.org/10.1007/978-3-540-73417-8_56

SVM-based audio classification for content-based multimedia retrieval

Abstract

Author supplied keywords

Cite

Register to see more suggestions