Audio classification for radio broadcast indexing: Feature normalization and multiple classifiers decision

Christine Sénac; Eliathamby Ambikairajh

Journal Article

Audio classification for radio broadcast indexing: Feature normalization and multiple classifiers decision

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3332 882-889

DOI: 10.1007/978-3-540-30542-2_109

1Citations

3Readers

Get full text

Abstract

This paper presents a system that detects the two basic components (speech and music) in the context of radio broadcast indexing. The originality of the approach covers three different points: a differentiated modelling based on Gaussian Mixture Model (GMM), which permits the extraction of speech and music components separately, the normalization of commonly used features and the efficient fusion of classifiers for speech classification which provides a substantial improvement in the presence of strong background music: accuracy of the indexing system goes from [69.2%,94.2%] for the best classifier to [90.25%,98.56%] for the fusion. Evaluation was performed on 12 hours of radio broadcast recorded under various noise conditions, channels and containing diverse speech and music mixtures. © Springer-Verlag Berlin Heidelberg 2004.

Cite

CITATION STYLE

APA

Sénac, C., & Ambikairajh, E. (2004). Audio classification for radio broadcast indexing: Feature normalization and multiple classifiers decision. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3332, 882–889. https://doi.org/10.1007/978-3-540-30542-2_109

Audio classification for radio broadcast indexing: Feature normalization and multiple classifiers decision

Abstract

Cite

Register to see more suggestions