A multi-class method for detecting audio events in news broadcasts

Sergios Petridis; Theodoros Giannakopoulos; Stavros Perantonis

Conference Proceedings

A multi-class method for detecting audio events in news broadcasts

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6040 LNAI 399-404

DOI: 10.1007/978-3-642-12842-4_50

4Citations

5Readers

Get full text

Abstract

We propose a method for audio event detection in video streams from news. Apart from detecting speech, which is obviously the major class in such content, the proposed method detects five non-speech audio classes. The major difficulty of the particular task lies in the fact that most of the non-speech audio events are actually background sounds, with speech as the primary sound. We have adopted a set of 21 statistics computed on a mid-term basis over 7 audio features. A variation of the One Vs All classification architecture has been adopted and each binary classification problem is modeled using a separate probabilistic Support Vector Machine. Experiments have shown that the proposed method can achieve high precision rates for most of the audio events of interest. © Springer-Verlag Berlin Heidelberg 2010.

Author supplied keywords

Cite

CITATION STYLE

APA

Petridis, S., Giannakopoulos, T., & Perantonis, S. (2010). A multi-class method for detecting audio events in news broadcasts. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6040 LNAI, pp. 399–404). https://doi.org/10.1007/978-3-642-12842-4_50

A multi-class method for detecting audio events in news broadcasts

Abstract

Author supplied keywords

Cite

Register to see more suggestions