Audio segmentation-by-classification approach based on factor analysis in broadcast news domain

Diego Castán; Alfonso Ortega; Antonio Miguel; Eduardo Lleida

Journal ArticleOPEN ACCESS

Audio segmentation-by-classification approach based on factor analysis in broadcast news domain

Eurasip Journal on Audio, Speech, and Music Processing (2014) 2014(1) 1-13

DOI: 10.1186/s13636-014-0034-5

22Citations

22Readers

Abstract

This paper studies a novel audio segmentation-by-classification approach based on factor analysis. The proposed technique compensates the within-class variability by using class-dependent factor loading matrices and obtains the scores by computing the log-likelihood ratio for the class model to a non-class model over fixed-length windows. Afterwards, these scores are smoothed to yield longer contiguous segments of the same class by means of different back-end systems. Unlike previous solutions, our proposal does not make use of specific acoustic features and does not need a hierarchical structure. The proposed method is applied to segment and classify audios coming from TV shows into five different acoustic classes: speech, music, speech with music, speech with noise, and others. The technique is compared to a hierarchical system with specific acoustic features achieving a significant error reduction.

Author supplied keywords

Cite

CITATION STYLE

APA

Castán, D., Ortega, A., Miguel, A., & Lleida, E. (2014). Audio segmentation-by-classification approach based on factor analysis in broadcast news domain. Eurasip Journal on Audio, Speech, and Music Processing, 2014(1), 1–13. https://doi.org/10.1186/s13636-014-0034-5

Audio segmentation-by-classification approach based on factor analysis in broadcast news domain

Abstract

Author supplied keywords

Cite

Register to see more suggestions