An algorithm for detection of breath sounds in spontaneous speech with application to speaker recognition

Sri Harsha Dumpala; K. N.R.K.Raju Alluri

Conference Proceedings

An algorithm for detection of breath sounds in spontaneous speech with application to speaker recognition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10458 LNAI 98-108

DOI: 10.1007/978-3-319-66429-3_9

12Citations

12Readers

Get full text

Abstract

Automatic detection and demarcation of non-speech sounds in speech is critical for developing sophisticated human-machine interaction systems. The main objective of this study is to develop acoustic features capturing the production differences between speech and breath sounds in terms of both, excitation source and vocal tract system based characteristics. Using these features, a rule-based algorithm is proposed for automatic detection of breath sounds in spontaneous speech. The proposed algorithm outperforms the previous methods for detection of breath sounds in spontaneous speech. Further, the importance of breath detection for speaker recognition is analyzed by considering an i-vector-based speaker recognition system. Experimental results show that the detection of breath sounds, prior to i-vector extraction, is essential to nullify the effect of breath sounds occurring in test samples on speaker recognition, which otherwise will degrade the performance of i-vector-based speaker recognition systems.

Author supplied keywords

Cite

CITATION STYLE

APA

Dumpala, S. H., & Alluri, K. N. R. K. R. (2017). An algorithm for detection of breath sounds in spontaneous speech with application to speaker recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10458 LNAI, pp. 98–108). Springer Verlag. https://doi.org/10.1007/978-3-319-66429-3_9

An algorithm for detection of breath sounds in spontaneous speech with application to speaker recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions