Sparse DNN-based speaker segmentation using side information

Yong Ma; Chang Chun Bao

Journal Article, Open Access

Electronics Letters (2015) 51(8) 651-653

DOI: 10.1049/el.2015.0298

1 Citation

13 Readers

Abstract

Sparse deep neural networks (SDNNs) are proposed for speaker segmentation. First, the SDNNs are trained using side information, namely the class label of the input. Next, the SDNNs extract speaker-specific features from the super-vector feature of the speech signal. Finally, K-means clustering assigns a label to each speech frame, and these labels are used to segment the different speakers in a continuous speech stream. A performance evaluation on a multi-speaker speech stream corpus generated from the TIMIT database shows that the proposed speaker segmentation algorithm outperforms both the Bayesian information criterion method and the deep auto-encoder network method.
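
The final step of the abstract (per-frame K-means labels used to segment speakers) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy 2-D vectors stand in for SDNN-derived features, the farthest-first initialisation is an assumption made only to keep the example deterministic, and placing a boundary wherever consecutive frame labels differ is one plausible reading of how the labels yield segments. The names `kmeans_labels` and `change_points` are hypothetical.

```python
from typing import List, Sequence

Vector = Sequence[float]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_labels(frames: List[Vector], k: int, iters: int = 20) -> List[int]:
    """Assign each frame to one of k clusters with plain Lloyd iterations."""
    # Assumption: farthest-first initialisation (the abstract does not say
    # how K-means is initialised); it makes the sketch deterministic.
    centroids = [list(frames[0])]
    while len(centroids) < k:
        far = max(frames, key=lambda f: min(sq_dist(f, c) for c in centroids))
        centroids.append(list(far))

    labels = [0] * len(frames)
    for _ in range(iters):
        # Assignment step: nearest centroid for every frame.
        labels = [
            min(range(k), key=lambda j: sq_dist(f, centroids[j])) for f in frames
        ]
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [f for f, lab in zip(frames, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def change_points(labels: List[int]) -> List[int]:
    """Frame indices where the cluster label differs from the previous frame."""
    return [i for i in range(1, len(labels)) if labels[i] != labels[i - 1]]


# Toy stand-ins for speaker-specific features: two well-separated groups.
demo_frames = [
    [0.0, 0.1], [0.1, 0.0], [0.05, 0.05],  # speaker A
    [5.0, 5.1], [5.1, 4.9],                # speaker B
    [0.0, 0.0],                            # speaker A again
]
demo_labels = kmeans_labels(demo_frames, k=2)
demo_boundaries = change_points(demo_labels)  # boundaries at frames 3 and 5
```

In practice, frame-level labels are noisy, so a real system would likely smooth them (for example with a majority vote over a short window) before reading off boundaries; that step is omitted here.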

Cite (APA)

Ma, Y., & Bao, C. C. (2015). Sparse DNN-based speaker segmentation using side information. Electronics Letters, 51(8), 651–653. https://doi.org/10.1049/el.2015.0298
