Better phoneme recognisers lead to better phoneme posteriorgrams for search on speech? An experimental analysis

Paula Lopez-Otero; Laura Docio-Fernandez; Carmen Garcia-Mateo

Conference Proceedings

Better phoneme recognisers lead to better phoneme posteriorgrams for search on speech? An experimental analysis

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 10077 LNAI 128-137

DOI: 10.1007/978-3-319-49169-1_13

0Citations

2Readers

Get full text

Abstract

Phoneme posteriorgrams are widely used for speech representation when performing query-by-example search on speech. These posteriorgrams are computed by obtaining the per-frame a posteriori probability of each unit in a phoneme recogniser, regardless the architecture of this phoneme recogniser. It is straightforward to believe that the higher the quality of the phone transcriptions generated by a phoneme recogniser, the higher the quality of its resulting phoneme posteriorgrams; however, to the best of our knowledge, no analysis exist proving this statement. This paper aims at investigating whether there is a correlation between the phone error rate of a recogniser and the maximum term weighted value obtained when performing query-by-example search on speech. Experiments on the Albayzin corpus in Spanish language showed a slight correlation between these two metrics, which suggests that the goodness of phoneme posteriorgram representation is somehow related to phone error rate, but there are other factors that affect their performance in search on speech tasks.

Author supplied keywords

Cite

CITATION STYLE

APA

Lopez-Otero, P., Docio-Fernandez, L., & Garcia-Mateo, C. (2016). Better phoneme recognisers lead to better phoneme posteriorgrams for search on speech? An experimental analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10077 LNAI, pp. 128–137). Springer Verlag. https://doi.org/10.1007/978-3-319-49169-1_13

Better phoneme recognisers lead to better phoneme posteriorgrams for search on speech? An experimental analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions