Evaluation of acoustic word embeddings

Abstract

Recently, researchers in speech recognition have started to reconsider using whole words, rather than phonetic units, as the basic modeling unit. These systems rely on a function that embeds a speech segment of arbitrary or fixed dimension into a vector in a fixed-dimensional space, called an acoustic word embedding. As a result, speech segments of words that sound similar are projected close to one another in a continuous space. This paper focuses on the evaluation of acoustic word embeddings. We propose two approaches to evaluate the intrinsic performance of acoustic word embeddings in comparison with orthographic representations, in order to assess whether they capture discriminative phonetic information. Since the experiments target French, particular attention is paid to homophones.
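
As an illustration of what "close in a continuous space" can mean, the sketch below compares toy embedding vectors with cosine similarity. It is only a minimal example under stated assumptions: the abstract does not specify the distance measure or the evaluation procedure, and the vectors, the word list, and the helper names `cosine_similarity` and `nearest` are hypothetical, not taken from the paper.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical 3-dimensional embeddings, invented for illustration only.
# "vert" and "verre" are French homophones; "maison" sounds different.
embeddings = {
    "vert": [0.90, 0.10, 0.30],
    "verre": [0.88, 0.12, 0.31],
    "maison": [-0.20, 0.80, 0.50],
}

def nearest(word, table):
    """Return the other word whose vector is most similar to `word`'s."""
    return max(
        (w for w in table if w != word),
        key=lambda w: cosine_similarity(table[word], table[w]),
    )

print(nearest("vert", embeddings))
```

With these made-up vectors the homophone pair ends up closer together than the unrelated word, which is the kind of behavior an intrinsic evaluation of acoustic word embeddings would look for.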

Cite

Ghannay, S., Esteve, Y., Camelin, N., & Deleglise, P. (2016). Evaluation of acoustic word embeddings. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 62–66). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-2511
