Abstract
General-purpose automatic speech recognition (gpASR) systems such as Google, Watson, etc. sometimes output inaccurate sentences when used in a domain-specific scenario, as they may not have had enough training samples for that particular domain and context. Further, the speaker's accent and the environmental conditions in which a sentence is spoken may cause the speech engine to recognize certain words inaccurately. Given the domain and the environment in which the speaker speaks, gpASR output needs considerable improvement before it can provide effective speech interfaces to domain-specific systems. In this paper, we demonstrate a method that combines bio-inspired artificial development (ArtDev) with machine learning (ML) approaches to repair the output of a gpASR. Our method factors in the environment to tailor the repair process.
Cite
Anantaram, C., Sangroya, A., Rawat, M., & Chhabra, A. (2018). Repairing ASR output by Artificial Development and Ontology based Learning. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2018-July, pp. 5799–5801). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2018/842