Speech recognition using Dynamic Time Warping (DTW)

48Citations
Citations of this article
60Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Sound is one of the most common communication medias used by humans. Every human has different sound characteristics. To recognize the compatibility of a sound, a special algorithm is needed, which is Dynamic Time Warping (DTW). DTW is a method to measure the similarity of a pattern with different time zones. The smaller the distance produced, the more similar between the two sound patterns. Both sound patterns are similar, thus the two voices are said to be the same. The initial data on the speech recognition process is transformed into frequency waves. Pronounce volume, pronunciation time, and noise from the sound around the recording takes place affecting the distance generated. The smaller the effect, the smaller the distance that will be generated.

Cite

CITATION STYLE

APA

Permanasari, Y., Harahap, E. H., & Ali, E. P. (2019). Speech recognition using Dynamic Time Warping (DTW). In Journal of Physics: Conference Series (Vol. 1366). Institute of Physics Publishing. https://doi.org/10.1088/1742-6596/1366/1/012091

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free