A comparative performance study of five pitch detection algorithms was conducted. A speech data base, consisting of eight utterances spoken by three males, three females, and one child was constructed. Both telephone and wideband recordings were made of each of the utterances. For each of the utterances in the data base a “standard” pitch contour was semiautomatically measured using a highly sophistocated interactive pitch detection program. The “standard” pitch contour was then compared with the pitch contour that was obtained from each of the five programmed pitch detectors. The algorithms used in this study were (1) a center clipping, infinite-peak clipping, modified autocorrelation method; (2) the cepstral method; (3) the SIFT method; (4) the parallel processing time domain method; and (5) the data reduction method. A set of measurements were made on the pitch contours to quantify the various types of errors which occur in each of the above methods. Included among the error measurements were the average and standard deviation of the error in pitch period during voiced regions, the number of gross errors in the pitch period, and the average and standard deviation of the error in choosing onset and offset of voicing. By pooling the various error measurements, the individual pitch detectors could be rank ordered as a measure of this relative performance.
CITATION STYLE
Cheng, M. J., Rabiner, L. R., Rosenberg, A. E., & McGonegal, C. A. (1975). Comparative performance study of several pitch detection algorithms. The Journal of the Acoustical Society of America, 58(S1), S61–S62. https://doi.org/10.1121/1.2002228
Mendeley helps you to discover research relevant for your work.