Facing data scarcity using variable feature vector dimension

Pablo Daniel Agüero; Antonio Bonafonte

Conference Proceedings

Facing data scarcity using variable feature vector dimension

Proceedings of the International Conference on Speech Prosody (2006)

DOI: 10.21437/speechprosody.2006-119

3Citations

5Readers

Get full text

Abstract

This paper focuses on three key points of intonation modelling: interpolation of fundamental frequency contour, sentence by sentence parameter extraction and data scarcity. In some cases, they introduce noise and inconsistency on training data reducing the performance of machine learning techniques. We consider that the F0 contour is segmented into prosodic units (such as accent groups, minor phrases, etc). Each segment of F0 contour has a corresponding feature vector with linguistic and non-linguistic components. We propose to face the limitations mentioned above using a technique based on clustering using different feature vector dimensions. The clustering of feature vectors produces also a partition in the F0 contour space. The proposal consists on a procedure to select the dimension that contributes to predict the best fundamental frequency contour from a RMSE sense compared to a reference contour. Experimental results show an improvement compared to other approaches.

Cite

CITATION STYLE

APA

Agüero, P. D., & Bonafonte, A. (2006). Facing data scarcity using variable feature vector dimension. In Proceedings of the International Conference on Speech Prosody. International Speech Communication Association. https://doi.org/10.21437/speechprosody.2006-119

Facing data scarcity using variable feature vector dimension

Abstract

Cite

Register to see more suggestions