The CNG corpus of European Portuguese children's speech

Annika Hämäläinen; Silvia Rodrigues; Ana Júdice; Sandra Morgado Silva; António Calado; Fernando Miguel Pinto; Miguel Sales Dias

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013), vol. 8082 LNAI, pp. 544-551

DOI: 10.1007/978-3-642-40585-3_68

Abstract

Speech recognisers trained with adults' speech do not work well with children's speech because of the inherent acoustic and linguistic differences in the speech of these two populations. To develop speech-driven applications capable of successfully recognising children's speech, a sufficient amount of children's speech is needed for training acoustic models from scratch or for adapting acoustic models trained with adults' speech. However, the availability of suitable children's speech corpora is still limited, especially in the case of less-spoken languages. This paper describes the design, collection, transcription and annotation of a 21-hour corpus of prompted European Portuguese children's speech collected from 510 children aged 3-10. Before the development of this corpus, European Portuguese children's speech data had not been available at all for parts of this age range. © 2013 Springer-Verlag.

Cite

Hämäläinen, A., Rodrigues, S., Júdice, A., Silva, S. M., Calado, A., Pinto, F. M., & Dias, M. S. (2013). The CNG corpus of European Portuguese children’s speech. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8082 LNAI, pp. 544–551). https://doi.org/10.1007/978-3-642-40585-3_68
