This paper presents a European Portuguese database of hesitations in speech. The database, named HESITA, contains annotations of hesitation events such as filled pauses, vocalic extensions, truncated words, repetitions and substitutions. The hesitations were identified in 30 daily news programs collected from podcasts of a Portuguese television channel. The database also includes speaking style classification, acoustic information and annotations of other speech events. A statistical analysis of how often the hesitation events occur is presented. Because it documents how Portuguese speakers hesitate, the database can provide insights into the process of human speech communication. The HESITA database is freely available online to the research community.
CITATION STYLE
Candeias, S., Celorico, D., Proença, J., Veiga, A., & Perdigão, F. (2013). HESITA(tions) in Portuguese: a database. In 6th Workshop on Disfluency in Spontaneous Speech, DiSS 2013 (pp. 13–16). International Speech Communication Association (ISCA).