Acted clear speech corpus
Data CreatorMayo, Catherine
PublisherLISTA Consortium: (i) Language and Speech Laboratory, Universidad del Pais Vasco, Spain and Ikerbasque, Spain; (ii) Centre for Speech Technology Research, University of Edinburgh, UK; (iii) KTH Royal Institute of Technology, Sweden; (iv) Institute of Computer Science, FORTH, Greece
Relation (Is Referenced By)http://interspeech2012.org/accepted-abstract.html?id=661
MetadataShow full item record
CitationMayo, Catherine. (2013). Acted clear speech corpus, [sound]. LISTA Consortium: (i) Language and Speech Laboratory, Universidad del Pais Vasco, Spain and Ikerbasque, Spain; (ii) Centre for Speech Technology Research, University of Edinburgh, UK; (iii) KTH Royal Institute of Technology, Sweden; (iv) Institute of Computer Science, FORTH, Greece. http://dx.doi.org/10.7488/ds/138.
DescriptionSingle male native British English talker recorded producing 25 TIMIT sentences in 5 conditions, two natural: (i) quiet, (ii) while the talker listened to high-intensity speech-shaped noise, and three acted: (i) as if to a non-native listener, (ii) as if to a computer speech-recognition system, (iii) as if to an infant. Accompanied by automatic and hand-corrected phone-level transcription.
The following licence files are associated with this item:
Showing items related by title, author, creator and subject.
Listening test materials for "Evaluating comprehension of natural and synthetic conversational speech" Wester, Mirjam; Watts, Oliver; Henter, Gustav EjeCurrent speech synthesis methods typically operate on isolated sentences and lack convincing prosody when generating longer segments of speech. Similarly, prevailing TTS evaluation paradigms, such as intelligibility ...
Dall, RasmusData released in relation to the PhD thesis of Rasmus Dall. This contains: 1. Thesis pdf. 2. Released parallel corpora of read and spontaneous speech suitable for speech synthesis. 3. Experimental Data to enable ...
Valentini-Botinhao, CassiaClean and reverberant parallel speech database. The database was designed to train and test speech dereverberation methods that operate at 48kHz. A more detailed description can be found in the paper associated with the ...