Reverberant speech database for training speech dereverberation algorithms and TTS models
Data CreatorValentini-Botinhao, Cassia
PublisherUniversity of Edinburgh
MetadataShow full item record
CitationValentini-Botinhao, Cassia. (2016). Reverberant speech database for training speech dereverberation algorithms and TTS models, 2016 [dataset]. University of Edinburgh. http://dx.doi.org/10.7488/ds/1425.
DescriptionClean and reverberant parallel speech database. The database was designed to train and test speech dereverberation methods that operate at 48kHz. A more detailed description can be found in the paper associated with the database. Clean speech was made reverberant by convolving it with a room impulse response. The room impulse responses used to create this dataset were selected from: - The ACE challenge (http://www.commsp.ee.ic.ac.uk/~sap/projects/ace-challenge/) - The MIRD database (http://www.iks.rwth-aachen.de/en/research/tools-downloads/multichannel-impulse-response-database/) - The MARDY database (http://www.commsp.ee.ic.ac.uk/~sap/resources/mardy-multichannel-acoustic-reverberation-database-at-york-database/) The clean speech database was obtained from the Voice Banking Corpus, available here: http://homepages.inf.ed.ac.uk/jyamagis/release/VCTK-Corpus.tar.gz
Reverberant speech 48kHz waveforms containing 2 native English speakers with around 400 sentences each (Test set) (151.7Mb)
Reverberant speech 48kHz waveforms containing 28 native English speakers with around 400 sentences each (Train set 1) (2.427Gb)
Reverberant speech 48kHz waveforms containing 56 native English speakers with around 400 sentences each (Train set 2) (4.899Gb)
Showing items related by title, author, creator and subject.
Mayo, Catherine (LISTA Consortium: (i) Language and Speech Laboratory, Universidad del Pais Vasco, Spain and Ikerbasque, Spain; (ii) Centre for Speech Technology Research, University of Edinburgh, UK; (iii) KTH Royal Institute of Technology, Sweden; (iv) Institute of Computer Science, FORTH, Greece, 2013-09-24)Single male native British English talker recorded producing 25 TIMIT sentences in 5 conditions, two natural: (i) quiet, (ii) while the talker listened to high-intensity speech-shaped noise, and three acted: (i) as if to ...
Listening test materials for "Evaluating comprehension of natural and synthetic conversational speech" Wester, Mirjam; Watts, Oliver; Henter, Gustav EjeCurrent speech synthesis methods typically operate on isolated sentences and lack convincing prosody when generating longer segments of speech. Similarly, prevailing TTS evaluation paradigms, such as intelligibility ...
Dall, RasmusData released in relation to the PhD thesis of Rasmus Dall. This contains: 1. Thesis pdf. 2. Released parallel corpora of read and spontaneous speech suitable for speech synthesis. 3. Experimental Data to enable ...