Show simple item record

Depositordc.contributorValentini-Botinhao, Cassia
Funderdc.contributor.otherEPSRC - Engineering and Physical Sciences Research Councilen_UK
Data Creatordc.creatorValentini-Botinhao, Cassia
Date Accessioneddc.date.accessioned2016-03-22T11:04:35Z
Date Availabledc.date.available2016-03-22T11:04:35Z
Citationdc.identifier.citationValentini-Botinhao, Cassia. (2016). Noisy speech database for training speech enhancement algorithms and TTS models, [dataset]. University of Edinburgh. School of Informatics. Centre for Speech Technology Research (CSTR). https://doi.org/10.7488/ds/1356.en
Persistent Identifierdc.identifier.urihttp://hdl.handle.net/10283/1942
Persistent Identifierdc.identifier.urihttps://doi.org/10.7488/ds/1356
Dataset Description (abstract)dc.description.abstract## SUPERSEDED: THIS DATASET HAS BEEN REPLACED by the one which can be found at https://doi.org/10.7488/ds/2117. ## Clean and noisy parallel speech database. The database was designed to train and test speech enhancement methods that operate at 48kHz. A more detailed description can be found in the paper associated with the database. Some of the noises were obtained from the Demand database, available here: http://parole.loria.fr/DEMAND/ The speech database was obtained from the Voice Banking Corpus, available here: http://homepages.inf.ed.ac.uk/jyamagis/release/VCTK-Corpus.tar.gzen_UK
Dataset Description (TOC)dc.description.tableofcontentsThe files are wav format audio data sampled at 48kHz. Each file contains a sentence recorded by a range of speakers in quiet studio conditions. This audio material was added to a range of different noise signals, constituting the parallel noisy dataset. Accompanying each audio file there is a text file containing the orthographic transcription of what was said in that particular audio sample.en_UK
Languagedc.language.isoengen_UK
Publisherdc.publisherUniversity of Edinburgh. School of Informatics. Centre for Speech Technology Research (CSTR)en_UK
Relation (Is Referenced By)dc.relation.isreferencedbyCassia Valentini-Botinhao, Xin Wang, Shinji Takaki and Junichi Yamagishi. 2016. "Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks" in Interspeech 2016.en_UK
Superseded Bydc.relation.isreplacedbyhttps://doi.org/10.7488/ds/2117
Rightsdc.rightsCreative Commons Attribution 4.0 International Public Licenseen
Sourcedc.sourcehttp://parole.loria.fr/DEMAND/en_UK
Sourcedc.sourcehttp://homepages.inf.ed.ac.uk/jyamagis/release/VCTK-Corpus.tar.gzen_UK
Subjectdc.subjectnoisy speechen_UK
Subjectdc.subjectspeech enhancementen_UK
Subjectdc.subjectspeech synthesisen_UK
Subjectdc.subjectVoice Bank Corpusen_UK
Subjectdc.subjectDemand Corpusen_UK
Subject Classificationdc.subject.classificationMathematical and Computer Sciences::Speech and Natural Language Processingen_UK
Titledc.title## SUPERSEDED: THIS DATASET HAS BEEN REPLACED. ## Noisy speech database for training speech enhancement algorithms and TTS modelsen_UK
Typedc.typedataseten_UK

Download All
zip file MD5 Checksum: bc6e16ffccf6a36d7eac3fc557e99ea2

Files in this item

Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record