The Voice Conversion Challenge 2016
Data CreatorTomoki, Toda
PublisherUniversity of Edinburgh. School of Informatics. Centre for Speech Technology Research
Relation (Is Referenced By)https://doi.org/10.21437/Interspeech.2016-1066
MetadataShow full item record
CitationTomoki, Toda; Chen, Ling-Hui; Saito, Daisuke; Villavicencio, Fernando; Wester, Mirjam; Wu, Zhizheng; Yamagishi, Junichi. (2016). The Voice Conversion Challenge 2016, 2016 [dataset]. University of Edinburgh. School of Informatics. Centre for Speech Technology Research. https://doi.org/10.7488/ds/1575.
DescriptionThe Voice Conversion Challenge (VCC) 2016, one of the special sessions at Interspeech 2016, deals with speaker identity conversion, referred as Voice Conversion (VC). The task of the challenge was speaker conversion, i.e., to transform the voice identity of a source speaker into that of a target speaker while preserving the linguistic content. Using a common dataset consisting of 162 utterances for training and 54 utterances for evaluation from each of 5 source and 5 target speakers, 17 groups working in VC around the world developed their own VC systems for every combination of the source and target speakers, i.e., 25 systems in total, and generated voice samples converted by the developed systems. The objective of the VCC was to compare various VC techniques on identical training and evaluation speech data. The samples were evaluated in terms of target speaker similarity and naturalness by 200 listeners in a controlled environment. This dataset consists of the participants' VC submissions and the listening test results for naturalness and similarity. For further information please see the accompanying paper "Interspeech2016_VC_challenge_description.pdf" included in this dataset. See also "The Voice Conversion Challenge, 2016: multidimensional scaling (MDS) listening test results" (DOI: 10.7488/ds/1504).
Listening test results: VCC questionnaire responses - spreadsheet with all descriptions of systems provided by participants (503.5Kb)
Participant submissions: sound files, source, target, baseline and participants' submissions (1.940Gb)
Tomoki Toda, Ling-Hui Chen, Daisuke Saito,Fernando Villavicencio, Mirjam Wester, Zhizheng Wu, Junichi Yamagishi "The Voice Conversion Challenge 2016" in Proc. of Interspeech, San Francisco. (195.8Kb)
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi "Analysis of the Voice Conversion Challenge 2016 Evaluation Results" in Proc. of Interspeech 2016. (227.3Kb)