The School of Informatics is the largest, longest established and highest quality research group in informatics in the UK.

Research within the School is carried out across a number of institutes. The research programmes organised by the School of Informatics encompass a wide range of domains. Currently these include Artificial Life, Bioinformatics, Computational Thinking, Machine Learning, Music Informatics, Processes, Events & Activity, Software Engineering and System Level Integration.

Sub-communities within this community

Collections in this community

Recent Submissions

  • Parallel Audiobook Corpus 

    Ribeiro, Manuel Sam
    The Parallel Audiobook Corpus (version 1.0) is a collection of parallel readings of audiobooks. The corpus consists of approximately 121 hours of speech at 22.05KHz across 4 books and 59 speakers. The data is provided in ...
  • CINIC-10 Is Not ImageNet or CIFAR-10 

    Darlow, Luke N; Crowley, Elliot J; Antoniou, Antreas; Storkey, Amos
    CINIC-10 is an augmented extension of CIFAR-10. It contains the images from CIFAR-10 (60,000 images, 32x32 RGB pixels) and a selection of ImageNet database images (210,000 images downsampled to 32x32). It was compiled as ...
  • Manual and automatic labels for version 1.0 of UXTD, UXSSD, and UPX core data -- version 1.0 

    Eshky, Aciel; Ribeiro, Manuel Sam; Cleland, Joanne; Renals, Steve; Richmond, Korin; Roxburgh, Zoe; Scobbie, James; Wrench, Alan
    UltraSuite is a repository of ultrasound and acoustic data from child speech therapy sessions. The current release includes three data collections, one from typically developing children (UXTD) and two from children with ...
  • The Voice Conversion Challenge 2018: database and results 

    Lorenzo-Trueba, Jaime; Yamagishi, Junichi; Toda, Tomoki; Saito, Daisuke; Villavicencio, Fernando; Kinnunen, Tomi; Ling, Zhenhua
    Voice conversion (VC) is a technique to transform a speaker identity included in a source speech waveform into a different one while preserving linguistic information of the source speech waveform. In 2016, we have ...
  • The 2nd Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof 2017) Database, Version 2 

    Kinnunen, Tomi; Sahidullah, Md; Delgado, Héctor; Todisco, Massimiliano; Evans, Nicholas; Yamagishi, Junichi; Lee, Kong Aik
    This is a database used for the Second Automatic Speaker Verification Spoofing and Countermeasuers Challenge, for short, ASVspoof 2017 (http://www.asvspoof.org) organized by Tomi Kinnunen, Md Sahidullah, Héctor Delgado, ...
  • Device Recorded VCTK (Small subset version) 

    Sarfjoo, Seyyed Saeed; Yamagishi, Junichi
    This dataset is a new variant of the voice cloning toolkit (VCTK) dataset: device-recorded VCTK (DR-VCTK), where the high-quality speech signals recorded in a semi-anechoic chamber using professional audio devices are ...
  • SUPERSEDED - The 2nd Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof 2017) Database, Version 2 

    Kinnunen, Tomi; Sahidullah, Md; Delgado, Héctor; Todisco, Massimiliano; Evans, Nicholas; Yamagishi, Junichi; Lee, Kong Aik
    ## This item has been replaced by the one which can be found at http://dx.doi.org/10.7488/ds/2332 ##
  • SUPERSEDED - The 2nd Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof 2017) Database, Version 2 

    Kinnunen, Tomi; Sahidullah, Md; Delgado, Héctor; Todisco, Massimiliano; Evans, Nicholas; Yamagishi, Junichi; Lee, Kong Aik
    ## This item has been replaced by the one which can be found at http://dx.doi.org/10.7488/ds/2332 ##
  • Dutch English Lombard Speech Native and Non-Native (DELNN) 

    Marcoux, Katherine; Ernestus, Mirjam; King, Simon
    The DELNN (Dutch English Lombard speech Native and Non-Native) corpus consists of 30 native Dutch speakers reading sentences in a quiet environment and in a noisy environment, to elicit Lombard speech. The Dutch speakers ...
  • Radboud Lombard Corpus_Dutch 

    Shen, Chen; Janse, Esther; King, Simon
    This data set contains 54 (12 for now) native Dutch speakers' Dutch sentence-reading material (48 sentences in natural and 48 sentences in Lombard condition per speaker).
  • Triangulating Context Lemmas 

    McLaughlin, Craig; McKinna, James; Stark, Ian
    Agda formalisation to accompany the paper "Triangulating Context Lemmas" by Craig McLaughlin, James McKinna and Ian Stark. DOI 10.1145/3167081.
  • SUPERSEDED - Device Recorded VCTK (Small subset version) 

    Sarfjoo, Seyyed Saeed; Yamagishi, Junichi
    ## This item has been replaced by the one which can be found at http://dx.doi.org/10.7488/ds/2316 ## This dataset is a new variant of the voice cloning toolkit (VCTK) dataset: device-recorded VCTK (DR-VCTK), where the ...
  • Macro-socioeconomic-energy Model for Technology Pathways, 7see-GB, 2017 

    Roberts, Simon; Foran, Barney; Axon, Colin; Warr, Benjamin; Goddard, Nigel
    In a resource-constrained world with growing population and demand for energy, goods, and services with commensurate environmental impacts, we need to understand how these trends relate to various aspects of economic ...
  • Noisy reverberant speech database for training speech enhancement algorithms and TTS models 

    Valentini-Botinhao, Cassia
    Noisy reverberant speech database. The database was designed to train and test speech enhancement (noise suppression and dereverberation) methods that operate at 48kHz. Clean speech was made reverberant and noisy by ...
  • Noisy speech database for training speech enhancement algorithms and TTS models 

    Valentini-Botinhao, Cassia
    Clean and noisy parallel speech database. The database was designed to train and test speech enhancement methods that operate at 48kHz. A more detailed description can be found in the papers associated with the database. ...
  • SUPERSEDED - The 2nd Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof 2017) Database 

    Kinnunen, Tomi; Sahidullah, Md; Delgado, Héctor; Todisco, Massimiliano; Evans, Nicholas; Yamagishi, Junichi; Lee, Kong Aik
    ## This item has been replaced by the one which can be found at http://dx.doi.org/10.7488/ds/2332 ## This is a database used for the Second Automatic Speaker Verification Spoofing and Countermeasuers Challenge, for short, ...
  • 96kHz version of the CSTR VCTK Corpus 

    Veaux, Christophe; Yamagishi, Junichi
    This dataset includes 96kHz version of the CSTR VCTK Corpus including speech data uttered by 109 native speakers of English with various accents. The main dataset can be found at http://dx.doi.org/10.7488/ds/1994 (containing ...
  • Key files for Spoofing and Anti-Spoofing (SAS) corpus v1.0 

    Wu, Zhizheng; Khodabakhsh, Ali; Demiroglu, Cenk; Yamagishi, Junichi; Saito, Daisuke; Toda, Tomoki; Ling, Zhen-Hua; King, Simon
    These files are complementary to the fileset: Wu et al. (2015). Spoofing and Anti-Spoofing (SAS) corpus v1.0, [dataset]. University of Edinburgh. The Centre for Speech Technology Research (CSTR). http://dx.doi.org/10.7488/ds/252. ...
  • Thesis Material - Rasmus Dall 

    Dall, Rasmus
    Data released in relation to the PhD thesis of Rasmus Dall. This contains: 1. Thesis pdf. 2. Released parallel corpora of read and spontaneous speech suitable for speech synthesis. 3. Experimental Data to enable ...
  • CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit 

    Veaux, Christophe; Yamagishi, Junichi; MacDonald, Kirsten
    This CSTR VCTK Corpus (Centre for Speech Technology Voice Cloning Toolkit) includes speech data uttered by 109 native speakers of English with various accents. 96kHz versions of the recordings are available at http://dx. ...

View all