Watson, Mick. (2018). Open prediction of polysaccharide utilisation loci (PUL) in 5414 public Bacteroidetes genomes using PULpy, [dataset]. The Roslin Institute and Royal (Dick) School of Veterinary Studies. University of Edinburgh. http://dx.doi.org/10.7488/ds/2438.
Polysaccharide utilisation loci (PUL) are regions within the genomes of Bacteroidetes that encode all the necessary machinery for the cleavage of particular carbohydrates. Prediction of PUL from genomic data alone involves the identification of carbohydrate-active enzymes (CAZymes) co-localised with susCD gene pairs. Here we present the open prediction of PUL in 5414 public Bacteroidetes genomes, and an open-source pipeline to reproduce or extend the results. The PULpy code "Open prediction of Polysaccharide Utilisation Loci (PUL)" can be obtained via GitHub as documented in the attached README.txt file.
The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336, VAT Registration Number GB 592 9507 00, and is acknowledged by the UK authorities as a “Recognised body” which has been granted degree awarding powers.