Karampatsis, Rafael-Michael. (2019). ManySStuBs4J Dataset, [dataset]. University of Edinburgh. College of Science & Engineering. School of Informatics. Institute for Language, Cognition and Computation (ILCC). https://doi.org/10.7488/ds/2528.
The ManySStuBs4J corpus contains simple statement bugs mined from open-source Java projects hosted in GitHub. There are two variations of the dataset. One mined from the 100 Java Maven Projects and one mined from the top 1000 Java Projects.
A project's popularity is determined by computing the sum of z-scores of its forks and watchers.
See "README.txt" for further details.
The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336, VAT Registration Number GB 592 9507 00, and is acknowledged by the UK authorities as a “Recognised body” which has been granted degree awarding powers.