KIT | KIT-Bibliothek | Impressum | Datenschutz

Replication Package for "Criminal Minds: How First-Year CS Students Plagiarize Code"

Maisch, Robin ORCID iD icon 1; Schmid, Larissa ORCID iD icon; Glassey, Richard; Fuchß, Dominik ORCID iD icon 1; Niehues, Nils ORCID iD icon 1; Liu, Haoyu ORCID iD icon 1; Koziolek, Anne ORCID iD icon 1
1 Institut für Informationssicherheit und Verlässlichkeit (KASTEL), Karlsruher Institut für Technologie (KIT)

Abstract:

Dataset for "Criminal Minds: How First-Year CS Students Plagiarize Code"
In this repository, we share the 10 original and 35 plagiarized code submissions that we collected during the study.

From the original dataset, we
* removed 12 plagiarized programs which were unchanged compared to their original
* mapped the names and IDs of participants to a random five-letter pseudonym identifier
* removed files related to IDEs, Git, Maven, and compiled artifacts.

We provide the following artifacts:
* cloc: Counts of lines of code and documentation for each submission
* code: the original (code/orig/) and plagiarized (code/plag/) submissions, each named to map to the original submissions (o1-o5), study group (A and B) and university (uni1 and uni2).
* logs: analysis results of the change logs. For a final version, we will also supply the proper change logs.
* similarity: analysis results of the plagiarism detection system JPlag, including the executable and instructions to replicate
* survey: survey results as raw data. For a final version, we will include free text responses.
* task: the task description for Dots and Boxes.


Download
Originalveröffentlichung
DOI: 10.5281/ZENODO.19115558
Zugehörige Institution(en) am KIT Institut für Informationssicherheit und Verlässlichkeit (KASTEL)
Publikationstyp Forschungsdaten
Publikationsdatum 19.03.2026
Identifikator KITopen-ID: 1000191845
Lizenz Creative Commons Namensnennung 4.0 International
Art der Forschungsdaten Dataset
Nachgewiesen in OpenAlex
Relationen in KITopen
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page