
Same Same But Different: Preventing Refactoring Attacks on Software Plagiarism Detection

Maisch, Robin 1; Schmid, Larissa 1; Sağlam, Timur 1; Niehues, Nils 1
1 Institute of Information Security and Dependability (KASTEL), Karlsruhe Institute of Technology (KIT)

Abstract:

Plagiarism detection in programming education faces growing challenges due to increasingly sophisticated obfuscation techniques, particularly automated refactoring-based attacks. While the code plagiarism detection systems used in educational practice are resilient against basic obfuscation, they struggle against structural modifications that preserve program behavior, especially those introduced by refactoring-based obfuscation. This paper presents a novel and extensible framework that enhances state-of-the-art detectors by leveraging code property graphs and graph transformations to counteract refactoring-based obfuscation. Our comprehensive evaluation on real-world student submissions, obfuscated using both algorithmic and AI-based obfuscation attacks, demonstrates a significant improvement in detecting plagiarized code.


Publisher's version
DOI: 10.5445/IR/1000191769
Published on 30 March 2026
Original publication
DOI: 10.1145/3744916.3773225
Affiliated institution(s) at KIT: Institute of Information Security and Dependability (KASTEL)
Publication type: Proceedings contribution
Publication year: 2026
Language: English
Identifier: ISBN 979-8-4007-2025-3
KITopen-ID: 1000191769
HGF program: 46.23.01 (POF IV, LK 01) Methods for Engineering Secure Systems
Published in: IEEE/ACM 48th International Conference on Software Engineering (ICSE ’26)
Event: 48th International Conference on Software Engineering (ICSE 2026), Rio de Janeiro, Brazil, 12 April 2026 to 18 April 2026
Publisher: Association for Computing Machinery (ACM)
External relations: Supplement
Keywords: Software Plagiarism Detection, Plagiarism Obfuscation, Obfuscation Attacks, Code Property Graph, Refactoring, Tokenization