KIT | KIT-Bibliothek | Impressum | Datenschutz

CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials

Naumann, Alexander ORCID iD icon 1; Hertlein, Felix 2; Höllig, Jacqueline ORCID iD icon 2; Cazzonelli, Lucas 3; Thoma, Steffen 3
1 Institut für Operations Research (IOR), Karlsruher Institut für Technologie (KIT)
2 Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB), Karlsruher Institut für Technologie (KIT)
3 FZI Forschungszentrum Informatik (FZI)

Abstract (englisch):

Programming tutorials in the form of coding screencasts play a crucial role in programming education, serving both novices and experienced developers. However, the video format of these tutorials presents a challenge due to the difficulty of searching for and within videos. Addressing the absence of large-scale and diverse datasets for screencast analysis, we introduce the CodeSCAN dataset. It comprises 12,000 screenshots captured from the Visual Studio Code environment during development, featuring 24 programming languages, 25 fonts, and over 90 distinct themes, in addition to diverse layout changes and realistic user interactions. Moreover, we conduct detailed quantitative and qualitative evaluations to benchmark the performance of Integrated Development Environment (IDE) element detection, color-to-black-and-white conversion, and Optical Character Recognition (OCR). We hope that our contributions facilitate more research in coding screencast analysis, and we make the source code for creating the dataset and the benchmark publicly available at a-nau.github.io/codescan.


Download
Originalveröffentlichung
DOI: 10.5220/0013093100003912
Dimensions
Zitationen: 1
Zugehörige Institution(en) am KIT Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
Institut für Operations Research (IOR)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2025
Sprache Englisch
Identifikator ISBN: 978-9-8975-8728-3
ISSN: 2184-5921
KITopen-ID: 1000181695
Erschienen in Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications; Porto, Portugal, 26.-28.02.2025
Veranstaltung 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (2025), Porto, Portugal, 26.02.2025 – 28.02.2025
Verlag SciTePress
Seiten S. 269 – 277
Serie VISAPP ; 2
Schlagwörter Computer Vision, Optical Character Recognition, Object Detection, Image Binarization, Datasets.
Nachgewiesen in Dimensions
OpenAlex
Scopus
Globale Ziele für nachhaltige Entwicklung Ziel 4 – Hochwertige Bildung
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page