KIT | KIT-Bibliothek | Impressum | Datenschutz

Koder - A multi-register corpus for investigating register variation in contemporary German

Costa, Andressa ORCID iD icon 1
1 Institut für Technikzukünfte (ITZ), Karlsruher Institut für Technologie (KIT)

Abstract:

This paper introduces the design decisions in building the Koder corpus, a multi-register-corpus of contemporary German. The purpose of this corpus is to serve as a basis for the investigation into the use of German across registers. In order to construct a representative corpus, the essential considerations are: the type and number of registers to include, the number of texts in each register and minimal text length. The paper describes which aspects were central in determining these issues as well the corpus composition and the necessary text processing.


Verlagsausgabe §
DOI: 10.5445/IR/1000186666/pub
Veröffentlicht am 10.11.2025
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Technikzukünfte (ITZ)
Publikationstyp Zeitschriftenaufsatz
Publikationsjahr 2019
Sprache Englisch
Identifikator ISSN: 2243-4712
KITopen-ID: 1000186666
Erschienen in Research in Corpus Linguistics
Verlag Asociación Española de Lingüística de Corpus
Band 7
Seiten 69–83
Schlagwörter corpus design; register; German
Nachgewiesen in OpenAlex
Dimensions
Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page