KIT | KIT-Bibliothek | Impressum | Datenschutz

Automated evaluation of out-of-context errors

Huber, P. 1; Niehues, J. ORCID iD icon 1; Waibel, A. 1
1 Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract:

We present a new approach to evaluate computational models for the task of text understanding by the means of out-of-context error
detection. Through the novel design of our automated modification process, existing large-scale data sources can be adopted for a vast
number of text understanding tasks. The data is thereby altered on a semantic level, allowing models to be tested against a challenging
set of modified text passages that require to comprise a broader narrative discourse. Our newly introduced task targets actual real-world problems of transcription and translation systems by inserting authentic out-of-context errors. The automated modification process is applied to the 2016 TEDTalk corpus. Entirely automating the process allows the adoption of complete datasets at low cost, facilitating supervised learning procedures and deeper networks to be trained and tested. To evaluate the quality of the modification algorithm a language model and a supervised binary classification model are trained and tested on the altered dataset. A human baseline evaluation is examined to compare the results with human performance. The outcome of the evaluation task indicates the difficulty to detect semantic errors for machine-learning algorithms and humans, showing that the errors cannot be identified when limited to a single sentence.


Verlagsausgabe §
DOI: 10.5445/IR/1000090652
Veröffentlicht am 02.06.2025
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsmonat/-jahr 05.2018
Sprache Englisch
Identifikator ISBN: 979-1-09-554600-9
KITopen-ID: 1000090652
Erschienen in 11th International Conference on Language Resources and Evaluation, LREC 2018; Phoenix Seagaia Conference CenterMiyazaki; Japan; 7 May 2018 through 12 May 2018. Ed.: H. Isahara
Veranstaltung 11th Language Resources and Evaluation Conference (LREC 2018), Miyazaki, Japan, 07.05.2018 – 12.05.2018
Verlag European Language Resources Association (ELRA)
Seiten 2022-2026
Externe Relationen Abstract/Volltext
Nachgewiesen in Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page