Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces

Sperber, Matthias; Neubig, Graham; Nakamura, Satoshi; Waibel, Alex


Computer-assisted transcription promises high-quality speech transcription at reduced cost. This is achieved by limiting human effort to transcribing the parts for which automatic transcription quality is insufficient. Our goal is to improve human transcription quality through appropriate user interface design. We focus on iterative interfaces that allow humans to solve tasks starting from an initial suggestion, in this case an automatic transcription. We conduct a user study that reveals considerable quality gains for three variations of iterative interfaces over a non-iterative from-scratch transcription interface. Our iterative interfaces include post-editing, confidence-enhanced post-editing, and a novel retyping interface. All three yielded similar quality on average, but we found that the proposed retyping interface was less sensitive to the difficulty of the segment, and superior when the automatic transcription of the segment contained relatively many errors. An analysis using mixed-effects models allows us to quantify these and other factors and to draw conclusions about which interface design should be chosen under which circumstances.

DOI: 10.5445/IR/1000166283
Published on: 17.01.2024
Affiliated institution(s) at KIT: Institut für Anthropomatik und Robotik (IAR)
Publication type: Proceedings paper
Publication year: 2016
Language: English
Identifier: KITopen-ID: 1000166283
Published in: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16). Ed.: N. Calzolari, K. Choukri, T. Declerck, S. Goggi, M. Grobelnik, B. Maegaard, J. Mariani, H. Mazo, A. Moreno, J. Odijk, S. Piperidis
Event: 10th Language Resources and Evaluation Conference (LREC 2016), Portorož, Slovenia, 23.05.2016 – 28.05.2016
Publisher: European Language Resources Association (ELRA)
Pages: 1986–1992
