
CRF-based Disfluency Detection using Semantic Features for German to English Spoken Language Translation

Cho, Eunah; Ha, Thanh-Le; Waibel, Alex

Abstract:

Disfluencies pose severe difficulties for machine translation of spontaneous speech. This paper presents our conditional random field (CRF)-based speech disfluency detection system, developed on German to improve spoken language translation performance. To detect speech disfluencies while taking the syntax and semantics of utterances into account, we use a CRF-based approach with information learned from word representations and from the phrase table used for machine translation. The word representations are learned with recurrent neural networks, and the projected words are clustered using the k-means algorithm. Using the output of the model trained with the word representations and phrase table information, we achieve an improvement of 1.96 BLEU points on the lecture test set. By keeping or removing human-annotated disfluencies, we show a lower and an upper bound on translation quality, respectively. In an oracle experiment we gain 3.16 BLEU points on the lecture test set, compared to the same set with all disfluencies retained.
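
As a rough illustration of the pipeline the abstract outlines (word vectors clustered with k-means, cluster IDs then used as token-level features for a sequence labeler), the following minimal sketch may help. It is not the authors' implementation: the 2-D toy vectors, the choice of two clusters, the feature names, and the helper functions `kmeans` and `token_features` are all hypothetical, and the abstract does not state the actual embedding size, cluster count, or feature set.

```python
import random

def kmeans(vectors, k, iters=20, seed=0):
    """Plain Lloyd's k-means on lists of floats; returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = [0] * len(vectors)
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(v, centroids[c])))
            for v in vectors
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, l in zip(vectors, labels) if l == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return centroids, labels

# Hypothetical toy "word representations" (real ones would come from an RNN).
embeddings = {
    "äh": [0.9, 0.1], "ähm": [1.0, 0.0],   # filler-like words
    "ich": [0.0, 1.0], "wir": [0.1, 0.9],  # pronouns
}
words = list(embeddings)
_, labels = kmeans([embeddings[w] for w in words], k=2)
word2cluster = dict(zip(words, labels))

def token_features(tokens, i, word2cluster):
    """Feature dict for token i; -1 marks words without a cluster (assumed convention)."""
    word = tokens[i].lower()
    prev = tokens[i - 1].lower() if i > 0 else "<s>"
    return {
        "word": word,
        "cluster": word2cluster.get(word, -1),
        "prev_word": prev,
        "same_as_prev": prev == word,  # crude repetition cue
    }

tokens = ["ich", "äh", "ich", "gehe"]
feats = [token_features(tokens, i, word2cluster) for i in range(len(tokens))]
```

Each entry of `feats` is a per-token feature dictionary of the kind a CRF toolkit could take as input; the phrase-table features mentioned in the abstract are not modeled here.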


Publisher's version
DOI: 10.5445/IR/1000166327
Published on: 6 February 2024
Affiliated institution(s) at KIT: Institut für Anthropomatik und Robotik (IAR)
Publication type: Proceedings contribution
Publication year: 2013
Language: English
Identifier: KITopen-ID: 1000166327
Published in: Proceedings of the 10th International Workshop on Spoken Language Translation: Papers. Ed.: J. Y. Zhang
Event: 10th International Workshop on Spoken Language Translation (IWSLT 2013), Heidelberg, Germany, 5–6 December 2013
Publisher: Association for Computational Linguistics (ACL)