KIT | KIT-Bibliothek | Impressum | Datenschutz

The 2013 KIT IWSLT Speech-to-Text Systems for German and English

Kilgour, Kevin; Mohr, Christian; Heck, Michael; Nguyen, Quoc Bao; Nguyen, Van Huy; Shin, Evgeniy; Tseyzer, Igor; Gehring, Jonas; Müller, Markus; Sperber, Matthias; Stüker, Sebastian; Waibel, Alex

Abstract:

This paper describes our English Speech-to-Text (STT) systems for the 2013 IWSLT TED ASR track. The systems consist of multiple subsystems that are combinations of different front-ends, e.g. MVDR-MFCC based and lMel based ones, GMM and NN acoustic models and different phone sets. The outputs of the subsystems are combined via confusion network combination. Decoding is done in two stages, where the systems of the second stage are adapted in an unsupervised manner on the combination of the first stage outputs using VTLN, MLLR, and cMLLR.


Verlagsausgabe §
DOI: 10.5445/IR/1000166322
Veröffentlicht am 07.02.2024
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2013
Sprache Englisch
Identifikator KITopen-ID: 1000166322
Erschienen in Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign. Ed.: J. Y. Zhang
Veranstaltung 10th International Workshop on Spoken Language Translation (IWSLT 2013), Heidelberg, Deutschland, 05.12.2013 – 06.12.2013
Verlag Association for Computational Linguistics (ACL)
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page