The 2013 KIT IWSLT Speech-to-Text Systems for German and English

Kilgour, Kevin; Mohr, Christian; Heck, Michael; Nguyen, Quoc Bao; Nguyen, Van Huy; Shin, Evgeniy; Tseyzer, Igor; Gehring, Jonas; Müller, Markus; Sperber, Matthias; Stüker, Sebastian; Waibel, Alex

The 2013 KIT IWSLT Speech-to-Text Systems for German and English

Kilgour, Kevin; Mohr, Christian; Heck, Michael; Nguyen, Quoc Bao; Nguyen, Van Huy; Shin, Evgeniy; Tseyzer, Igor; Gehring, Jonas; Müller, Markus; Sperber, Matthias; Stüker, Sebastian; Waibel, Alex

Abstract:

This paper describes our English Speech-to-Text (STT) systems for the 2013 IWSLT TED ASR track. The systems consist of multiple subsystems that are combinations of different front-ends, e.g. MVDR-MFCC based and lMel based ones, GMM and NN acoustic models and different phone sets. The outputs of the subsystems are combined via confusion network combination. Decoding is done in two stages, where the systems of the second stage are adapted in an unsupervised manner on the combination of the first stage outputs using VTLN, MLLR, and cMLLR.

KITopen-Download

Verlagsausgabe

DOI: 10.5445/IR/1000166322

Veröffentlicht am 07.02.2024

Export

Statistiken

Seitenaufrufe: 236
seit 07.02.2024

Downloads: 239
seit 08.02.2024

Zugehörige Institution(en) am KIT	Institut für Anthropomatik und Robotik (IAR)
Publikationstyp	Proceedingsbeitrag
Publikationsjahr	2013
Sprache	Englisch
Identifikator	KITopen-ID: 1000166322
Erschienen in	Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign. Ed.: J. Y. Zhang
Veranstaltung	10th International Workshop on Spoken Language Translation (IWSLT 2013), Heidelberg, Deutschland, 05.12.2013 – 06.12.2013
Verlag	Association for Computational Linguistics (ACL)

Repository KITopen

The 2013 KIT IWSLT Speech-to-Text Systems for German and English

Abstract: