
The CMU-UKA statistical machine translation systems for IWSLT 2007

Lane, I.; Zollmann, A.; Nguyen, T.; Bach, N.; Venugopal, A.; Vogel, S.; Rottmann, K.; Zhang, Y.; Waibel, A.

Abstract:

This paper describes the CMU-UKA statistical machine translation (SMT) systems submitted to the IWSLT 2007 evaluation campaign. Systems were submitted for three language pairs: Japanese→English, Chinese→English, and Arabic→English. All systems were based on a common phrase-based SMT framework, but for each language pair a specific research problem was tackled. For Japanese→English we focused on two problems: first, punctuation recovery, and second, how to incorporate topic knowledge into the translation framework. Our Chinese→English submission focused on syntax-augmented SMT, and for the Arabic→English task we focused on incorporating morphological decomposition into the SMT framework. This research strategy enabled us to evaluate a wide variety of approaches which proved effective for the language pairs they were evaluated on.


DOI: 10.5445/IR/1000009533
Published on 18.06.2025
Affiliated institution(s) at KIT: Institut für Theoretische Informatik (ITI)
Publication type: Proceedings contribution
Publication year: 2007
Language: English
Identifier: KITopen-ID: 1000009533
Published in: International Workshop on Spoken Language Translation, IWSLT 2007, Trento, Italy, October 15-16, 2007.
Event: International Workshop on Spoken Language Translation (IWSLT 2007), Trento, Italy, 15.10.2007 – 16.10.2007