KIT | KIT-Bibliothek | Impressum | Datenschutz

Improving Zero-shot Translation with Language-Independent Constraints

Pham, Ngoc-Quan 1; Niehues, Jan ORCID iD icon; Ha, Thanh-Le; Waibel, Alex 1
1 Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract:

An important concern in training multilingual neural machine translation (NMT) is to translate between language pairs unseen during training, i.e zero-shot translation. Improving this ability kills two birds with one stone by providing an alternative to pivot translation which also allows us to better understand how the model captures information between languages. In this work, we carried out an investigation on this capability of the multilingual NMT models. First, we intentionally create an encoder architecture which is independent with respect to the source language. Such experiments shed light on the ability of NMT encoders to learn multilingual representations, in general. Based on such proof of concept, we were able to design regularization methods into the standard Transformer model, so that the whole architecture becomes more robust in zero-shot conditions. We investigated the behaviour of such models on the standard IWSLT 2017 multilingual dataset. We achieved an average improvement of 2.23 BLEU points across 12 language pairs compared to the zero-shot performance of a state-of-the-art multilingual system. Additionally, we carry out further experiments in which the effect is confirmed even for language pairs with multiple intermediate pivots.


Verlagsausgabe §
DOI: 10.5445/IR/1000145068
Veröffentlicht am 02.05.2022
Originalveröffentlichung
DOI: 10.18653/v1/W19-5202
Scopus
Zitationen: 52
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2019
Sprache Englisch
Identifikator ISBN: 978-1-950737-27-7
KITopen-ID: 1000145068
Erschienen in Proceedings of the Fourth Conference on Machine Translation. Vol. 1. Ed.: O. Bojar
Veranstaltung 4th Conference on Machine Translation (WMT 2019), Florenz, Italien, 01.08.2019 – 02.08.2019
Verlag Association for Computational Linguistics (ACL)
Seiten 13-23
Externe Relationen Siehe auch
Nachgewiesen in arXiv
Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page