Augmenting a Statistical Translation System with a Translation Memory

Hewavitharana, Sanjika; Vogel, Stephan; Waibel, Alex

Abstract:

In this paper, we present a translation memory (TM) based system that augments a statistical machine translation (SMT) system. It is used to translate sentences that have close matches in the training corpus. Given a test sentence, we first extract sentence pairs from the training corpus whose source side is similar to the test sentence. The TM system then modifies the translations of these sentence pairs through a sequence of substitution, deletion, and insertion operations to obtain the desired result. The statistical phrase alignment model of the SMT system is used for this purpose. The system was evaluated on a corpus of Chinese-English conversational data. For closely matching sentences, the translations produced by the translation memory approach were compared with those of the statistical decoder.
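
The retrieval step and the edit operations mentioned in the abstract can be illustrated with a minimal sketch. It assumes word-level edit distance as the similarity measure, which the abstract does not specify, and it only derives the substitution, deletion, and insertion operations on the source side. Mapping those operations onto the stored translation requires the SMT system's phrase alignment model, which is not reproduced here. The function names `edit_ops` and `closest_match`, the toy corpus, and the placeholder translations are hypothetical.

```python
from typing import List, Tuple

# (operation, word in the matched source sentence, word in the test sentence)
Op = Tuple[str, str, str]


def edit_ops(src: List[str], test: List[str]) -> List[Op]:
    """Word-level Levenshtein alignment that turns `src` into `test`."""
    n, m = len(src), len(test)
    # dist[i][j] = number of edits needed to turn src[:i] into test[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i
    for j in range(m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if src[i - 1] == test[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,         # delete a source word
                dist[i][j - 1] + 1,         # insert a test word
                dist[i - 1][j - 1] + cost,  # match or substitute
            )

    # Walk back from the bottom-right cell to recover one optimal edit script.
    ops: List[Op] = []
    i, j = n, m
    while i > 0 or j > 0:
        same = i > 0 and j > 0 and src[i - 1] == test[j - 1]
        if i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + (0 if same else 1):
            ops.append(("match" if same else "substitute", src[i - 1], test[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            ops.append(("delete", src[i - 1], ""))
            i -= 1
        else:
            ops.append(("insert", "", test[j - 1]))
            j -= 1
    ops.reverse()
    return ops


def closest_match(test: str, corpus: List[Tuple[str, str]]) -> Tuple[str, str]:
    """Return the (source, target) pair whose source needs the fewest edits."""
    def cost(pair: Tuple[str, str]) -> int:
        return sum(op[0] != "match" for op in edit_ops(pair[0].split(), test.split()))
    return min(corpus, key=cost)


# Toy data: English stands in for the source language, targets are placeholders.
corpus = [
    ("i want a room", "<translation 1>"),
    ("where is the train station", "<translation 2>"),
]
test_sentence = "where is the bus station"
source, target = closest_match(test_sentence, corpus)
changes = [op for op in edit_ops(source.split(), test_sentence.split()) if op[0] != "match"]
print(source, changes)
```

In a full system, each non-match operation in `changes` would be applied to the aligned target-side words of the retrieved translation rather than to the source sentence.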


Publisher's version
DOI: 10.5445/IR/1000166426
Published on 27 February 2024
Affiliated institution(s) at KIT: Institut für Anthropomatik und Robotik (IAR)
Publication type: Proceedings paper
Publication year: 2005
Language: English
Identifier: KITopen-ID 1000166426
Published in: Proceedings of the 10th EAMT Conference: Practical applications of machine translation, Budapest, Hungary, 30-31 May 2005
Event: Conference of the European Association for Machine Translation (EAMT 2005), Budapest, Hungary, 30-31 May 2005
Publisher: Association for Computational Linguistics (ACL)
Pages: 126-132