KIT | KIT-Bibliothek | Impressum | Datenschutz

Syntax Augmented Machine Translation via Chart Parsing

Zollmann, Andreas; Venugopal, Ashish

Abstract:

We present translation results on the shared task ”Exploiting Parallel Texts for Statistical Machine Translation” generated by a chart parsing decoder operating on phrase tables augmented and generalized with target language syntactic categories. We use a target language parser to generate parse trees for each sentence on the target side of the bilingual training corpus, matching them with phrase table lattices built for the corresponding source sentence. Considering phrases that correspond to syntactic categories in the parse trees we develop techniques to augment (declare a syntactically motivated category for a phrase pair) and generalize (form mixed terminal and nonterminal phrases) the phrase table into a synchronous bilingual grammar. We present results on the French-to-English task for this workshop, representing significant improvements over the workshop’s baseline system. Our translation system is available open-source under the GNU General Public License.


Verlagsausgabe §
DOI: 10.5445/IR/1000166409
Veröffentlicht am 27.02.2024
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2006
Sprache Englisch
Identifikator KITopen-ID: 1000166409
Erschienen in HLT-NAACL 06: Statistical Machine Translation. Proceedings of the Workshop. 8-9 June 2006 New York City, USA. Ed.: P. Koehn, C. Monz
Veranstaltung ACL Workshop on Statistical Machine Translation (WMT 2006), New York City, NY, USA, 08.06.2006 – 09.06.2006
Verlag Association for Computational Linguistics (ACL)
Seiten 138–141
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page