KIT | KIT-Bibliothek | Impressum | Datenschutz

An efficient two-pass approach to synchronous-CFG driven statistical MT

Venugopal, A.; Vogel, S.

Abstract:

We present an efficient, novel two-pass approach to mitigate the computational impact resulting from online intersection of an n-gram language model (LM) and a probabilistic synchronous context-free
grammar (PSCFG) for statistical machine translation. In first pass CYK-style decoding, we consider first-best chart item approximations, generating a hypergraph of sentence spanning target language deriva-
tions. In the second stage, we instantiate specific alternative derivations from this hypergraph, using the LM to drive this search process, recovering from search errors made in the first pass. Model search
errors in our approach are comparable to those made by the state-of-the-art “Cube Pruning” approach in (Chiang, 2007) under comparable pruning conditions evaluated on both hierarchical and syntax-based grammars.


Verlagsausgabe §
DOI: 10.5445/IR/1000009557
Veröffentlicht am 20.06.2025
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Theoretische Informatik (ITI)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2007
Sprache Englisch
Identifikator KITopen-ID: 1000009557
Erschienen in Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (NAACL HLT 2007), April 22-27, 2007, Rochester, New York, USA. Ed.: C. L. Sidner
Veranstaltung Annual Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies (NAACL/HLT 2007), Rochester, NY, USA, 22.04.2007 – 27.04.2007
Verlag Association for Computational Linguistics (ACL)
Seiten 500 - 507
Externe Relationen Siehe auch
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page