KIT | KIT-Bibliothek | Impressum | Datenschutz

A Bayesian Optimization Approach to Machine Translation Reranking

Cheng, Julius; Züfle, Maike 1; Zouhar, Vilém; Vlachos, Andreas
1 Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract (englisch):

Reranking, or scoring a list of prediction candidates from a machine translation system with an external scoring model and returning the highest-scoring candidate, remains a simple and effective method for improving prediction quality. However, reranking with high quality scoring models can add substantial computational cost to the translation pipeline, which we address in this work by framing list reranking as a Bayesian optimization (BayesOpt) problem over the candidate list, where unknown scores are modeled with a Gaussian process. This algorithm scores candidates iteratively, choosing next candidates by balancing between exploration, choosing to score those that differ from candidates already scored, and exploitation, choosing to score those that resemble high-scoring candidates.This procedure finds high-scoring candidates while scoring only a fraction of the candidates list; given candidate lists of 200 random samples (before deduplication), our method achieves the same CometKiwi score using only 70 scoring evaluations on average compared to scoring a random subset of 180 candidates. We also propose multi-fidelity BayesOpt for list reranking, where scores obtained from a noisier but cheaper proxy scoring model are incorporated into the search process. ... mehr


Verlagsausgabe §
DOI: 10.5445/IR/1000181942
Veröffentlicht am 22.05.2025
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2025
Sprache Englisch
Identifikator ISBN: 979-88-917618-9-6
KITopen-ID: 1000181942
Erschienen in Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies. Vol.: 1. Ed.: L. Chiruzzo
Veranstaltung Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025), Albuquerque, NM, USA, 29.04.2025 – 04.05.2025
Verlag Association for Computational Linguistics (ACL)
Seiten 2849–2862
Externe Relationen Abstract/Volltext
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page