KIT | KIT-Bibliothek | Impressum | Datenschutz

Towards Named Entity Extraction and Translation in Spoken Language Translation

Huang, Fei; Vogel, Stephan; Waibel, Alex

Abstract:

In this paper we propose a new method of detecting and translating named entities (NE) from spoken language, e.g., Chinese broadcast news. This approach detects possible NE regions from less reliably recognized hypotheses using confidence measures. Each possible NE boundary within the region is compared with candidate NEs from retrieved documents based on their acoustic similarities and semantic correlations. These candidate NEs are re-ranked by additionally incorporating general and topic-specific language models to measure the NE context consistency. This approach, combined with the HMM-based NE extraction on confidently recognized words, improves NE extraction F-score from 66% to 71% and NE translation quality from 69% to 73% over the baseline method. Systematic comparisons on NE translation quality with different speech input quality are also presented.


Verlagsausgabe §
DOI: 10.5445/IR/1000166432
Veröffentlicht am 08.03.2024
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2004
Sprache Englisch
Identifikator KITopen-ID: 1000166432
Erschienen in Proceedings of the First International Workshop on Spoken Language Translation: Papers, Kyoto, Japan, 30 September - 01 October 2004
Veranstaltung 1th International Workshop on Spoken Language Translation (IWSLT 2004), Kyōto, Japan, 30.09.2004 – 01.10.2004
Verlag Association for Computational Linguistics (ACL)
Seiten 131-137
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page