KIT | KIT-Bibliothek | Impressum | Datenschutz

Improving Statistical Machine Translation in the Medical Domain using the Unified Medical Language system

Eck, Matthias; Vogel, Stephan; Waibel, Alex

Abstract:

Texts from the medical domain are an important task for natural language processing. This paper investigates the usefulness of a large medical database (the Unified Medical Language System) for the translation of dialogues between doctors and patients using a statistical machine translation system. We are able to show that the extraction of a large dictionary and the usage of semantic type information to generalize the training data significantly improves the translation performance.


Verlagsausgabe §
DOI: 10.5445/IR/1000166437
Veröffentlicht am 07.03.2024
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2004
Sprache Englisch
Identifikator KITopen-ID: 1000166437
Erschienen in Proceedings of the 20th International Conference on Computational Linguistics, COLING 2004, Geneva, Switzerland, 23-27 August 2004
Veranstaltung 20th International Conference on Computational Linguistics (COLING 2004), Genf, Schweiz, 23.08.2004 – 27.08.2004
Verlag Association for Computational Linguistics (ACL)
Seiten 792–798
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page