KIT | KIT-Bibliothek | Impressum | Datenschutz

Maximum Entropy Language Modeling for Russian ASR

Shin, Evgeniy; Stüker, Sebastian; Kilgour, Kevin; Fügen, Christian; Waibel, Alex

Abstract:

Russian is a challenging language for automatic speech recognition systems due to its rich morphology. This rich morphology stems from Russian’s highly inflectional nature and the frequent use of preand suffixes. Also, Russian has a very free word order, changes in which are used to reflect connotations of the sentences. Dealing with these phenomena is rather difficult for traditional n-gram models. We therefore investigate in this paper the use of a maximum entropy language model for Russian whose features are specifically designed to deal with the inflections in Russian, as well as the loose word order. We combine this with a subword based language model in order to alleviate the problem of large vocabulary sizes necessary for dealing with highly inflecting languages. Applying the maximum entropy language model during re-scoring improves the word error rate of our recognition system by 1.2% absolute, while the use of the sub-word based language model reduces the vocabulary size from 120k to 40k and the OOV rate from 4.8% to 2.1%.


Verlagsausgabe §
DOI: 10.5445/IR/1000166323
Veröffentlicht am 07.02.2024
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2013
Sprache Englisch
Identifikator KITopen-ID: 1000166323
Erschienen in Proceedings of the 10th International Workshop on Spoken Language Translation: Papers. Ed.: J. Y. Zhang
Veranstaltung 10th International Workshop on Spoken Language Translation (IWSLT 2013), Heidelberg, Deutschland, 05.12.2013 – 06.12.2013
Verlag Association for Computational Linguistics (ACL)
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page