KIT | KIT-Bibliothek | Impressum | Datenschutz

Combination of NN and CRF Models for Joint Detection of Punctuation and Disfluencies

Cho, Eunah 1; Kilgour, Kevin 1; Niehues, Jan ORCID iD icon 1; Waibel, Alex 1
1 Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract:

Inserting proper punctuation marks and deleting speech disfluencies are two of the most essential tasks in spoken language
processing. This challenging task has prompted extensive research using various techniques, such as conditional random
fields. Neural networks, however, are relatively under-explored for this task. Combining different modeling techniques with different advantages has the potential to lead to improvements. In this work, we first establish the performance of joint modeling of punctuation prediction and disfluency detection using neural networks. We then combine a conditional random fields based model and a neural networks based model log-linearly, and show that the combined approach outperforms both individual models, by 2.7% and 3.5% in F-score for speech disfluency and punctuation detection, respectively. When used as a preprocessing step to machine translation this also results in an improved translation quality of 2.5 BLEU points compared to the baseline and
of 0.6 BLEU points compared to the non-combined model.


Verlagsausgabe §
DOI: 10.5445/IR/1000051096
Veröffentlicht am 06.06.2025
Scopus
Zitationen: 16
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2015
Sprache Englisch
Identifikator ISBN: 978-1-5108-1790-6
KITopen-ID: 1000051096
Erschienen in Speech beyond speech towards a better understanding of the most important biosignal : 16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015) : Dresden, Germany, 6-10 September 2015
Veranstaltung Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH), September 6-10 2015, Dresden, Germany
Verlag Red Hook
Seiten 3650-3654
Externe Relationen Siehe auch
Nachgewiesen in Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page