KIT | KIT-Bibliothek | Impressum | Datenschutz

Embedded Named Entity Recognition using Probing Classifiers

Popovič, Nicholas 1; Färber, Michael ORCID iD icon 2
1 Karlsruher Institut für Technologie (KIT)
2 Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB), Karlsruher Institut für Technologie (KIT)

Abstract:

Streaming text generation, has become a common way of increasing the responsiveness of language model powered applications such as chat assistants. At the same time, extracting semantic information from generated text is a useful tool for applications such as automated fact checking or retrieval augmented generation. Currently, this requires either separate models during inference, which increases computational cost, or destructive fine-tuning of the language model. Instead, we propose an approach called EMBER which enables streaming named entity recognition in decoder-only language models without fine-tuning them and while incurring minimal additional computational cost at inference time. Specifically, our experiments show that EMBER maintains high token generation rates, with only a negligible decrease in speed of around 1% compared to a 43.64% slowdown measured for a baseline. We make our code and data available online, including a toolkit for training, testing, and deploying efficient token classification models optimized for streaming text generation.


Verlagsausgabe §
DOI: 10.5445/IR/1000180189
Veröffentlicht am 20.03.2025
Originalveröffentlichung
DOI: 10.18653/v1/2024.emnlp-main.988
Dimensions
Zitationen: 1
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
Publikationstyp Proceedingsbeitrag
Publikationsdatum 19.03.2024
Sprache Englisch
Identifikator ISBN: 979-88-917616-4-3
KITopen-ID: 1000180189
Erschienen in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Ed.: Y. Al-Onaizan, M. Bansal, Y.N. Chen
Veranstaltung Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), Miami, FL, USA, 12.11.2024 – 16.11.2024
Verlag Association for Computational Linguistics (ACL)
Seiten 17830 – 17850
Nachgewiesen in Scopus
Dimensions
OpenAlex
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page