KIT | KIT-Bibliothek | Impressum | Datenschutz

HILL: A Hallucination Identifier for Large Language Models

Leiser, Florian ORCID iD icon 1; Eckhardt, Sven; Leuthe, Valentin 2; Knaeble, Merlin 3; Maedche, Alexander 3; Schwabe, Gerhard; Sunyaev, Ali 1
1 Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB), Karlsruher Institut für Technologie (KIT)
2 Karlsruher Institut für Technologie (KIT)
3 Institut für Wirtschaftsinformatik und Marketing (IISM), Karlsruher Institut für Technologie (KIT)

Abstract:

Large language models (LLMs) are prone to hallucinations, i.e., nonsensical, unfaithful, and undesirable text. Users tend to overrely on LLMs and corresponding hallucinations which can lead to misinterpretations and errors. To tackle the problem of overreliance, we propose HILL, the "Hallucination Identifier for Large Language Models". First, we identified design features for HILL with a Wizard of Oz approach with nine participants. Subsequently, we implemented HILL based on the identified design features and evaluated HILL's interface design by surveying 17 participants. Further, we investigated HILL's functionality to identify hallucinations based on an existing question-answering dataset and five user interviews. We find that HILL can correctly identify and highlight hallucinations in LLM responses which enables users to handle LLM responses with more caution. With that, we propose an easy-to-implement adaptation to existing LLMs and demonstrate the relevance of user-centered designs of AI artifacts.


Volltext §
DOI: 10.5445/IR/1000169738
Veröffentlicht am 05.04.2024
Originalveröffentlichung
DOI: 10.48550/arXiv.2403.06710
Dimensions
Zitationen: 1
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
Institut für Wirtschaftsinformatik und Marketing (IISM)
Publikationstyp Forschungsbericht/Preprint
Publikationsjahr 2024
Sprache Englisch
Identifikator KITopen-ID: 1000169738
Verlag arxiv
Umfang 13 S.
Vorab online veröffentlicht am 11.03.2024
Schlagwörter Human-Computer Interaction (cs.HC)
Nachgewiesen in Dimensions
Relationen in KITopen
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page