Knowledge-based Sense Disambiguation of Multiword Expressions in Requirements Documents

Hey, Tobias; Keim, Jan; Tichy, Walter F.

Abstract (English):

Understanding the meaning and senses of expressions is essential for analyzing natural language requirements. Expressions must be disambiguated in their context to prevent misinterpretations. Current knowledge-based disambiguation approaches focus only on the senses of single words and fail to capture the shared meaning of expressions that consist of multiple words. As these expressions are common in requirements, we propose a sense disambiguation approach that detects and disambiguates multiword expressions. The approach is two-tiered so that different techniques can be used for detection and disambiguation. First, a conditional random field detects multiword expressions. Afterwards, the approach disambiguates these expressions and retrieves the corresponding senses using a knowledge-based technique. The knowledge-based technique has the benefit that only the knowledge base has to be exchanged to adapt the approach to new domains and knowledge. In an evaluation on 997 requirement sentences, our approach detects multiword expressions with an $\text{F}_{1}$-score of 88.4%. The sense disambiguation achieves an $\text{F}_{1}$-score of up to 57%.
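
The two-tiered idea described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not specify the features of the conditional random field or the concrete disambiguation algorithm, so the sketch substitutes a longest-match lexicon lookup for the detection tier and a simplified Lesk-style gloss overlap for the knowledge-based tier. The knowledge base, the sense identifiers, and the function names `detect_mwes`, `disambiguate`, and `analyze` are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical toy knowledge base: expression -> {sense id: gloss}.
# Swapping this dictionary is all that is needed to target another domain.
KNOWLEDGE_BASE: Dict[str, Dict[str, str]] = {
    "user interface": {
        "user_interface.software":
            "part of a software system that a user interacts with on a screen",
        "user_interface.hardware":
            "physical device panel with buttons and switches operated by a person",
    },
    "access control": {
        "access_control.security":
            "security mechanism that restricts which users may access resources in a system",
        "access_control.building":
            "physical barrier such as a door or gate regulating entry to a building",
    },
}

STOPWORDS = {"a", "an", "the", "of", "on", "in", "to", "that", "with", "and", "by", "or", "as"}


def detect_mwes(tokens: List[str], lexicon: Dict[str, Dict[str, str]],
                max_len: int = 3) -> List[Tuple[int, int]]:
    """Tier 1 stand-in: longest-match lexicon lookup (NOT a CRF).

    Returns half-open token spans (start, end) of detected expressions.
    """
    spans: List[Tuple[int, int]] = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate first; multiword means at least two tokens.
        for n in range(min(max_len, len(tokens) - i), 1, -1):
            if " ".join(tokens[i:i + n]) in lexicon:
                spans.append((i, i + n))
                i += n
                break
        else:
            i += 1
    return spans


def disambiguate(expression: str, context_tokens: List[str],
                 kb: Dict[str, Dict[str, str]]) -> Optional[str]:
    """Tier 2 stand-in: pick the sense whose gloss shares most words with the context."""
    context = set(context_tokens) - STOPWORDS - set(expression.split())
    best_sense: Optional[str] = None
    best_overlap = -1
    for sense, gloss in kb.get(expression, {}).items():
        overlap = len(context & (set(gloss.split()) - STOPWORDS))
        if overlap > best_overlap:
            best_sense, best_overlap = sense, overlap
    return best_sense


def analyze(sentence: str) -> List[Tuple[str, Optional[str]]]:
    """Run detection, then disambiguation, on one requirement sentence."""
    tokens = sentence.lower().split()
    results = []
    for start, end in detect_mwes(tokens, KNOWLEDGE_BASE):
        expression = " ".join(tokens[start:end])
        results.append((expression, disambiguate(expression, tokens, KNOWLEDGE_BASE)))
    return results


if __name__ == "__main__":
    print(analyze("The system shall display the user interface on a touch screen"))
```

Keeping the two tiers as separate functions mirrors the motivation stated in the abstract: the detector could be replaced by a trained sequence labeler without touching the disambiguation step, and the knowledge base could be exchanged without retraining the detector.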

DOI: 10.5445/IR/1000139591/post
Published on 1 October 2022
DOI: 10.5445/IR/1000139591
Published on 21 January 2022
Affiliated institution(s) at KIT Institut für Informationssicherheit und Verlässlichkeit (KASTEL)
Institut für Programmstrukturen und Datenorganisation (IPD)
Kompetenzzentrum für angewandte Sicherheitstechnologie (KASTEL)
Publication type Proceedings paper
Publication month/year 09.2021
Language English
Identifier ISBN: 978-1-6654-1899-7
KITopen-ID: 1000139591
Published in 2021 IEEE 29th International Requirements Engineering Conference Workshops (REW), Notre Dame, IN, USA, 20-24 Sept. 2021
Event 29th International Requirements Engineering Conference Workshops (REW 2021), Notre Dame, IN, USA, 20.09.2021 – 24.09.2021
Publisher Institute of Electrical and Electronics Engineers (IEEE)
Pages 70–76
Keywords Multiword Expressions, Word Sense Disambiguation, Requirements Engineering, Natural Language Processing
