
Measuring the Structural Importance through Rhetorical Structure Index

Kokhlikyan, N.; Waibel, A.; Zhang, Y.; Zhang, J. Y.

Abstract:

In this paper, we propose a novel Rhetorical Structure Index (RSI) to measure the structural importance of a word or a phrase. Unlike TF-IDF and other content-driven measurements, RSI identifies words or phrases that are structural cues in an unstructured document. We show that structurally motivated features with high RSI values are more useful than content-driven features for applications such as segmenting unstructured lecture transcripts into meaningful segments. Experiments show that using RSI significantly improves the segmentation accuracy compared to TF-IDF, a traditional content-based feature weighting scheme.


Publisher's version
DOI: 10.5445/IR/1000037726
Published on 12 June 2025
Affiliated institution(s) at KIT: Institut für Anthropomatik und Robotik (IAR)
Publication type: Proceedings paper
Publication year: 2013
Language: English
Identifier: ISBN 978-1-62748-838-9
KITopen-ID: 1000037726
Published in: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST). Ed.: M. Carpuat, L. Specia, D. Wu
Event: Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL/HLT 2013), Atlanta, GA, USA, 9–15 June 2013
Publisher: Curran
Pages: 783-788