KIT | KIT-Bibliothek | Impressum | Datenschutz

Wikipedia infobox type prediction using embeddings

Biswas, Russa ORCID iD icon 1; Türker, Rima 1; Bakhshandegan-Moghaddam, Farshad 1; Koutraki, Maria 1; Sack, Harald 1
1 Karlsruher Institut für Technologie (KIT)

Abstract:

Wikipedia, the multilingual, free content encyclopedia has evolved as the largest and the most popular general reference work on the Internet. Since the time of commencement of Wikipedia, crowd sourcing of articles has been one of the most salient features of this open encyclopedia. It is obvious that enormous amount of work and expertise goes in the creation of a self-content article. However, it has been observed that the infobox type information in Wikipedia articles is often incomplete, incorrect and missing. This is due to the human intervention in creating Wikipedia articles. Moreover, the type of the infoboxes in Wikipedia plays a vital role in the determination of RDF type inference in the Knowledge Graphs such as DBpedia. Hence, there arouses a necessity to have the correct infobox type information in the Wikipedia articles. In this paper, we propose an approach of predicting Wikipedia infobox type information using both word and network embeddings. Furthermore, the impact of using minimalistic information such as Table of Contents and Named Entity mentions in the abstract of a Wikipedia article in the prediction process has been analyzed as well.


Scopus
Zitationen: 5
Zugehörige Institution(en) am KIT Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2018
Sprache Englisch
Identifikator ISSN: 1613-0073
KITopen-ID: 1000083955
Erschienen in DL4KGS 2018, Workshop on Deep Learning for Knowledge Graphs and Semantic Technologies 2018 : proceedings of the First Workshop on Deep Learning for Knowledge Graphs and Semantic Technologies (DL4KGS), co-located with the 15th Extended Semantic Web Conerence (ESWC 2018) : Heraklion, Crete, Greece, June 4, 2018. Ed.: M. cochez
Verlag RWTH Aachen
Seiten 46-55
Serie CEUR Workshop Proceedings ; 2106
Externe Relationen Abstract/Volltext
Schlagwörter Wikipedia, Infobox, Embeddings, Knowledge Graph, Classification
Nachgewiesen in Scopus
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page