Visualising Incomplete Data with Subset Multiple Correspondence Analysis

Nienkemper-Swanepoel, Johané; Roux, Niël J. le; Gardner-Lubbe, Sugnet

doi:10.5445/KSP/1000087327/33

Visualising Incomplete Data with Subset Multiple Correspondence Analysis

Nienkemper-Swanepoel, Johané; Roux, Niël J. le; Gardner-Lubbe, Sugnet

Abstract:

Determining the cause of missing values is a challenge, but an important task in order to select correct analysis techniques for missing data. This paper presents a new approach to identify the missing data mechanism (MDM) by applying cluster analysis to biplots of data having missing observations. Subset multiple correspondence analysis (sMCA) enables an isolated analysis of a chosen subset while preserving the scaffolding of the original data set. Multivariate categorical data sets are frequently represented in a coded dummy matrix, referred to as an indicator matrix. Additional category levels can be created for the indicator matrix to account for the unobserved information which has the advantage of not forfeiting any observed information. The extended indicator matrix easily partitions a data set into observed and unobserved subsets. sMCA biplots are used for the visual exploration of the subsets. Configurations of the incomplete subsets enable the recognition of non-response patterns which could aid in the identification of a particular MDM. The missing at random (MAR) MDM refers to missing responses that are dependent on the observed information and is expected to be identified by patterns and groupings occurring in the incomplete sMCA biplot. ... mehr

KITopen-Download

Verlagsausgabe

DOI: 10.5445/KSP/1000087327/33

Veröffentlicht am 20.05.2021

Export

Statistiken

Seitenaufrufe: 217
seit 20.05.2021

Downloads: 41
seit 21.05.2021

Zugehörige Institution(en) am KIT	Fakultät für Wirtschaftswissenschaften – Institut für Informationswirtschaft und Marketing (IISM)
Publikationstyp	Zeitschriftenaufsatz
Publikationsjahr	2018
Sprache	Englisch
Identifikator	ISSN: 2363-9881 KITopen-ID: 1000133158
Erschienen in	Archives of Data Science, Series A (Online First)
Band	5
Heft	1
Seiten	A33, 20 S. online

Repository KITopen

Visualising Incomplete Data with Subset Multiple Correspondence Analysis

Abstract: