
Identification of patient classes in low back pain data using crisp and fuzzy clustering methods

Gondeau, Alexandre; Makarenkov, Vladimir

Abstract:

We performed a cluster analysis of the low back pain dataset in the framework of the IFCS-2017 data challenge. Because the original data contained missing values, we first imputed them using the Fully Conditional Specification model. The Local Outlier Factor method was then used to detect and remove outliers. After normalizing the data, we removed highly correlated variables from the transformed dataset and carried out k-means clustering of the remaining variables based on their correlations, i.e., the variables with the highest mutual correlations were assigned to the same cluster. Once the variables had been assigned to clusters, one representative per cluster was selected, namely the variable with the highest contribution score on the first principal component. The 13 selected variables include representatives of each of the 6 variable domains (contextual factor, participation, pain, psychological, activity and physical impairment) identified as important by Nielsen et al. (2016). Several clustering methods, including DAPC, k-means and k-medoids, were then applied to the reduced low back pain data. ...
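The outlier-removal step named in the abstract can be illustrated with a minimal sketch of the Local Outlier Factor score. This is not the authors' code: the function name `local_outlier_factor`, the neighbourhood size `k`, the cut-off `threshold` and the toy points are all assumptions made for illustration, and the sketch simplifies the standard definition by using exactly `k` neighbours per point (ties are broken by index).

```python
import math


def local_outlier_factor(points, k=3):
    """Return one LOF score per point; scores well above 1 suggest outliers.

    Simplified sketch: each point gets exactly k neighbours, and the data
    are assumed to contain no more than k exact duplicates of any point.
    """
    n = len(points)
    # Pairwise Euclidean distances.
    dist = [[math.dist(p, q) for q in points] for p in points]

    neighbours, k_distance = [], []
    for i in range(n):
        # Indices of the other points, nearest first.
        order = sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])
        neighbours.append(order[:k])
        k_distance.append(dist[i][order[k - 1]])

    # Local reachability density: inverse of the mean reachability distance
    # from point i to its neighbours.
    lrd = []
    for i in range(n):
        reach = [max(k_distance[j], dist[i][j]) for j in neighbours[i]]
        lrd.append(len(reach) / sum(reach))

    # LOF: mean density of the neighbours divided by the point's own density.
    return [sum(lrd[j] for j in neighbours[i]) / (k * lrd[i]) for i in range(n)]


# Toy data (hypothetical): a 3 x 3 grid plus one isolated point.
points = [(float(x), float(y)) for x in range(3) for y in range(3)]
points.append((10.0, 10.0))

scores = local_outlier_factor(points, k=3)

# Hypothetical cut-off; in practice it would be tuned to the data.
threshold = 1.5
kept = [p for p, s in zip(points, scores) if s <= threshold]
```

On this toy input the grid points score close to 1, while the isolated point scores far above the cut-off and is dropped from `kept`. A library implementation would normally be preferred for real data.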


Publisher's version
DOI: 10.5445/KSP/1000085952/06
Published on 13 May 2019
Affiliated institution(s) at KIT: Fakultät für Wirtschaftswissenschaften – Institut für Informationswirtschaft und Marketing (IISM)
Publication type: Journal article
Publication year: 2019
Language: English
Identifier: ISSN 2510-0564
KITopen-ID: 1000094517
Published in: Archives of Data Science, Series B (Online First)
Volume: 1
Issue: 1
Pages: B06, 17 pp., online