Khomkrit Yongcharoenchaiyasit. Multi-class classification model for dementia, heart failure, and aorticv valve disorder. Master's Degree(Computer Engineering). Mae Fah Luang University. Learning Resources and Educational Media Centre. : Mae Fah Luang University, 2023.
Multi-class classification model for dementia, heart failure, and aorticv valve disorder
Abstract:
Dementia is a group of symptoms that affect memory, analytical thinking, and social abilities in the elderly population. Early diagnosis of dementia is crucial for reducing or mitigating mortality rates. Machine learning algorithms, given the wide availability of electronic health records, have proven capable of disease prediction. However, developing effective disease predictive models presents a challenge due to imbalanced, low-dimensional, and complex data. For this study, the multi-class classification model was proposed to distinguish dementia from its potential causes, including heart failure and aortic valve disorder.
The extreme gradient boosting (XGBoost)-based model was proposed for multi-class classification of dementia, heart failure, and aortic valve disorder in the elderly population due to its strong capability to handle complex data and achieve high predictive accuracy. The model was trained and tested on the data collected from Chiang Rai Prachanukroh Hospital and Thoeng Hospital in Chiang Rai, Thailand, consisting of 21,201 records. To address low dimensional data, various prior knowledge-based feature creation techniques were employed to create additional features. In addition, the borderline synthetic minority over-sampling technique was applied to address the class imbalance issue.
The performance of the proposed XGBoost-based model was compared with several existing methods, including support vector machine, k-nearest neighbors, decision tree, random forest, extra trees, gradient boosting, and TabNet. Hyperparameter optimization was performed for all models. Classification performance was evaluated using the confusion matrix-based metrics including precision, recall, F1 score, accuracy, area under the receiver operating characteristic curve, and area under the precision-recall curve. In conclusion, the XGBoost-based model achieved superior performance compared to other models based on these metrics.