On the inter-dataset generalization of machine learning approaches to Parkinson’s disease detection from voic

Hireš, Máté and Drotár, Peter and Pah, Nemuel Daniel and Ngo, Quoc Cuong and Kumar, Dinesh Kant (2023) On the inter-dataset generalization of machine learning approaches to Parkinson’s disease detection from voic. International Journal of Medical Informatics, 179. pp. 1-9. ISSN 1386-5056

[thumbnail of 1-s2.0-S1386505623002551-main-2.pdf] PDF
1-s2.0-S1386505623002551-main-2.pdf
Restricted to Repository staff only

Download (1MB) | Request a copy
Official URL / DOI: https://doi.org/10.1016/j.ijmedinf.2023.105237

Abstract

Background and Objective: Parkinson's disease is the second-most-common neurodegenerative disorder that affects motor skills, cognitive processes, mood, and everyday tasks such as speaking and walking. The voices of people with Parkinson's disease may become weak, breathy, or hoarse and may sound emotionless, with slurred words and mumbling. Algorithms for computerized voice analysis have been proposed and have shown highly accurate results. However, these algorithms were developed on single, limited datasets, with participants possessing similar demographics. Such models are prone to overfitting and are unsuitable for generalization, which is essential in real-world applications. Methods: We evaluated the computerized Parkinson's disease diagnosis performance of various machine learning models and showed that these models degraded rapidly when used on different datasets. We evaluated two mainstream state-of-the-art approaches, one based on deep convolutional neural networks and another based on voice feature extraction followed by a shallow classifier (i.e., extreme gradient boosting (XGBoost)). Results: An investigation with four datasets (CzechPD, PC-GITA, ITA, and RMIT-PD) proved that even if the algorithms yielded excellent performance on a single dataset, the results obtained on new data or even a mix of datasets were very unsatisfactory. Conclusions: More work needs to be done to make computerized voice analysis methods for Parkinson's disease diagnosis suitable for real-world applications.

Item Type: Article
Uncontrolled Keywords: Parkinson’s disease Computerized voice analysis Convolutional neural network Feature extraction Cross-dataset generalization Acoustic analysis
Subjects: R Medicine > R Medicine (General)
T Technology > TK Electrical engineering. Electronics Nuclear engineering
Divisions: Faculty of Engineering > Department of Electrical Engineering
Depositing User: Eko Setiawan 194014
Date Deposited: 03 Jan 2024 02:59
Last Modified: 04 Jan 2024 02:18
URI: http://repository.ubaya.ac.id/id/eprint/45580

Actions (login required)

View Item View Item