Vocal Biomarkers for Parkinson’s Disease Classification through Hybrid Feature Selection with Beluga Whale Optimization

Hemalatha, R J and Umashankar, G and lumen, christy and sheeba, A (2025) Vocal Biomarkers for Parkinson’s Disease Classification through Hybrid Feature Selection with Beluga Whale Optimization. journal of voice. ISSN 1873-4588.

[thumbnail of Vocal Biomarkers for Parkinson’s Disease Classification through Hybrid Feature Selection with Beluga Whale Optimization - Journal of Voice.pdf] Text
Vocal Biomarkers for Parkinson’s Disease Classification through Hybrid Feature Selection with Beluga Whale Optimization - Journal of Voice.pdf

Download (558kB)

Abstract

ntroduction
Parkinson's disease (PD) is a progressive neurodegenerative disorder that often causes vocal impairments. As a result, voice analysis is a promising avenue for early and objective diagnosis. While voice is a well-established biomarker, the high dimensionality of acoustic feature sets can lead to model overfitting. This necessitates effective feature selection strategies.
Objective
The objective is to validate a hybrid feature selection pipeline that combines statistical filters with a metaheuristic wrapper algorithm. The aim is to identify a minimal, yet highly discriminative, set of acoustic features for PD classification. This approach aims to strike a balance between dimensionality reduction and high diagnostic accuracy.
Methods
This study validates a hybrid feature selection pipeline. It combines statistical filters such as Spearman correlation (Corr) and mutual information (MI) with a metaheuristic beluga whale optimization (BWO) wrapper. The framework's efficacy was benchmarked across a diverse suite of machine learning classifiers. Evaluation was conducted using 10-fold stratified group cross-validation on a dataset of 40 PD patients and 40 healthy controls. The process dynamically identifies parsimonious feature subsets within each validation fold.
Results
When paired with a random forest classifier, the Corr+MI+BWO pipeline achieved a peak F1-score of 0.966 using an average of only 21.2 features. This represents a 73% reduction in dimensionality with a minimal 2.4% performance trade-off compared to a standard filter-based approach. The most consistently selected features proved to be both statistically significant (P < 0.05) and part of a clinically coherent signature of Parkinsonian dysarthria.
Conclusion
The proposed hybrid framework is an effective methodology that successfully balances model simplicity with high predictive accuracy. This research provides a strong foundation for the development of objective, noninvasive tools for the early detection and monitoring of PD.

Item Type: Article
Subjects: Biomedical Engineering > Medical Imaging
Biomedical Engineering > Biomedical Engineering Design
Domains: Biomedical Engineering
Depositing User: Mr IR Admin
Date Deposited: 11 May 2026 11:07
Last Modified: 18 May 2026 07:00
URI: https://ir.vistas.ac.in/id/eprint/17689

Actions (login required)

View Item
View Item