Back to articles
Article
Volume: 35 | Article ID: IPAS-298
Image
Deep learning based speech emotion recognition for Parkinson patient
  DOI :  10.2352/EI.2023.35.9.IPAS-298  Published OnlineJanuary 2023
Abstract
Abstract

Speech emotions (SEs) are an essential component of human interactions and an efficient way of persuading human behavior. The recognition of emotions from the speech is an emergent but challenging area of digital signal processing (DSP). Healthcare professionals are always looking for the best ways to understand patient voices for better diagnosis and treatment. Speech emotion recognition (SER) from the human voice, particularly in a person with neurological disorders like Parkinson's disease (PD), can expedite the diagnostic process. Patients with PD are primarily passed through diagnosis via expensive tests and continuous monitoring that is time-consuming and costly. This research aims to develop a system that can accurately identify common SEs which are important for PD patients, such as anger, happiness, normal, and sadness. We proposed a novel lightweight deep model to predict common SEs. The adaptive wavelet thresholding method is employed for pre-processing the audio data. Furthermore, we generated spectrograms from the speech data instead of directly processing voice data to extract more discriminative features. The proposed method is trained on generated spectrograms of the IEMOCAP dataset. The suggested deep learning method contains convolution layers for learning discriminative features from spectrograms. The performance of the proposed framework is evaluated on standard performance metrics, which show promising real-time results for PD patients.

Subject Areas :
Views 98
Downloads 36
 articleview.views 98
 articleview.downloads 36
  Cite this article 

Habib Khan, Mohib Ullah, Fadi Al-Machot, Faouzi Alaya Cheikh, Muhammad Sajjad, "Deep learning based speech emotion recognition for Parkinson patientin Electronic Imaging,  2023,  pp 298--1 - 298-6,  https://doi.org/10.2352/EI.2023.35.9.IPAS-298

 Copy citation
  Copyright statement 
Copyright This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. 2023
ei
Electronic Imaging
2470-1173
2470-1173
Society for Imaging Science and Technology
IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA