Discrimination between patients with CVDs and healthy people by voiceprint using the MFCC and Pitch
Abdelhamid Bourouhou 1  
,   Abdelilah Jilbab 1  
,   Chafik Nacir 1  
,   Zaineb Bourouhou 2  
Universite Mohammed V, Ecole Normale Superieure de l'Enseignement Technique de Rabat
Universite Mohammed V, Faculté de Médecine et de pharmacie, CHU Ibn Sina de Rabat
Abdelhamid Bourouhou   

Universite Mohammed V, Ecole Normale Superieure de l'Enseignement Technique de Rabat
Data nadesłania: 06-02-2021
Data ostatniej rewizji: 18-08-2021
Data akceptacji: 24-09-2021
Data publikacji online: 01-10-2021
Data publikacji: 01-10-2021
Heart diseases cause many deaths around the world every year, and his death rate makes him the leader of the killer diseases. But early diagnosis can be helpful to decrease those several deaths and save lives. To ensure good diagnose, people must pass a series of clinical examinations and analyzes, which make the diagnostic operation expensive and not accessible for everyone. Speech analysis comes as a strong tool that can resolve the task and give back a new way to discriminate between healthy people and cardiovascular disease patients. Our latest paper treated this task but using a dysphonia measurement to differentiate between people with cardiovascular disease and the healthy one, and we were able to reach 81.5% in prediction accuracy. This time we choose to change the method to increase the accuracy by extracting the voiceprint using 13 Mel-Frequency Cepstral Coefficients and the pitch, extracted from the people's voices provided from 75 subjects (35 has cardiovascular diseases, 40 healthy), three records of sustained vowels (aaaaa…,ooooo…and iiiiiiii….) has been collected from each one. We used the k-near-neighbor classifier to train a model and to classify the test entities. We were able to outperform the previous results, reaching 95.55% of prediction accuracy.
WHO. Global atlas on cardiovascular disease prevention and control Geneva 2011.
Rawther NN, Cheriyan J. Detection and classification of cardiac arrhythmias based on ECG and PCG using temporal and wavelet features. IJARCCE. 2015; 4.
Bouguila Z, Moukadem A, Dieterlen A, Ahmed Benyahia A, Hajjam A, Talha S, Andres E. Autonomous cardiac diagnostic based on synchronized ECG and PCG signal. 7th International Joint Conference on Biomedical Engineering Systems and Technologies—ESEO, Angers. 2014.
Ghassemian H, Kenari R. Early detection of pediatric heart disease by automated spectral analysis of phonocardiogram in children. J. Inf. Syst. Telecommun. 2015;3(2):66–75.
Nabih-Ali M, El-Dahshan E-SA, Yahia AS. Heart diseases diagnosis using intelligent algorithm based on PCGsignal analysis. Circuits Syst. 2017; 8(7): 184–190.
Bourouhou A, Jilbab A, Nacir C, Hammouch A. Classification of Cardiovascular disease using dysphonia measurement in speech. Diagnostyka. 2021;22(1):31-37.
Carey MJ, Parris ES, Lloyd-Thomas H, Bennett S. Robust prosodic features for speaker identification. Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP'96, 1996;3: 1800-1803.
Jhanwar N, Raina AK. Pitch correlogram clustering for fast speaker identification. EURASIP J. Adv. Signal Process. 2004:37280.
Atal BS. Automatic speaker recognition based on pitch contours. The Journal of the Acoustical Society of America. 1972; 52(6B): 1687–1697.
Kumar ChS, Mallikarjuna PR. Design of an automatic speaker recognition system using MFCC, vector quantization and LBG algorithm. International Journal on Computer Scienceand Engineering. 2011; 3(8): 2942–2954.
Yang ZR, et al. RONN: The bio-basis function neural network technique applied to the detection of natively disordered regions in proteins. Bioinformatics. 2005; 21(16):3369–3376.
Benba A, Jilbab A, Hammouch A. Voice analysis for detecting persons with Parkinson’s disease using MFCC and VQ. In The 2014 international conference on circuits, systems and signal processing, 23–25 September 2014. Saint Petersburg: Saint Petersburg State Polytechnic University 2014..
Young S, Evermann G, Hain T, Kershaw D, Liu X, Moore G, Odell J, Ollason D, Povey D, Valtchev V, Woodland P. The HTK book (for HTK version 3.4). Cambridge: Cambridge University Engineering Department. 2006.
Bourouhou A, Jilbab A, Nacir C; Hammouch A. Detection and localization algorithm of the S1 and S2 heart sounds. 2017 International Conference on Electrical and Information Technologies (ICEIT), Rabat. 2017:1-4.
Bourouhou A, Jilbab A, Nacir C, Hammouch A. Comparison of classification methods to detect the Parkinson disease. 2016 International Conference on Electrical and Information Technologies (ICEIT), Tangiers. 2016:421-424.
Bourouhou A, Jilbab A, Nacir C, Hammouch A. Heart sounds classification for a medical diagnostic assistance. International Journal of Online and Biomedical Engineering (iJOE). 2019;15(11):88–103.
Benba A, Jilbab A. Hammouch A. Analysis of multiple types of voice recordings in cepstral domain using MFCC for discriminating between patients with Parkinson’s disease and healthy people. Int J Speech Technol. 2016;19:449–456.