Using CCA-fused cepstral features in a deep learning-based cry diagnostic system for detecting an ensemble of pathologies in newborns

Khalilzad, Zahra and Tadj, Chakib. 2023. "Using CCA-fused cepstral features in a deep learning-based cry diagnostic system for detecting an ensemble of pathologies in newborns". Diagnostics, vol. 13, no. 5.
Scopus citation count: 3.

Tadj-C-2023-26325.pdf - Published version
License: Creative Commons CC BY.

Abstract

Crying is one of a newborn’s means of communication. Newborn cry signals convey valuable information about the newborn’s health condition and emotions. In this study, cry signals of healthy and pathologic newborns were analyzed to develop an automatic, non-invasive, and comprehensive Newborn Cry Diagnostic System (NCDS) that distinguishes pathologic newborns from healthy ones. For this purpose, Mel-frequency Cepstral Coefficients (MFCC) and Gammatone Frequency Cepstral Coefficients (GFCC) were extracted as features. These feature sets were also combined and fused through Canonical Correlation Analysis (CCA), a manipulation of the features that, to the best of our knowledge, has not yet been explored in the literature on NCDS designs. All of these feature sets were fed to Support Vector Machine (SVM) and Long Short-term Memory (LSTM) classifiers. Furthermore, two hyperparameter optimization methods, Bayesian optimization and grid search, were examined to enhance the system’s performance. The performance of the proposed NCDS was evaluated on two different datasets, one of inspiratory cries and one of expiratory cries. The CCA fusion feature set with the LSTM classifier achieved the best F-score in the study, 99.86%, on the inspiratory cry dataset. The best F-score on the expiratory cry dataset, 99.44%, was obtained with the GFCC feature set and the LSTM classifier. These experiments suggest the high potential and value of newborn cry signals for detecting pathologies. The framework proposed in this study could be implemented as an early diagnostic tool for clinical studies and help identify pathologic newborns.
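
The abstract reports results as F-scores. As a minimal sketch of how such a score is commonly computed for a binary healthy/pathologic decision, the snippet below derives precision, recall, and the F-score from predicted labels. The function name `f_score`, the label encoding (1 = pathologic, 0 = healthy), and the toy labels are assumptions made for illustration; they are not taken from the study, whose exact averaging scheme is not stated in the abstract.

```python
def f_score(y_true, y_pred, positive=1, beta=1.0):
    """Return (precision, recall, F-beta) for a single positive class."""
    pairs = list(zip(y_true, y_pred))
    # Count true positives, false positives, and false negatives.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    # Guard against empty denominators when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = beta ** 2 * precision + recall
    f = (1 + beta ** 2) * precision * recall / denom if denom else 0.0
    return precision, recall, f


# Hypothetical labels, NOT data from the study: 1 = pathologic, 0 = healthy.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 1]

precision, recall, f1 = f_score(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f} F1={f1:.2f}")
```

With `beta=1.0` the function returns the harmonic mean of precision and recall (the F1 score). A multi-class or macro-averaged variant would compute this per class and then average the per-class scores.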

Document type: Peer-reviewed journal article
Professor: Tadj, Chakib
Affiliation: Electrical Engineering
Deposit date: 06 Apr. 2023 21:55
Last modified: 12 Apr. 2023 19:04
URI: https://espace2.etsmtl.ca/id/eprint/26325
