Shayegh, Somaye Valizade et Tadj, Chakib.
2025.
« Deep audio features and self-supervised learning for early diagnosis of neonatal diseases: Sepsis and respiratory distress syndrome classification from infant cry signals ».
Electronics, vol. 14, nº 2.
Compte des citations dans Scopus : 1.
Prévisualisation |
PDF
Tadj-C-2025-30560.pdf - Version publiée Licence d'utilisation : Creative Commons CC BY. Télécharger (3MB) | Prévisualisation |
Résumé
Neonatal mortality remains a critical global challenge, particularly in resourcelimited settings with restricted access to advanced diagnostic tools. Early detection of life-threatening conditions like Sepsis and Respiratory Distress Syndrome (RDS), which significantly contribute to neonatal deaths, is crucial for timely interventions and improved survival rates. This study investigates the use of newborn cry sounds, specifically the expiratory segments (the most informative parts of cry signals) as non-invasive biomarkers for early disease diagnosis. We utilized an expanded and balanced cry dataset, applying Self-Supervised Learning (SSL) models—wav2vec 2.0,WavLM, and HuBERT—to extract feature representations directly from raw cry audio signals. This eliminates the need for manual feature extraction while effectively capturing complex patterns associated with sepsis and RDS. A classifier consisting of a single fully connected layer was placed on top of the SSL models to classify newborns into Healthy, Sepsis, or RDS groups. We fine-tuned the SSL models and classifiers by optimizing hyperparameters using two learning rate strategies: linear and annealing. Results demonstrate that the annealing strategy consistently outperformed the linear strategy, with wav2vec 2.0 achieving the highest accuracy of approximately 90% (89.76%). These findings highlight the potential of integrating this method into Newborn Cry Diagnosis Systems (NCDSs). Such systems could assist medical staff in identifying critically ill newborns, prioritizing care, and improving neonatal outcomes through timely interventions.
Type de document: | Article publié dans une revue, révisé par les pairs |
---|---|
Professeur: | Professeur Tadj, Chakib |
Affiliation: | Génie électrique |
Date de dépôt: | 13 févr. 2025 16:35 |
Dernière modification: | 04 mars 2025 14:45 |
URI: | https://espace2.etsmtl.ca/id/eprint/30560 |
Actions (Authentification requise)
![]() |
Dernière vérification avant le dépôt |