ENGLISH
La vitrine de diffusion des publications et contributions des chercheurs de l'ÉTS
RECHERCHER

Balanced neonatal cry classification: Integrating preterm and full-term data for RDS screening

Shayegh, Somaye Valizade et Tadj, Chakib. 2025. « Balanced neonatal cry classification: Integrating preterm and full-term data for RDS screening ». Information, vol. 16, nº 11.

[thumbnail of Tadj-C-2025-33142.pdf]
Prévisualisation
PDF
Tadj-C-2025-33142.pdf - Version publiée
Licence d'utilisation : Creative Commons CC BY.

Télécharger (5MB) | Prévisualisation

Résumé

Respiratory distress syndrome (RDS) is one of the most serious neonatal conditions, frequently leading to respiratory failure and death in low-resource settings. Early detection is therefore critical, particularly where access to advanced diagnostic tools is limited. Recent advances in machine learning have enabled non-invasive neonatal cry diagnostic systems (NCDSs) for early screening. To the best of our knowledge, this is the first cry-based RDS detection study to include both preterm and full-term infants in a subject-balanced design, using 76 neonates (38 RDS, 38 healthy; 19 per subgroup) and 8534 expiratory cry segments (4267 per class). Cry waveforms were converted to mono, high-pass-filtered, and segmented to isolate expiratory units. Mel-Frequency Cepstral Coefficients (MFCCs) and Filterbank (FBANK) features were extracted and transformed into fixed-dimensional embeddings using a lightweight X-vector model with mean-SDor attention-based pooling, followed by a binary classifier. Model parameters were optimized via grid search. Performance was evaluated using accuracy, precision, recall, F1-score, and ROC–AUC under stratified 10-fold cross-validation. MFCC + mean–SD achieved 93.59 ± 0.48% accuracy, while MFCC + attention reached 93.53 ± 0.52% accuracy with slightly higher precision, reducing false RDS alarms and improving clinical reliability. To enhance interpretability, Integrated Gradients were applied to MFCC and FBANK features to reveal the spectral regions contributing most to the decision. Overall, the proposed NCDS reliably distinguishes RDS from healthy cries and generalizes across neonatal subgroups despite the greater variability in preterm vocalizations.

Type de document: Article publié dans une revue, révisé par les pairs
Professeur:
Professeur
Tadj, Chakib
Affiliation: Génie électrique
Date de dépôt: 17 déc. 2025 15:23
Dernière modification: 10 janv. 2026 16:38
URI: https://espace2.etsmtl.ca/id/eprint/33142

Actions (Authentification requise)

Dernière vérification avant le dépôt Dernière vérification avant le dépôt