Structure-aware feature stylization for domain generalization

Cheraghalikhani, Milad, Noori, Mehrdad, Osowiechi, David, Hakim, Gustavo A. Vargas, Ben Ayed, Ismail and Desrosiers, Christian. 2024. "Structure-aware feature stylization for domain generalization". Computer Vision and Image Understanding, vol. 244.

PDF: BenAyed-I-2024-28674.pdf - Published version
License: Creative Commons CC BY-NC.

Official URL: https://doi.org/10.1016/j.cviu.2024.104016

Abstract

Generalizing to out-of-distribution (OOD) data is a challenging task for existing deep learning approaches. This problem largely stems from the common but often incorrect assumption of statistical learning algorithms that the source and target data are independent and identically distributed (i.i.d.), i.e., drawn from the same distribution. To tackle the limited variability of domains available during training, as well as domain shifts at test time, numerous approaches for domain generalization have focused on generating samples from new domains. Recent studies on this topic suggest that feature statistics from instances of different domains can be mixed to simulate images from a novel domain. While this simple idea achieves state-of-the-art results on various domain generalization benchmarks, it ignores structural information, which is key to transferring knowledge across different domains. In this paper, we leverage the ability of humans to recognize objects using solely their structural information (prominent region contours) to design a Structure-Aware Feature Stylization method for domain generalization. Our method improves feature stylization based on mixing instance statistics by enforcing structural consistency across the different style-augmented samples. This is achieved via a multi-task learning model which classifies original and augmented images while also reconstructing their edges in a secondary task. The edge reconstruction task helps the network preserve image structure during feature stylization, while also acting as a regularizer for the classification task. Through quantitative comparisons, we verify the effectiveness of our method over existing state-of-the-art methods on PACS, VLCS, OfficeHome, DomainNet and Digits-DG. The implementation is available at this repository.
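
The two ingredients named in the abstract, stylization by mixing instance statistics and a multi-task objective with edge reconstruction, can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`instance_stats`, `mix_style`, `multitask_loss`), the Beta-distributed mixing coefficient, the squared-error edge loss and the `weight` parameter are all assumptions made for illustration, and the edge targets are assumed to be supplied by some external edge detector.

```python
import numpy as np

def instance_stats(x, eps=1e-6):
    """Per-sample, per-channel mean and std over the spatial axes of x (B, C, H, W)."""
    mu = x.mean(axis=(2, 3), keepdims=True)
    sigma = np.sqrt(x.var(axis=(2, 3), keepdims=True) + eps)
    return mu, sigma

def mix_style(x, rng, alpha=0.1):
    """Stylize features by mixing each sample's statistics with those of
    another, randomly chosen sample in the batch (assumed mixing scheme)."""
    b = x.shape[0]
    mu, sigma = instance_stats(x)
    x_norm = (x - mu) / sigma                        # style removed, structure kept
    perm = rng.permutation(b)                        # partner sample for each instance
    lam = rng.beta(alpha, alpha, size=(b, 1, 1, 1))  # per-sample mixing coefficient
    mu_mix = lam * mu + (1.0 - lam) * mu[perm]
    sigma_mix = lam * sigma + (1.0 - lam) * sigma[perm]
    return x_norm * sigma_mix + mu_mix               # re-apply the mixed style

def multitask_loss(logits, labels, edge_pred, edge_target, weight=1.0):
    """Cross-entropy classification loss plus a weighted edge reconstruction
    term (mean squared error here, an assumption for illustration)."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerically stable log-softmax
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    ce = -log_probs[np.arange(len(labels)), labels].mean()
    rec = np.mean((edge_pred - edge_target) ** 2)
    return ce + weight * rec
```

In this sketch, stylization changes only the per-channel mean and standard deviation of each feature map, so the instance-normalized content is unchanged. The edge term is the part that ties the augmented features back to image structure: in a full model, the same loss would be applied to both the original and the stylized samples.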

Document type:	Peer-reviewed journal article
Professor:	Ben Ayed, Ismail; Desrosiers, Christian
Affiliation:	Systems Engineering; Software Engineering and Information Technology
Date deposited:	22 May 2024 14:03
Last modified:	24 May 2024 14:30
URI:	https://espace2.etsmtl.ca/id/eprint/28674
