Hybrid unsupervised representation learning and pseudo-label supervised self-distillation for rare disease imaging phenotype classification with dispersion-aware imbalance correction

Copyright © 2024 Elsevier B.V. All rights reserved..

Rare diseases are characterized by low prevalence and are often chronically debilitating or life-threatening. Imaging phenotype classification of rare diseases is challenging due to the severe shortage of training examples. Few-shot learning (FSL) methods tackle this challenge by extracting generalizable prior knowledge from a large base dataset of common diseases and normal controls and transferring the knowledge to rare diseases. Yet, most existing methods require the base dataset to be labeled and do not make full use of the precious examples of rare diseases. In addition, the extremely small size of the training samples may result in inter-class performance imbalance due to insufficient sampling of the true distributions. To this end, we propose in this work a novel hybrid approach to rare disease imaging phenotype classification, featuring three key novelties targeted at the above drawbacks. First, we adopt the unsupervised representation learning (URL) based on self-supervising contrastive loss, whereby to eliminate the overhead in labeling the base dataset. Second, we integrate the URL with pseudo-label supervised classification for effective self-distillation of the knowledge about the rare diseases, composing a hybrid approach taking advantage of both unsupervised and (pseudo-) supervised learning on the base dataset. Third, we use the feature dispersion to assess the intra-class diversity of training samples, to alleviate the inter-class performance imbalance via dispersion-aware correction. Experimental results of imaging phenotype classification of both simulated (skin lesions and cervical smears) and real clinical rare diseases (retinal diseases) show that our hybrid approach substantially outperforms existing FSL methods (including those using a fully supervised base dataset) via effective integration of the URL, pseudo-label driven self-distillation, and dispersion-aware imbalance correction, thus establishing a new state of the art.

Medienart:

E-Artikel

Erscheinungsjahr:

2024

Erschienen:

2024

Enthalten in:

Zur Gesamtaufnahme - volume:93

Enthalten in:

Medical image analysis - 93(2024) vom: 29. März, Seite 103102

Sprache:

Englisch

Beteiligte Personen:

Sun, Jinghan [VerfasserIn]
Wei, Dong [VerfasserIn]
Wang, Liansheng [VerfasserIn]
Zheng, Yefeng [VerfasserIn]

Links:

Volltext

Themen:

Dispersion-aware imbalance correction
Journal Article
Pseudo-label supervised self-distillation
Rare disease classification
Unsupervised representation learning

Anmerkungen:

Date Completed 04.03.2024

Date Revised 04.03.2024

published: Print-Electronic

Citation Status MEDLINE

doi:

10.1016/j.media.2024.103102

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM368576663