Modeling long-range dependencies for weakly supervised disease classification and localization on chest X-ray
2022 Quantitative Imaging in Medicine and Surgery. All rights reserved..
Background: Computer-aided diagnosis based on chest X-ray (CXR) is an exponentially growing field of research owing to the development of deep learning, especially convolutional neural networks (CNNs). However, due to the intrinsic locality of convolution operations, CNNs cannot model long-range dependencies. Although vision transformers (ViTs) have recently been proposed to alleviate this limitation, those trained on patches cannot learn any dependencies for inter-patch pixels and thus, are insufficient for medical image detection. To address this problem, in this paper, we propose a CXR detection method which integrates CNN with a ViT for modeling patch-wise and inter-patch dependencies.
Methods: We experimented on the ChestX-ray14 dataset and followed the official training-test set split. Because the training data only had global annotations, the detection network was weakly supervised. A DenseNet with a feature pyramid structure was designed and integrated with an adaptive ViT to model inter-patch and patch-wise long-range dependencies and obtain fine-grained feature maps. We compared the performance using our method with that of other disease detection methods.
Results: For disease classification, our method achieved the best result among all the disease detection methods, with a mean area under the curve (AUC) of 0.829. For lesion localization, our method achieved significantly higher intersection of the union (IoU) scores on the test images with bounding box annotations than did the other detection methods. The visualized results showed that our predictions were more accurate and detailed. Furthermore, evaluation of our method in an external validation dataset demonstrated its generalization ability.
Conclusions: Our proposed method achieves the new state of the art for thoracic disease classification and weakly supervised localization. It has potential to assist in clinical decision-making.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:12 |
---|---|
Enthalten in: |
Quantitative imaging in medicine and surgery - 12(2022), 6 vom: 11. Juni, Seite 3364-3378 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Li, Fangyun [VerfasserIn] |
---|
Links: |
---|
Themen: |
Chest X-rays (CXRs) |
---|
Anmerkungen: |
Date Revised 16.07.2022 published: Print Citation Status PubMed-not-MEDLINE |
---|
doi: |
10.21037/qims-21-1117 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM341777854 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM341777854 | ||
003 | DE-627 | ||
005 | 20231226012546.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2022 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.21037/qims-21-1117 |2 doi | |
028 | 5 | 2 | |a pubmed24n1139.xml |
035 | |a (DE-627)NLM341777854 | ||
035 | |a (NLM)35655823 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Li, Fangyun |e verfasserin |4 aut | |
245 | 1 | 0 | |a Modeling long-range dependencies for weakly supervised disease classification and localization on chest X-ray |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 16.07.2022 | ||
500 | |a published: Print | ||
500 | |a Citation Status PubMed-not-MEDLINE | ||
520 | |a 2022 Quantitative Imaging in Medicine and Surgery. All rights reserved. | ||
520 | |a Background: Computer-aided diagnosis based on chest X-ray (CXR) is an exponentially growing field of research owing to the development of deep learning, especially convolutional neural networks (CNNs). However, due to the intrinsic locality of convolution operations, CNNs cannot model long-range dependencies. Although vision transformers (ViTs) have recently been proposed to alleviate this limitation, those trained on patches cannot learn any dependencies for inter-patch pixels and thus, are insufficient for medical image detection. To address this problem, in this paper, we propose a CXR detection method which integrates CNN with a ViT for modeling patch-wise and inter-patch dependencies | ||
520 | |a Methods: We experimented on the ChestX-ray14 dataset and followed the official training-test set split. Because the training data only had global annotations, the detection network was weakly supervised. A DenseNet with a feature pyramid structure was designed and integrated with an adaptive ViT to model inter-patch and patch-wise long-range dependencies and obtain fine-grained feature maps. We compared the performance using our method with that of other disease detection methods | ||
520 | |a Results: For disease classification, our method achieved the best result among all the disease detection methods, with a mean area under the curve (AUC) of 0.829. For lesion localization, our method achieved significantly higher intersection of the union (IoU) scores on the test images with bounding box annotations than did the other detection methods. The visualized results showed that our predictions were more accurate and detailed. Furthermore, evaluation of our method in an external validation dataset demonstrated its generalization ability | ||
520 | |a Conclusions: Our proposed method achieves the new state of the art for thoracic disease classification and weakly supervised localization. It has potential to assist in clinical decision-making | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Long-range dependencies | |
650 | 4 | |a chest X-rays (CXRs) | |
650 | 4 | |a disease classification | |
650 | 4 | |a localization | |
650 | 4 | |a vision transformer (ViT) | |
700 | 1 | |a Zhou, Lingxiao |e verfasserin |4 aut | |
700 | 1 | |a Wang, Yunpeng |e verfasserin |4 aut | |
700 | 1 | |a Chen, Chuan |e verfasserin |4 aut | |
700 | 1 | |a Yang, Shuyi |e verfasserin |4 aut | |
700 | 1 | |a Shan, Fei |e verfasserin |4 aut | |
700 | 1 | |a Liu, Lei |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Quantitative imaging in medicine and surgery |d 2011 |g 12(2022), 6 vom: 11. Juni, Seite 3364-3378 |w (DE-627)NLM216114624 |x 2223-4292 |7 nnns |
773 | 1 | 8 | |g volume:12 |g year:2022 |g number:6 |g day:11 |g month:06 |g pages:3364-3378 |
856 | 4 | 0 | |u http://dx.doi.org/10.21037/qims-21-1117 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 12 |j 2022 |e 6 |b 11 |c 06 |h 3364-3378 |