A deep learning method for optimizing semantic segmentation accuracy of remote sensing images based on improved UNet
© 2023. The Author(s)..
Semantic segmentation of remote sensing imagery (RSI) is critical in many domains due to the diverse landscapes and different sizes of geo-objects that RSI contains, making semantic segmentation challenging. In this paper, a convolutional network, named Adaptive Feature Fusion UNet (AFF-UNet), is proposed to optimize the semantic segmentation performance. The model has three key aspects: (1) dense skip connections architecture and an adaptive feature fusion module that adaptively weighs different levels of feature maps to achieve adaptive feature fusion, (2) a channel attention convolution block that obtains the relationship between different channels using a tailored configuration, and (3) a spatial attention module that obtains the relationship between different positions. AFF-UNet was evaluated on two public RSI datasets and was quantitatively and qualitatively compared with other models. Results from the Potsdam dataset showed that the proposed model achieved an increase of 1.09% over DeepLabv3 + in terms of the average F1 score and a 0.99% improvement in overall accuracy. The visual qualitative results also demonstrated a reduction in confusion of object classes, better performance in segmenting different sizes of object classes, and better object integrity. Therefore, the proposed AFF-UNet model optimizes the accuracy of RSI semantic segmentation.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:13 |
---|---|
Enthalten in: |
Scientific reports - 13(2023), 1 vom: 10. Mai, Seite 7600 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Wang, Xiaolei [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Completed 11.05.2023 Date Revised 13.05.2023 published: Electronic Citation Status PubMed-not-MEDLINE |
---|
doi: |
10.1038/s41598-023-34379-2 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM356679535 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM356679535 | ||
003 | DE-627 | ||
005 | 20231226210342.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1038/s41598-023-34379-2 |2 doi | |
028 | 5 | 2 | |a pubmed24n1188.xml |
035 | |a (DE-627)NLM356679535 | ||
035 | |a (NLM)37165042 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Wang, Xiaolei |e verfasserin |4 aut | |
245 | 1 | 2 | |a A deep learning method for optimizing semantic segmentation accuracy of remote sensing images based on improved UNet |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 11.05.2023 | ||
500 | |a Date Revised 13.05.2023 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status PubMed-not-MEDLINE | ||
520 | |a © 2023. The Author(s). | ||
520 | |a Semantic segmentation of remote sensing imagery (RSI) is critical in many domains due to the diverse landscapes and different sizes of geo-objects that RSI contains, making semantic segmentation challenging. In this paper, a convolutional network, named Adaptive Feature Fusion UNet (AFF-UNet), is proposed to optimize the semantic segmentation performance. The model has three key aspects: (1) dense skip connections architecture and an adaptive feature fusion module that adaptively weighs different levels of feature maps to achieve adaptive feature fusion, (2) a channel attention convolution block that obtains the relationship between different channels using a tailored configuration, and (3) a spatial attention module that obtains the relationship between different positions. AFF-UNet was evaluated on two public RSI datasets and was quantitatively and qualitatively compared with other models. Results from the Potsdam dataset showed that the proposed model achieved an increase of 1.09% over DeepLabv3 + in terms of the average F1 score and a 0.99% improvement in overall accuracy. The visual qualitative results also demonstrated a reduction in confusion of object classes, better performance in segmenting different sizes of object classes, and better object integrity. Therefore, the proposed AFF-UNet model optimizes the accuracy of RSI semantic segmentation | ||
650 | 4 | |a Journal Article | |
700 | 1 | |a Hu, Zirong |e verfasserin |4 aut | |
700 | 1 | |a Shi, Shouhai |e verfasserin |4 aut | |
700 | 1 | |a Hou, Mei |e verfasserin |4 aut | |
700 | 1 | |a Xu, Lei |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Xiang |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Scientific reports |d 2011 |g 13(2023), 1 vom: 10. Mai, Seite 7600 |w (DE-627)NLM215703936 |x 2045-2322 |7 nnns |
773 | 1 | 8 | |g volume:13 |g year:2023 |g number:1 |g day:10 |g month:05 |g pages:7600 |
856 | 4 | 0 | |u http://dx.doi.org/10.1038/s41598-023-34379-2 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 13 |j 2023 |e 1 |b 10 |c 05 |h 7600 |