Does training with blurred images bring convolutional neural networks closer to humans with respect to robust object recognition and internal representations?
Copyright © 2023 Yoshihara, Fukiage and Nishida..
It has been suggested that perceiving blurry images in addition to sharp images contributes to the development of robust human visual processing. To computationally investigate the effect of exposure to blurry images, we trained convolutional neural networks (CNNs) on ImageNet object recognition with a variety of combinations of sharp and blurred images. In agreement with recent reports, mixed training on blurred and sharp images (B+S training) brings CNNs closer to humans with respect to robust object recognition against a change in image blur. B+S training also slightly reduces the texture bias of CNNs in recognition of shape-texture cue conflict images, but the effect is not strong enough to achieve human-level shape bias. Other tests also suggest that B+S training cannot produce robust human-like object recognition based on global configuration features. Using representational similarity analysis and zero-shot transfer learning, we also show that B+S-Net does not facilitate blur-robust object recognition through separate specialized sub-networks, one network for sharp images and another for blurry images, but through a single network analyzing image features common across sharp and blurry images. However, blur training alone does not automatically create a mechanism like the human brain in which sub-band information is integrated into a common representation. Our analysis suggests that experience with blurred images may help the human brain recognize objects in blurred images, but that alone does not lead to robust, human-like object recognition.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:14 |
---|---|
Enthalten in: |
Frontiers in psychology - 14(2023) vom: 20., Seite 1047694 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Yoshihara, Sou [VerfasserIn] |
---|
Links: |
---|
Themen: |
Convolutional neural networks |
---|
Anmerkungen: |
Date Revised 07.03.2023 published: Electronic-eCollection Citation Status PubMed-not-MEDLINE |
---|
doi: |
10.3389/fpsyg.2023.1047694 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM353807087 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM353807087 | ||
003 | DE-627 | ||
005 | 20231226060848.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.3389/fpsyg.2023.1047694 |2 doi | |
028 | 5 | 2 | |a pubmed24n1179.xml |
035 | |a (DE-627)NLM353807087 | ||
035 | |a (NLM)36874839 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Yoshihara, Sou |e verfasserin |4 aut | |
245 | 1 | 0 | |a Does training with blurred images bring convolutional neural networks closer to humans with respect to robust object recognition and internal representations? |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 07.03.2023 | ||
500 | |a published: Electronic-eCollection | ||
500 | |a Citation Status PubMed-not-MEDLINE | ||
520 | |a Copyright © 2023 Yoshihara, Fukiage and Nishida. | ||
520 | |a It has been suggested that perceiving blurry images in addition to sharp images contributes to the development of robust human visual processing. To computationally investigate the effect of exposure to blurry images, we trained convolutional neural networks (CNNs) on ImageNet object recognition with a variety of combinations of sharp and blurred images. In agreement with recent reports, mixed training on blurred and sharp images (B+S training) brings CNNs closer to humans with respect to robust object recognition against a change in image blur. B+S training also slightly reduces the texture bias of CNNs in recognition of shape-texture cue conflict images, but the effect is not strong enough to achieve human-level shape bias. Other tests also suggest that B+S training cannot produce robust human-like object recognition based on global configuration features. Using representational similarity analysis and zero-shot transfer learning, we also show that B+S-Net does not facilitate blur-robust object recognition through separate specialized sub-networks, one network for sharp images and another for blurry images, but through a single network analyzing image features common across sharp and blurry images. However, blur training alone does not automatically create a mechanism like the human brain in which sub-band information is integrated into a common representation. Our analysis suggests that experience with blurred images may help the human brain recognize objects in blurred images, but that alone does not lead to robust, human-like object recognition | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a convolutional neural networks | |
650 | 4 | |a object recognition | |
650 | 4 | |a optical blur | |
650 | 4 | |a perceptual organization | |
650 | 4 | |a visual development | |
700 | 1 | |a Fukiage, Taiki |e verfasserin |4 aut | |
700 | 1 | |a Nishida, Shin'ya |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Frontiers in psychology |d 2010 |g 14(2023) vom: 20., Seite 1047694 |w (DE-627)NLM205532713 |x 1664-1078 |7 nnns |
773 | 1 | 8 | |g volume:14 |g year:2023 |g day:20 |g pages:1047694 |
856 | 4 | 0 | |u http://dx.doi.org/10.3389/fpsyg.2023.1047694 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 14 |j 2023 |b 20 |h 1047694 |