A Joint 2D-3D Complementary Network for Stereo Matching

Stereo matching is an important research field of computer vision. Due to the dimension of cost aggregation, current neural network-based stereo methods are difficult to trade-off speed and accuracy. To this end, we integrate fast 2D stereo methods with accurate 3D networks to improve performance and reduce running time. We leverage a 2D encoder-decoder network to generate a rough disparity map and construct a disparity range to guide the 3D aggregation network, which can significantly improve the accuracy and reduce the computational cost. We use a stacked hourglass structure to refine the disparity from coarse to fine. We evaluated our method on three public datasets. According to the KITTI official website results, Our network can generate an accurate result in 80 ms on a modern GPU. Compared to other 2D stereo networks (AANet, DeepPruner, FADNet, etc.), our network has a big improvement in accuracy. Meanwhile, it is significantly faster than other 3D stereo networks (5× than PSMNet, 7.5× than CSN and 22.5× than GANet, etc.), demonstrating the effectiveness of our method..

Medienart:

E-Artikel

Erscheinungsjahr:

2021

Erschienen:

2021

Enthalten in:

Zur Gesamtaufnahme - volume:21

Enthalten in:

Sensors - 21(2021), 4, p 1430

Sprache:

Englisch

Beteiligte Personen:

Xiaogang Jia [VerfasserIn]
Wei Chen [VerfasserIn]
Zhengfa Liang [VerfasserIn]
Xin Luo [VerfasserIn]
Mingfei Wu [VerfasserIn]
Chen Li [VerfasserIn]
Yulin He [VerfasserIn]
Yusong Tan [VerfasserIn]
Libo Huang [VerfasserIn]

Links:

doi.org [kostenfrei]
doaj.org [kostenfrei]
www.mdpi.com [kostenfrei]
Journal toc [kostenfrei]

Themen:

Chemical technology
Computer vision
Depth estimation
Stereo matching

doi:

10.3390/s21041430

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

DOAJ030951909