Publication details - Small, Versatile and Mighty: A Range-View Perception Framework

Small, Versatile and Mighty: A Range-View Perception Framework

Despite its compactness and information integrity, the range-view representation of LiDAR data is rarely the first choice for 3D perception tasks. In this work, we further push the envelope of the range-view representation with a novel multi-task framework, achieving unprecedented 3D detection performance. Our proposed Small, Versatile, and Mighty (SVM) network utilizes a pure convolutional architecture to fully unleash the efficiency and multi-tasking potential of the range-view representation. To boost detection performance, we first propose a range-view-specific Perspective Centric Label Assignment (PCLA) strategy and a novel View Adaptive Regression (VAR) module to further refine hard-to-predict box properties. In addition, our framework seamlessly integrates semantic segmentation and panoptic segmentation tasks for the LiDAR point cloud, without extra modules. Among range-view-based methods, our model achieves new state-of-the-art detection performance on the Waymo Open Dataset. In particular, an improvement of over 10 mAP over convolutional counterparts can be obtained on the vehicle class. Our results on the other tasks further reveal the multi-task capabilities of the proposed small but mighty framework.

Media type:	Preprint

Year of publication:	2024
Published:	2024

Contained in:	arXiv.org - (2024), dated 1 March - year:2024

Language:	English

Contributors:	Meng, Qiang [author]; Wang, Xiao [author]; Wang, JiaBao [author]; Yan, Liujiang [author]; Wang, Ke [author]

Links:	Full text [free of charge]

Subjects:	000 Computer Science - Computer Vision and Pattern Recognition

Funding institution / Project title:

PPN (catalog ID):	XCH042778093

Internal format


LEADER	01000naa a22002652 4500
001	XCH042778093
003	DE-627
005	20240306114502.0
007	cr uuu---uuuuu
008	240306s2024 xx \|\|\|\|\|o 00\| \|\|eng c
035			\|a (DE-627)XCH042778093
035			\|a (chemrXiv)2403.00325
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Meng, Qiang \|e verfasserin \|4 aut
245	1	0	\|a Small, Versatile and Mighty: A Range-View Perception Framework
264		1	\|c 2024
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
520			\|a Despite its compactness and information integrity, the range view representation of LiDAR data rarely occurs as the first choice for 3D perception tasks. In this work, we further push the envelop of the range-view representation with a novel multi-task framework, achieving unprecedented 3D detection performances. Our proposed Small, Versatile, and Mighty (SVM) network utilizes a pure convolutional architecture to fully unleash the efficiency and multi-tasking potentials of the range view representation. To boost detection performances, we first propose a range-view specific Perspective Centric Label Assignment (PCLA) strategy, and a novel View Adaptive Regression (VAR) module to further refine hard-to-predict box properties. In addition, our framework seamlessly integrates semantic segmentation and panoptic segmentation tasks for the LiDAR point cloud, without extra modules. Among range-view-based methods, our model achieves new state-of-the-art detection performances on the Waymo Open Dataset. Especially, over 10 mAP improvement over convolutional counterparts can be obtained on the vehicle class. Our presented results for other tasks further reveal the multi-task capabilities of the proposed small but mighty framework.
650		4	\|a Computer Science - Computer Vision and Pattern Recognition \|7 (dpeaa)DE-84
650		4	\|a 000 \|7 (dpeaa)DE-84
700	1		\|a Wang, Xiao \|4 aut
700	1		\|a Wang, JiaBao \|4 aut
700	1		\|a Yan, Liujiang \|4 aut
700	1		\|a Wang, Ke \|4 aut
773	0	8	\|i Enthalten in \|t arXiv.org \|g (2024) vom: 01. März
773	1	8	\|g year:2024 \|g day:01 \|g month:03
856	4	0	\|u https://arxiv.org/abs/2403.00325 \|z kostenfrei \|3 Volltext
912			\|a GBV_XCH
951			\|a AR
952			\|j 2024 \|b 01 \|c 03