点引导的对比学习用于单目三维目标检测

Point-Guided Contrastive Learning for Monocular 3-D Object Detection

IEEE Transactions on Cybernetics · 2021

被引 15

ABS 3

Dapeng Feng
Hang Xu
Xiaodan Liang
Xiaojun Tan
Songfang Han

中文导读

提出一种单目三维目标检测模型，通过自监督和辅助学习模仿三维点云特征，利用对比学习增强二维网络的空间感知能力，在KITTI和ApolloScape数据集上达到最优性能。

Abstract

3-D object detection is a fundamental task in the context of autonomous driving. In the literature, cheap monocular image-based methods show a significant performance drop compared to the expensive LiDAR and stereo-images-based algorithms. In this article, we aim to close this performance gap by bridging the representation capability between 2-D and 3-D domains. We propose a novel monocular 3-D object detection model using self-supervised learning and auxiliary learning, resorting to mimicking the representations over 3-D point clouds. Specifically, given a 2-D region proposal and the corresponding instance point cloud, we supervise the feature activation from our image-based convolution network to mimic the latent feature of a point-based neural network at the training stage. While state-of-the-art (SOTA) monocular 3-D detection algorithms typically convert images to pseudo-LiDAR with depth estimation and regress 3-D detection with LiDAR-based methods, our approach seeks the power of the 2-D neural network straightforwardly and essentially enhances the 2-D module capability with latent spatial-aware representations by contrastive learning. We empirically validate the performance improvement from the feature mimicking the KITTI and ApolloScape datasets and achieve the SOTA performance on the KITTI and ApolloScape leaderboard.

自动驾驶三维目标检测对比学习计算机视觉

阅读原文 ↗