CVPR 2019
Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles
3D object detection
- 2D monocular images
- autonomous driving scenarios
Proposal
- lift the 2D images to 3D representations using learned neural network
3D representations using state-of-the-art GANs - leverage existing networks workding directly on 3D data to perform 3D object detection and localization
3D data for ground plane estimation using recent 3D networks
Results
-
highter results than many methods working on actual 3D inputs acquired from physical sensors
-
a late fusion of the output of the network trained on
- generated 3D image
- real 3D image
improve performance
Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles
Introduction
Two approaches have been widespread for 3D object detection problems
-
to detect objects in 2D using monocular images and then infer in 3D
-
to use 3D data (e.g. LiDAR(激光雷达)) to detect bounding boxes directly in 3D
-
Compare the two methods
-
the methods based on 2D monocular images significantly lag behind the the method use 3D data
- methods based on monocular images attempt at implicitly inferring 3D information from the input
- availability of depth information (derived or explicit)
greatly increases the performance of methods that use 3D data
-
a monocular image based 3D object detection method will be highly practical
- closing the gap in performance with the methods requiring explicit 3D data
- cheaper and lighter 2D cameras
- expensive and bulky 3D scanners
-
Our Results are of importance as
- (i) only using monocular images at inference
the efforts that are directed towards collecting high quality 3D data can help in scenarios where explicit 3D data cannot be acquired at test time. - (ii) the method can be used as a plug-and-play module
with any existing 3D method which works with BEV images, allowing operations with seamless switching between RGB and 3D scanners while leveraging the same underlying object detection platform.
This paper refers to the following methods
-
3D reconstruction from single images
-
depth estimation
Related work
Object Detection in 3D
Images