3D-RCNN: Instance-level 3D Object Reconstruction via Render-and-Compare

October 2019

tl;dr: Mono 3DOD by estimating pose and shape of vehicles and render-and-compare loss.

Overall impression

3DOD is critical for prediction and path planning. However 3D ground truth is hard to obtain. 3D RCNN only needs 2D annotation (depth and semantic segmeantion). It also need accurate intrinsics/extrinsics to make it work.

This video seems to stem from the concept of this video of PASCAL 3D

First learn the low-dimensional space from CAD models for each subtype. PCA is used. But AutoEncoder seems also OK, such as RoI10D which are heavily inspired by this work and seems more practical.

Analysis by synthesis: Estimate the shape, pose and size parameters of the cars, and render (synthesize) the scene. Then the mask and depth map are compared with ground truth to generate loss.

The shape and pose are weakly supervised and arise from end-to-end training.

Key ideas

Estimate 2D bbox, 3D center projected on 2D from RoIAligned features. Estimate shape and pose with RoIAligned feature concatenated with the intrinsics of virtual RoI camera.

Technical details

Latent space of car shapes
- 3D RCNN 10-dim
- RoI10D 6-dim
- monoDR 8-dim
The authors argue that it is hard to predict 3D property such as shape and pose from RoIAligned features. --> RoI10D did use the RoIAligned features.
Improved multi-bin loss: weighted sum of bin center by confidence score, and L1 loss. This way there is no need to regress for the residual like the deep3dbox.
Render and compare uses operation of CUDA-OPENGL, and seems quite engineering heavy to make this work.

Notes

3D dataset with pose estimation
- Pascal 3D
- multiview car dataset: cars on turntable
PCA is simple and efficient, but RoI10D reported degeneracy of PCA models and favors 3D auto-encoder.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

3d_rcnn.md

3d_rcnn.md

3D-RCNN: Instance-level 3D Object Reconstruction via Render-and-Compare

Overall impression

Key ideas

Technical details

Notes

Files

3d_rcnn.md

Latest commit

History

3d_rcnn.md

File metadata and controls

3D-RCNN: Instance-level 3D Object Reconstruction via Render-and-Compare

Overall impression

Key ideas

Technical details

Notes