3D vision processing notebook

What are we focusing on?

  • Predicting 3D shapes from a single image.
  • Predicting labels from 3D shapes, e.g. for autonomous driving (using depth information).

Topics

  • Computing correspondences
  • SLAM
  • Self-supervised learning
  • Structure from motion
  • Multi-view stereo

3D shape representations

  • Depth map (RGB + depth)
  • Voxel grid
  • Point cloud
  • Mesh, used in computer graphics (CG)
  • Implicit surface, e.g. NeRF (using a function to represent the 3D shape)

Depth map

RGB image + depth map = RGB-D image, often referred to as 2.5D.

When the depth is predicted from a single image, it only captures relative distance, not exact (metric) distance.

This type of data can be recorded directly by some types of 3D sensors; otherwise, it can be predicted by a neural network.

Around 2014/2015, depth maps were predicted with fully convolutional networks, and the prediction turned out to be fairly easy. Why? To estimate depth, the model first has to be able to recognize the objects in the image.

Problem: scale/depth ambiguity. A small, close object looks the same as a large, far one.
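
To make the ambiguity concrete, here is a minimal sketch assuming an ideal pinhole camera (the `project` helper and the focal length are my own illustration, not part of the original notes). Scaling an object's size and its distance by the same factor leaves the projected image unchanged.

```python
import numpy as np

def project(points, focal=1.0):
    """Pinhole projection: (X, Y, Z) -> (f * X / Z, f * Y / Z)."""
    points = np.asarray(points, dtype=float)
    return focal * points[:, :2] / points[:, 2:3]

# A small object close to the camera: corners of a 1 x 1 square at depth 2.
small_close = np.array([[-0.5, -0.5, 2.0],
                        [ 0.5, -0.5, 2.0],
                        [ 0.5,  0.5, 2.0],
                        [-0.5,  0.5, 2.0]])

# The same object scaled by 10: ten times larger and ten times farther away.
large_far = 10.0 * small_close

# Both objects land on exactly the same image coordinates.
same_image = np.allclose(project(small_close), project(large_far))
print(same_image)
```

This is why a single image cannot fix the absolute scale of the scene without extra cues.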

Bad performance? Add a loss. In this case, we add a scale-invariant loss. We can also predict surface normals (pixels that share the same surface normal are treated as belonging to the same surface). However, this performs badly when the illumination is uneven.
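
A rough sketch of a scale-invariant loss, assuming the commonly used log-space form (popularized by Eigen et al., 2014); the notes do not say which exact formulation was meant, so treat the function name and the `lam` parameter as my assumptions. With `lam=1.0`, multiplying the whole prediction by a constant does not change the loss.

```python
import numpy as np

def scale_invariant_loss(pred, target, lam=1.0, eps=1e-8):
    """Log-space depth loss: mean(d^2) - lam * mean(d)^2, with d = log(pred) - log(target).

    lam = 0 gives a plain squared error in log space;
    lam = 1 makes the loss invariant to a global scale on the prediction.
    """
    pred = np.maximum(np.asarray(pred, dtype=float), eps)      # guard against log(0)
    target = np.maximum(np.asarray(target, dtype=float), eps)
    d = np.log(pred) - np.log(target)
    n = d.size
    return np.sum(d ** 2) / n - lam * np.sum(d) ** 2 / n ** 2

rng = np.random.default_rng(0)
target = rng.uniform(1.0, 10.0, size=(4, 4))                  # hypothetical ground-truth depths
pred = target * rng.uniform(0.8, 1.2, size=(4, 4))            # hypothetical noisy prediction

loss = scale_invariant_loss(pred, target)
loss_rescaled = scale_invariant_loss(3.0 * pred, target)      # same prediction, wrong global scale
print(np.isclose(loss, loss_rescaled))
```

The second term cancels any constant offset in log space, which is exactly a global scale factor in depth.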

Voxels

  • Represent a shape with a V × V × V grid of occupancies. Fill the grid with voxels.
  • Just like segmentation masks in Mask R-CNN, but in 3D.
  • Disadvantages: no fine details and no texture.
  • Two topics: generative voxel shapes / 3D classification.
  • Disadvantages: memory grows cubically with V, so the resolution stays low and more details are lost. Improvement: octrees.
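
The occupancy grid and its memory cost can be sketched as follows. This is only an illustration: the `voxelize` helper and the sampled sphere are hypothetical, not from the notes.

```python
import numpy as np

def voxelize(points, V):
    """Turn an (N, 3) point set into a V x V x V boolean occupancy grid."""
    points = np.asarray(points, dtype=float)
    lo = points.min(axis=0)
    extent = (points.max(axis=0) - lo).max()
    extent = extent if extent > 0 else 1.0
    # Map coordinates into [0, V) and clamp the upper boundary into the last cell.
    idx = np.floor((points - lo) / extent * V).astype(int)
    idx = np.clip(idx, 0, V - 1)
    grid = np.zeros((V, V, V), dtype=bool)
    grid[idx[:, 0], idx[:, 1], idx[:, 2]] = True
    return grid

rng = np.random.default_rng(0)
# Hypothetical input: points sampled on the surface of a unit sphere.
pts = rng.normal(size=(2000, 3))
pts /= np.linalg.norm(pts, axis=1, keepdims=True)

for V in (16, 32, 64):
    grid = voxelize(pts, V)
    # Total cells grow as V^3, while only the cells near the surface are occupied.
    print(V, grid.size, int(grid.sum()))
```

Most cells are empty, which is the waste an octree tries to avoid by subdividing only near the surface.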

3D Point cloud

One question to ask: why 3D? Because we want to edit shapes in 3D, so this discrete representation is specific to 3D tasks.

  • Can represent fine structures without huge numbers of points.
  • Requires new model architectures, new losses, etc.
  • Doesn't explicitly represent the surface of the shape (no surface features); a mesh has to be extracted for rendering.
  • PointNet.
  • There is not much work on point cloud generation; most work focuses on point cloud understanding.
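
Two small sketches of what "new architecture and new losses" can look like for an unordered point set. Both are my own simplified illustrations: `global_feature` mimics the PointNet idea (a shared per-point layer followed by max pooling) with random stand-in weights, and `chamfer_distance` is one commonly used point set loss, not necessarily the one the notes had in mind.

```python
import numpy as np

def global_feature(points, weight):
    """PointNet-style encoding: shared per-point linear map + ReLU, then max pool over points."""
    per_point = np.maximum(points @ weight, 0.0)   # (N, C), same weights for every point
    return per_point.max(axis=0)                   # (C,), independent of point order

def chamfer_distance(a, b):
    """Symmetric Chamfer distance between point sets a (N, 3) and b (M, 3)."""
    diff = a[:, None, :] - b[None, :, :]           # (N, M, 3) pairwise differences
    sq = np.sum(diff ** 2, axis=-1)                # (N, M) squared distances
    # Nearest neighbour in b for each point of a, and the other way round.
    return sq.min(axis=1).mean() + sq.min(axis=0).mean()

rng = np.random.default_rng(0)
cloud = rng.normal(size=(128, 3))
shuffled = cloud[rng.permutation(len(cloud))]      # same points, different order
weight = rng.normal(size=(3, 16))                  # random stand-in for learned weights

order_invariant = np.allclose(global_feature(cloud, weight),
                              global_feature(shuffled, weight))
print(order_invariant)
print(chamfer_distance(cloud, shuffled))           # identical sets, so the distance is zero
print(chamfer_distance(cloud, cloud + 0.1))        # shifted copy, so the distance is positive
```

Both functions treat the input as a set, so reordering the points changes nothing.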

Mesh

All 3D representations are eventually converted to a mesh for rendering; the mesh is the final form of the 3D model.

Key concepts: vertices / faces
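
As a minimal sketch, a triangle mesh can be stored as two arrays: vertex positions and faces that index into them. The tetrahedron and the `face_areas` helper below are my own example.

```python
import numpy as np

# Four vertices of a tetrahedron, one (x, y, z) row each.
vertices = np.array([[0.0, 0.0, 0.0],
                     [1.0, 0.0, 0.0],
                     [0.0, 1.0, 0.0],
                     [0.0, 0.0, 1.0]])

# Each face is a triple of indices into `vertices`.
faces = np.array([[0, 2, 1],
                  [0, 1, 3],
                  [0, 3, 2],
                  [1, 2, 3]])

def face_areas(vertices, faces):
    """Area of every triangle: half the norm of the cross product of two edges."""
    a, b, c = (vertices[faces[:, i]] for i in range(3))
    return 0.5 * np.linalg.norm(np.cross(b - a, c - a), axis=1)

areas = face_areas(vertices, faces)
total_area = areas.sum()
print(areas, total_area)
```

Because faces only store indices, moving a vertex automatically updates every face that shares it.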

Advantages: standard + explicit + adaptive (allocate more faces to areas with fine detail).

How to represent texture? UV map. [Another topic, not covered here.]

Method: Pixel2Mesh

  • Iterative mesh refinement (before evaluation, I don't know how many vertices to use)

Implicit Surface

Implicit function vs. explicit shape
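
A small sketch of the difference, using a sphere's signed distance function as the implicit function (my own example; a learned model would replace `sphere_sdf` with a network). The shape is never stored; it is whatever set of points the function marks as inside, and an explicit shape is obtained by sampling it.

```python
import numpy as np

def sphere_sdf(p, radius=1.0):
    """Signed distance to a sphere centred at the origin: < 0 inside, 0 on the surface, > 0 outside."""
    return np.linalg.norm(p, axis=-1) - radius

# Implicit: query any point in space.
print(sphere_sdf(np.array([0.0, 0.0, 0.0])))   # centre, inside
print(sphere_sdf(np.array([2.0, 0.0, 0.0])))   # outside

# Implicit -> explicit: sample the function on a grid to get a voxel occupancy.
V = 32
axis = np.linspace(-1.5, 1.5, V)
grid = np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), axis=-1)  # (V, V, V, 3)
occupancy = sphere_sdf(grid) <= 0.0
print(occupancy.shape, occupancy.mean())
```

The grid resolution `V` is chosen only at extraction time, so the same function can be sampled as coarsely or as finely as needed.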

NeRF: generates images of a scene from different viewpoints.
