3DmFV: Three-Dimensional Point Cloud Classification in Real-Time Using Convolutional Neural Networks

最新推荐文章于 2023-02-11 13:11:10 发布

luthor_lee

最新推荐文章于 2023-02-11 13:11:10 发布

阅读量1.4k

点赞数

分类专栏： 3DmFV 文章标签：点云分类深度学习

本文链接：https://blog.csdn.net/luthor_lee/article/details/82970420

版权

3DmFV 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

github程序链接

Fisher Vector 通俗学习

浅谈流形学习

此篇为对论文的理解。一下关键地方直接使用原文，避免误导。

Abstract—Modern robotic systems are often equipped with a direct three-dimensional (3-D) data acquisition device, e.g., LiDAR, which provides a rich 3-D point cloud representation of the surroundings. This representation is commonly used for obstacle
avoidance and mapping. Here, we propose a new approach for using point clouds for another critical robotic capability, semantic
understanding of the environment (i.e., object classification). Convolutional neural networks (CNNs), that perform extremely well
for object classification in 2-D images, are not easily extendible to 3-D point clouds analysis. It is not straightforward due to point
clouds’ irregular format and a varying number of points. The common solution of transforming the point cloud data into a 3-D voxel
grid needs to address severe accuracy versus memory size tradeoffs. In this letter, we propose a novel, intuitively interpretable, 3-D
point cloud representation called 3-D modified Fisher vectors. Our representation is hybrid as it combines a coarse discrete grid structure with continuous generalized Fisher vectors. Using the grid enables us to design a new CNN architecture for real-time point cloud classification. In a series of performance analysis experiments, we demonstrate competitive results or even better than state of the art on challenging benchmark datasets while maintaining robustness to various data corruptions.

摘要的摘要：提出一种性能优秀的三维点云表示方法：三维修正Fisher矢量（3DmFV）。

深度神经网络在图像分析中表现出色，但是点云是非结构化、无序的，点云数量也不尽相同，所以它们不能自然地适应空间阵列（网格）。目前有几种解决方法，其中之一是将3D点云数据栅格化，但是这种方法的计算成本和近似精度需要折衷。这篇论文采用的3DmFV来表示点云，通过它们与高斯混合模型（GMM）的偏差来描述点。此法和Fisher Vector相似，但是它以两种重要的方式进行修稿和推广：the proposed GMM is specified using a set of uniform Gaussians with centers on a 3D grid, and the
components characterizing the set of points, that, for Fisher vectors, are averages over this set, are generalized to other functions
of this set.（建议的GMM使用一组重在在3D栅格的归一化的高斯，对于FV来说，表征点的分量是该集合的平均值，可以推广到该集合的其他函数）。

优点：保持了点云的连续属性，保留了一些点集的精细细节，并且在某种条件下是无损的，可逆的。其次，网格状结构可以使用卷积神经网络，低分辨率也难怪呢产生出色的分类精度。最后，所提出的每个组成不能都是直观可解释的。

3DmFV网络分类架构由两部分组成，一是将输入点云转化为3DmFV表示，而是将转化后的架构输入CNN架构。如下图所示。