Pointfilter: Point Cloud Filtering via Encoder-Decoder Modeling (Paper Walkthrough)

Author: 一学

Article link: https://blog.csdn.net/weixin_43894075/article/details/112892822

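This post walks through the Pointfilter paper, which uses an encoder-decoder network to remove noise from point clouds. With PCA-based preprocessing for rigid-transformation invariance, projection of each noisy point onto the local structure, and a loss that combines Gaussian weighting with a repulsion term, the method denoises complex point clouds while preserving features. It performs less well under heavy noise, but it shows the benefit of learning a compact representation and borrowing ideas from traditional methods.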

1. METHOD

The network learns a displacement vector: noisy point + displacement vector = denoised point.

1.1 Preprocessing

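Given a clean point cloud $P$ and its noisy counterpart $\hat{P}$, define the noisy patch $\hat{\mathcal{P_i}}$ and its corresponding ground-truth patch $\mathcal{P_i}$ as
$\hat{\mathcal{P_i}} = \{\hat{p_j} \mid \Vert\hat{p_j}-\hat{p_i}\Vert < r\}$
$\mathcal{P_i}= \{p_j \mid \Vert p_j-p_i \Vert < r\}$
where $r$ is the patch radius, typically 5% of the diagonal of the model's bounding box.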
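
The patch definition above can be sketched with a brute-force radius query in NumPy. This is only an illustration on synthetic data: the helper names `patch_radius` and `extract_patch` are hypothetical, and the ground-truth patch is centered as written in these notes. For large clouds a spatial index (for example SciPy's `cKDTree.query_ball_point`) would likely be a better choice than the linear scan used here.

```python
import numpy as np

def patch_radius(points, ratio=0.05):
    """Patch radius as a fraction of the bounding-box diagonal of the model."""
    diag = np.linalg.norm(points.max(axis=0) - points.min(axis=0))
    return ratio * diag

def extract_patch(points, center, radius):
    """Return every point whose distance to `center` is below `radius`."""
    dist = np.linalg.norm(points - center, axis=1)
    return points[dist < radius]

# Synthetic clean/noisy pair (illustrative data, not from the paper).
rng = np.random.default_rng(0)
clean = rng.uniform(-1.0, 1.0, size=(2000, 3))
noisy = clean + rng.normal(scale=0.01, size=clean.shape)

r = patch_radius(noisy)
i = 0
noisy_patch = extract_patch(noisy, noisy[i], r)   # patch around the noisy point
clean_patch = extract_patch(clean, clean[i], r)   # corresponding ground-truth patch
```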

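Once the patches are generated, two issues must be addressed in point cloud filtering:

1. how to avoid unnecessary degrees of freedom in the observation space;
2. how to make Pointfilter invariant to certain geometric transformations, such as rigid transformations.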

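For the first issue, each patch is translated to the origin (centered at $\hat{p_i}$) and scaled by the patch radius:
$\hat{\mathcal{P_i}}=(\hat{\mathcal{P_i}}-\hat{p_i})/r$
$\mathcal{P_i}=(\mathcal{P_i}-\hat{p_i})/r$
For the second issue, rigid invariance is obtained by aligning the PCA principal axes of the input patch with the Cartesian axes (first the z-axis, then the x-axis).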

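So that the network parameters can be tuned conveniently, every input patch is fixed at $\mid\hat{\mathcal{P_i}}\mid=500$ points by default. Patches with fewer than 500 points are padded; patches with more than 500 points are randomly downsampled.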
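
A minimal NumPy sketch of the three preprocessing steps (normalization, PCA alignment, fixing the point count) follows. Several details are assumptions rather than something these notes state: the least-variance PCA axis is mapped to z and the largest-variance axis to x, padding uses the origin, and all function names are hypothetical.

```python
import numpy as np

def normalize_patch(patch, center, radius):
    """Translate the patch so `center` is the origin, then scale by 1/radius."""
    return (patch - center) / radius

def pca_rotation(patch):
    """Rotation matrix R whose rows are the PCA axes of `patch`.

    Assumption: least-variance axis -> z, largest-variance axis -> x,
    and y completes a right-handed frame.
    """
    centered = patch - patch.mean(axis=0)
    cov = centered.T @ centered / len(patch)
    _, eigvecs = np.linalg.eigh(cov)       # eigenvalues come back in ascending order
    z_axis = eigvecs[:, 0]
    x_axis = eigvecs[:, 2]
    y_axis = np.cross(z_axis, x_axis)
    return np.stack([x_axis, y_axis, z_axis])

def resample_patch(patch, n_points=500, rng=None):
    """Randomly downsample, or pad with the origin (an assumption), to n_points."""
    rng = np.random.default_rng() if rng is None else rng
    if len(patch) >= n_points:
        idx = rng.choice(len(patch), size=n_points, replace=False)
        return patch[idx]
    pad = np.zeros((n_points - len(patch), 3))
    return np.vstack([patch, pad])

# Synthetic, nearly planar patch, tilted by 90 degrees about x (illustrative only).
rng = np.random.default_rng(0)
raw = rng.uniform(-1.0, 1.0, size=(800, 3)) * np.array([1.0, 0.5, 0.02])
tilt = np.array([[1.0, 0.0, 0.0],
                 [0.0, 0.0, -1.0],
                 [0.0, 1.0, 0.0]])
raw = raw @ tilt.T

patch = normalize_patch(raw, center=raw[0], radius=2.0)
R = pca_rotation(patch)
aligned = patch @ R.T                      # each point p becomes R p
fixed = resample_patch(aligned, n_points=500, rng=rng)
```

Because `R` is a rotation, its inverse is simply `R.T`, which is what the decoder stage needs to map a predicted displacement back to the original frame.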

1.2 The Pointfilter Framework

Main idea, in the authors' words:

The key idea of our Pointfilter is to project each noisy point onto the underlying surface according to its neighboring structure.

To realize this idea, the authors design an encoder-decoder network.

[Figure: overview of the Pointfilter encoder-decoder network]

1.2.1 Encoder

Input: the PCA-aligned point patch.

Goal: extract a rich feature representation from the input patch.

[Figure: encoder architecture]

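The encoder has two parts:

- a feature extractor (MLPs), which extracts features at different scales;
- a collector, which aggregates the per-point features ($N\times1024$) into a single 1024-dimensional vector via max pooling.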

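The "shared parameters" label in the figure is nothing special: it simply means the same MLP is applied to every point, which is also how PointNet describes it. The code closely follows PointNet, with Batch Normalization applied after each layer so that the features at every layer stay normalized.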
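
To make "shared MLP plus max pooling" concrete, here is a framework-free forward pass in NumPy with random weights. It is a sketch, not the authors' encoder: Batch Normalization is omitted, the hidden widths are an assumption (only the final 1024 comes from these notes), and `shared_mlp_encoder` is a hypothetical name.

```python
import numpy as np

def shared_mlp_encoder(patch, weights, biases):
    """Apply the same MLP to every point, then max-pool over the points."""
    feat = patch                                   # (N, 3)
    for W, b in zip(weights, biases):
        feat = np.maximum(feat @ W + b, 0.0)       # same W, b for all points + ReLU
    return feat.max(axis=0)                        # collector: (N, 1024) -> (1024,)

rng = np.random.default_rng(0)
# Hidden widths are an assumption; only the 1024-d output is stated in the notes.
widths = [3, 64, 128, 256, 512, 1024]
weights = [rng.normal(scale=np.sqrt(2.0 / a), size=(a, b))
           for a, b in zip(widths[:-1], widths[1:])]
biases = [np.zeros(b) for b in widths[1:]]

patch = rng.normal(size=(500, 3))                  # stand-in for a preprocessed patch
code = shared_mlp_encoder(patch, weights, biases)

# Max pooling makes the result independent of the order of the points.
shuffled = shared_mlp_encoder(patch[rng.permutation(500)], weights, biases)
```

The order independence shown in the last line is the reason a symmetric operation such as max pooling is used as the collector.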

1.2.2 Decoder

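The decoder is much more direct: it is just a stack of fully connected layers. At the end, the result has to be mapped back to the original coordinate frame, hence the multiplication by $R^{-1}$.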

[Figure: decoder architecture]

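As for the $+$ at the end of the pipeline, the formula
$\bar{p_i}=rR^{-1}f\left(R(\hat{\mathcal{P_i}}-\hat{p_i})/r\right)+\hat{p_i}$
shows that what the network learns is the part removed by denoising: for a single point, $rR^{-1}f\left(R(\hat{\mathcal{P_i}}-\hat{p_i})/r\right)$ is simply a displacement vector.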

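The code is as follows:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class pointfilter_decoder(nn.Module):
    def __init__(self):
        super(pointfilter_decoder, self).__init__()

        # Fully connected layers: 1024-d patch feature -> 3-d displacement
        self.fc1 = nn.Linear(1024, 512)
        self.fc2 = nn.Linear(512, 256)
        self.fc3 = nn.Linear(256, 3)

        self.bn1 = nn.BatchNorm1d(512)
        self.bn2 = nn.BatchNorm1d(256)

        # Dropout layers are defined but left disabled in forward()
        self.dropout_1 = nn.Dropout(0.3)
        self.dropout_2 = nn.Dropout(0.3)

    def forward(self, x):
        # x: (batch, 1024) feature vector produced by the encoder
        x = F.relu(self.bn1(self.fc1(x)))
        # x = self.dropout_1(x)
        x = F.relu(self.bn2(self.fc2(x)))
        # x = self.dropout_2(x)
        # tanh bounds each displacement component to (-1, 1)
        x = torch.tanh(self.fc3(x))

        return x
```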
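
The last step of the formula in this section, turning the network output into a denoised point, can be sketched as below. The function name `denoise_point` and the numbers are hypothetical; the sketch assumes $R$ is a pure rotation, so $R^{-1}=R^T$.

```python
import numpy as np

def denoise_point(noisy_point, displacement, R, radius):
    """p_bar = r * R^{-1} * d + p_hat, using R^{-1} = R^T for a rotation."""
    return radius * (R.T @ displacement) + noisy_point

# Hypothetical values: a 90-degree rotation about z and a made-up network output.
R = np.array([[0.0, -1.0, 0.0],
              [1.0,  0.0, 0.0],
              [0.0,  0.0, 1.0]])
p_hat = np.array([1.0, 2.0, 3.0])     # noisy point in the original frame
d = np.array([0.1, 0.0, 0.0])         # displacement in the normalized, aligned frame
r = 0.5                               # patch radius

p_bar = denoise_point(p_hat, d, R, r)
```

Undoing the steps, `R @ ((p_bar - p_hat) / r)` recovers `d`, which confirms that the mapping is the exact inverse of the preprocessing.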

1.2.3 Loss function

To keep the filtered point cloud as close as possible to the ground truth while preserving sharp features, the authors define two projection losses. The first is
$L^a_{proj} = \frac{\sum_{p_j\in \mathcal{P_i}}\mid (\bar{p_i}-p_j)\cdot n^T_{p_j}\mid \cdot \phi(\Vert\bar{p_i}-p_j\Vert)}{\sum_{p_j\in \mathcal{P_i}}\phi(\Vert\bar{p_i}-p_j\Vert)}$
where $n_{p_j}$ is the normal of the ground-truth point $p_j$ and $\phi(\Vert\bar{p_i}-p_j\Vert)$ is a Gaussian function:
$\phi(\Vert\bar{p_i}-p_j\Vert)=\exp\left(-\frac{\Vert\bar{p_i}-p_j\Vert^2}{\sigma_p^2}\right)$
Here $\sigma_p=\sqrt{diag/m}$, where $diag$ is the length of the bounding-box diagonal of $\mathcal{P_i}$ and $m=\mid\hat{\mathcal{P_i}}\mid$.

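Besides staying close to the ground truth, the filtered points should also be evenly distributed. To this end, a repulsion term penalizes point clustering:
$L=\eta L^a_{proj}+(1-\eta)L_{rep}$
$L_{rep}=\max_{p_j \in \mathcal{P_i}}\mid\bar{p_i}-p_j\mid$
where $\eta$ is a trade-off parameter; the authors train with $\eta=0.97$.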
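
The projection loss $L^a_{proj}$, the repulsion term, and their weighted sum can be written out in NumPy as a rough sketch. It handles a single filtered point, reads $\mid\bar{p_i}-p_j\mid$ as the Euclidean distance (an assumption), and uses hypothetical function names and toy data: a flat ground-truth patch on $z=0$ with a filtered point 0.1 above it.

```python
import numpy as np

def proj_loss_a(p_bar, gt_points, gt_normals, sigma_p):
    """Gaussian-weighted mean of point-to-tangent-plane distances."""
    diff = p_bar - gt_points                              # (M, 3)
    dist = np.linalg.norm(diff, axis=1)
    phi = np.exp(-dist ** 2 / sigma_p ** 2)               # Gaussian weights
    proj = np.abs(np.sum(diff * gt_normals, axis=1))      # |(p_bar - p_j) . n_j|
    return np.sum(proj * phi) / np.sum(phi)

def repulsion_loss(p_bar, gt_points):
    """Largest distance from the filtered point to the ground-truth patch."""
    return np.max(np.linalg.norm(p_bar - gt_points, axis=1))

def total_loss(p_bar, gt_points, gt_normals, sigma_p, eta=0.97):
    return (eta * proj_loss_a(p_bar, gt_points, gt_normals, sigma_p)
            + (1.0 - eta) * repulsion_loss(p_bar, gt_points))

# Toy ground-truth patch: a 3x3 grid on the plane z = 0, all normals +z.
xs, ys = np.meshgrid([-1.0, 0.0, 1.0], [-1.0, 0.0, 1.0])
gt_points = np.stack([xs.ravel(), ys.ravel(), np.zeros(9)], axis=1)
gt_normals = np.tile([0.0, 0.0, 1.0], (9, 1))

diag = np.linalg.norm(gt_points.max(axis=0) - gt_points.min(axis=0))
sigma_p = np.sqrt(diag / len(gt_points))   # m taken as the toy patch size here

p_bar = np.array([0.0, 0.0, 0.1])
loss = total_loss(p_bar, gt_points, gt_normals, sigma_p)
```

On a flat patch every tangent-plane distance equals the height of the point, so the projection loss is exactly 0.1 regardless of the weights.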

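The loss above resembles Gaussian filtering, and in experiments it does indeed smooth out sharp features. Motivated by the feature-preserving strength of bilateral filtering, the authors add the normal similarity between the current point and its neighboring points as a constraint, which gives the following loss:
$L^b_{proj} = \frac{\sum_{p_j\in \mathcal{P_i}}\mid (\bar{p_i}-p_j)\cdot n^T_{p_j}\mid \cdot \phi(\Vert\bar{p_i}-p_j\Vert)\theta(n_{\bar{p_i}},n_{p_j})}{\sum_{p_j\in \mathcal{P_i}}\phi(\Vert\bar{p_i}-p_j\Vert)\theta(n_{\bar{p_i}},n_{p_j})}$
where
$\theta(n_{\bar{p_i}},n_{p_j})=\exp\left(-\frac{1-n^T_{\bar{p_i}}n_{p_j}}{1-\cos(\sigma_n)}\right)$
$\sigma_n$ is the support angle, set to 15° by default. The normal $n_{\bar{p_i}}$ of the filtered point is taken to be the normal of its nearest point in the ground-truth patch.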
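
The effect of the normal-similarity weight $\theta$ is easiest to see near a sharp edge. The sketch below evaluates the projection loss with and without $\theta$ for a point lying close to one face of a right-angle edge; the function name `proj_loss`, the `bilateral` flag, and the toy geometry are all hypothetical.

```python
import numpy as np

def proj_loss(p_bar, gt_points, gt_normals, sigma_p, sigma_n_deg=15.0, bilateral=True):
    """Projection loss; with bilateral=True the weights include normal similarity."""
    diff = p_bar - gt_points
    dist = np.linalg.norm(diff, axis=1)
    w = np.exp(-dist ** 2 / sigma_p ** 2)                    # phi
    if bilateral:
        n_bar = gt_normals[np.argmin(dist)]                  # normal of nearest GT point
        cos_sim = gt_normals @ n_bar
        w = w * np.exp(-(1.0 - cos_sim) / (1.0 - np.cos(np.radians(sigma_n_deg))))
    proj = np.abs(np.sum(diff * gt_normals, axis=1))
    return np.sum(proj * w) / np.sum(w)

# Toy right-angle edge: face A lies on z = 0 (normal +z), face B on x = 0 (normal +x).
face_a = np.array([[x, y, 0.0] for x in (-1.0, -0.5) for y in (-0.5, 0.0, 0.5)])
face_b = np.array([[0.0, y, z] for z in (-0.5, -1.0) for y in (-0.5, 0.0, 0.5)])
gt_points = np.vstack([face_a, face_b])
gt_normals = np.vstack([np.tile([0.0, 0.0, 1.0], (6, 1)),
                        np.tile([1.0, 0.0, 0.0], (6, 1))])

p_bar = np.array([-0.5, 0.0, 0.05])       # filtered point just above face A
loss_a = proj_loss(p_bar, gt_points, gt_normals, sigma_p=1.0, bilateral=False)
loss_b = proj_loss(p_bar, gt_points, gt_normals, sigma_p=1.0, bilateral=True)
```

With $\theta$ enabled, the points on face B receive almost zero weight, so the loss measures only the distance to face A instead of pulling the point toward a blend of both faces.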

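2. Strengths and Weaknesses

2.1 Strengths

- It can learn a complex yet compact representation of point clouds.
- Its loss design incorporates ideas from traditional denoising methods.

2.2 Weaknesses

Under heavy noise, sharp features are not preserved very well.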
