【Homography Estimation】《Deep Image Homography Estimation》

bryant_meng

Last modified: 2022-10-20 15:51:56

First published: 2022-07-20 10:26:37

Article link: https://blog.csdn.net/bryant_meng/article/details/125811906

arXiv-2016

Contents

1 Background and Motivation
2 Related Work
3 Advantages / Contributions
4 Method
5 Experiments
- 5.1 Datasets
- 5.2 Experiments
6 Conclusion

1 Background and Motivation

Homography estimation: from traditional algorithms to deep learning

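The paper uses a convolutional network to regress the homography matrix directly (transformation estimation, homography estimation). A homography has 8 degrees of freedom.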

The homography is an essential part of monocular SLAM systems in scenarios such as:

Rotation only movements
Planar scenes
Scenes in which objects are very far from the viewer

2 Related Work

None.

3 Advantages / Contributions

A convolutional neural network learns the offsets of four points, and these offsets are used for Homography Estimation.

4 Method

（1）The 4-point homography parameterization
The homography matrix maps a point $(u, v)$ in one image to $(u^{'}, v^{'})$ in the other.
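
For reference, the mapping can be written in homogeneous coordinates as below. This is the standard form of a planar homography; the entry names $H_{ij}$ are the usual row/column indices and are assumed to match the notation of the paper's figure, which is not reproduced here.

$$
\begin{pmatrix} u^{'} \\ v^{'} \\ 1 \end{pmatrix}
\sim
\begin{pmatrix}
H_{11} & H_{12} & H_{13} \\
H_{21} & H_{22} & H_{23} \\
H_{31} & H_{32} & H_{33}
\end{pmatrix}
\begin{pmatrix} u \\ v \\ 1 \end{pmatrix}
$$

The symbol $\sim$ means equality up to a non-zero scale factor, which is why the 9 entries carry only 8 degrees of freedom.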

The entries $H_{11}$, $H_{12}$, $H_{21}$, $H_{22}$ relate to rotation, while $H_{13}$ and $H_{23}$ relate to translation.

Balancing the rotational and translational terms as part of an optimization problem is difficult

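The 9 parameters of the homography matrix only have a practical meaning in combination and are not cleanly decoupled; together they have 8 degrees of freedom. The authors therefore learn the offsets of 4 corner coordinates directly, i.e. how each corner $(u, v)$ moves to $(u^{'}, v^{'})$ (some traditional methods use the same parameterization).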
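
As a sketch of this parameterization, the 4 corner offsets can be collected into a $4 \times 2$ matrix. The name $H_{4point}$ and the symbols $\Delta u_i$, $\Delta v_i$ follow the paper's notation as I recall it and should be read as an assumption rather than a quotation.

$$
H_{4point} =
\begin{pmatrix}
\Delta u_1 & \Delta v_1 \\
\Delta u_2 & \Delta v_2 \\
\Delta u_3 & \Delta v_3 \\
\Delta u_4 & \Delta v_4
\end{pmatrix},
\qquad
\Delta u_i = u_i^{'} - u_i,
\quad
\Delta v_i = v_i^{'} - v_i
$$

These 8 numbers match the 8 degrees of freedom of the homography, and all of them are pixel displacements, so they share one scale.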

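Once the 4 corner offsets have been learned, the homography matrix can be computed with OpenCV's getPerspectiveTransform().

Given the 4 points from the source image and the 4 new points obtained by adding the offsets, getPerspectiveTransform returns a (3, 3) matrix.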
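
To make the conversion concrete, here is a minimal NumPy-only sketch that solves the same kind of linear system (with $H_{33}$ fixed to 1, which to my understanding is also what getPerspectiveTransform does). The function name `homography_from_corner_offsets`, the 128-pixel patch, and the offset values are made up for illustration.

```python
import numpy as np

def homography_from_corner_offsets(corners, offsets):
    """Solve for the 3x3 homography mapping corners to corners + offsets (H33 = 1)."""
    src = np.asarray(corners, dtype=np.float64)
    dst = src + np.asarray(offsets, dtype=np.float64)
    A, b = [], []
    for (u, v), (u2, v2) in zip(src, dst):
        # u2 = (h11*u + h12*v + h13) / (h31*u + h32*v + 1), rearranged to be linear in h
        A.append([u, v, 1, 0, 0, 0, -u2 * u, -u2 * v])
        b.append(u2)
        # v2 = (h21*u + h22*v + h23) / (h31*u + h32*v + 1)
        A.append([0, 0, 0, u, v, 1, -v2 * u, -v2 * v])
        b.append(v2)
    h = np.linalg.solve(np.array(A), np.array(b))
    return np.append(h, 1.0).reshape(3, 3)

# Hypothetical example: corners of a 128x128 patch and made-up predicted offsets.
corners = np.array([[0, 0], [127, 0], [127, 127], [0, 127]])
offsets = np.array([[5, -3], [-8, 4], [6, 7], [-2, -9]])
H = homography_from_corner_offsets(corners, offsets)

# Sanity check: H should move every corner by exactly its offset.
pts = np.hstack([corners, np.ones((4, 1))]) @ H.T
mapped = pts[:, :2] / pts[:, 2:]
print(np.allclose(mapped, corners + offsets))
```

In practice, passing the same two point sets as float32 arrays to cv2.getPerspectiveTransform should give the same matrix up to numerical precision.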

（2）Data Generation for Homography Estimation

Training data is generated by applying random projective transformations to a large dataset (MS COCO).

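Randomly pick a rectangular region p in the original image and randomly perturb its four corners. From the corner coordinates before and after the perturbation, compute the homography $H^{AB}$. In the paper, the inverse $H^{BA} = (H^{AB})^{-1}$ is then applied to the original image to produce a new image, and the patch p' is cropped from it at the same position p. The patches p and p' are fed jointly into the network, which learns the offsets of the four corners (step 2); $H^{AB}$ is then recovered from those offsets.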
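
The procedure above can be sketched in NumPy as follows. This is an illustration under stated assumptions, not the authors' code: the names `solve_homography` and `make_training_pair` are hypothetical, the defaults `patch=128` and `rho=32` are assumed values, the input is assumed to be a single-channel image, and nearest-neighbour sampling stands in for a proper image warp such as cv2.warpPerspective.

```python
import numpy as np

def solve_homography(src, dst):
    """3x3 homography with H33 = 1 that maps the 4 src points onto the 4 dst points."""
    A, b = [], []
    for (u, v), (u2, v2) in zip(src, dst):
        A.append([u, v, 1, 0, 0, 0, -u2 * u, -u2 * v])
        b.append(u2)
        A.append([0, 0, 0, u, v, 1, -v2 * u, -v2 * v])
        b.append(v2)
    h = np.linalg.solve(np.array(A, dtype=np.float64), np.array(b, dtype=np.float64))
    return np.append(h, 1.0).reshape(3, 3)

def make_training_pair(image, patch=128, rho=32, rng=None):
    """Return (stacked patches of shape patch x patch x 2, 4x2 corner offsets)."""
    rng = np.random.default_rng() if rng is None else rng
    h, w = image.shape[:2]
    # Keep the patch rho pixels away from the border so perturbed corners stay inside.
    x0 = int(rng.integers(rho, w - patch - rho + 1))
    y0 = int(rng.integers(rho, h - patch - rho + 1))
    x1, y1 = x0 + patch - 1, y0 + patch - 1
    corners = np.array([[x0, y0], [x1, y0], [x1, y1], [x0, y1]], dtype=np.float64)
    # Random corner perturbation in [-rho, rho]; this is the regression target.
    offsets = rng.integers(-rho, rho + 1, size=(4, 2)).astype(np.float64)
    H_ab = solve_homography(corners, corners + offsets)
    # Warping the image with the inverse of H_ab and cropping at p is equivalent to
    # sampling the original image at H_ab @ (x, y, 1) for every pixel (x, y) of p.
    ys, xs = np.mgrid[y0:y0 + patch, x0:x0 + patch]
    pts = np.stack([xs.ravel(), ys.ravel(), np.ones(xs.size)])
    mapped = H_ab @ pts
    u = np.rint(mapped[0] / mapped[2]).astype(int).clip(0, w - 1)
    v = np.rint(mapped[1] / mapped[2]).astype(int).clip(0, h - 1)
    patch_a = image[y0:y0 + patch, x0:x0 + patch]
    patch_b = image[v, u].reshape(patch, patch)
    return np.stack([patch_a, patch_b], axis=-1), offsets

# Hypothetical usage with a random grayscale image standing in for a dataset image.
demo_rng = np.random.default_rng(0)
image = demo_rng.integers(0, 256, size=(240, 320)).astype(np.uint8)
pair, target = make_training_pair(image, rng=demo_rng)
print(pair.shape, target.shape)
```

The network input would be `pair` and the label would be `target`; stacking the two patches along the channel axis is one way to feed them jointly, assumed here for illustration.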

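```python
import cv2
import numpy as np

# Load the two images.
im1 = cv2.imread('left.jpg')
im2 = cv2.imread('right.jpg')

# Four corresponding points (x, y) in pixel coordinates, stored as float32.
src_points = np.array([[581, 297], [1053, 173], [1041, 895], [558, 827]], dtype=np.float32)
dst_points = np.array([[571, 257], [963, 333], [965, 801], [557, 827]], dtype=np.float32)

# Estimate the homography that maps src_points onto dst_points.
# The second return value is the inlier mask, which is not needed here.
H, _ = cv2.findHomography(src_points, dst_points)

# Output size is given as (width, height).
h, w = im2.shape[:2]

# Warp im2 with H into an image of the same size.
im2_warp = cv2.warpPerspective(im2, H, (w, h))
```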