【 Python高级编程】什么是图像仿射变换

最新推荐文章于 2024-06-27 16:01:16 发布

小龙

最新推荐文章于 2024-06-27 16:01:16 发布

阅读量640

点赞数 31

分类专栏： Python之ACGI学习合集【更新中】文章标签： python 仿射变换影像数据 opencv

本文为博主原创文章，未经博主允许不得转载。

本文链接：https://blog.csdn.net/qq_36631076/article/details/139752674

版权

Python之ACGI学习合集【更新中】专栏收录该内容

50 篇文章 2 订阅

订阅专栏

图像仿射变换（Affine Transformation）是图像处理中的一种基本操作，它通过对图像的几何位置进行线性变换来实现图像的缩放、旋转、平移、剪切等操作。仿射变换保持了图像中的直线性和平行性，即变换后的图像中，原来平行的线依然是平行的，原来直线的部分依然是直线。

1.仿射变换的数学基础

仿射变换可以用一个矩阵乘法和一个向量加法来表示：

$\begin{pmatrix} x' \\ y' \end{pmatrix} = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} + \begin{pmatrix} e \\ f \end{pmatrix}$

其中：
$(x, y)$ 是原图像中的点。
$(x^{'}, y^{'})$ 是变换后的点。
$\begin{pmatrix} a & b \\ c & d \end{pmatrix}$ 是线性变换矩阵，决定了旋转、缩放、剪切等变换。
$\begin{pmatrix} e \\ f \end{pmatrix}$ 是平移向量。

2. 常见的仿射变换

平移（Translation）：平移操作示意图像在 x 和 y 方向上移动。
$\begin{pmatrix} x' \\ y' \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} + \begin{pmatrix} tx \\ ty \end{pmatrix}$

其中 $(t x, t y)$ 是平移的距离。

旋转（Rotation）：旋转操作示意图像绕某一点旋转。
$\begin{pmatrix} x' \\ y' \end{pmatrix} = \begin{pmatrix} \cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}$

其中 $\theta$ 是旋转角度。

缩放（Scaling）：缩放操作示意图像按比例放大或缩小。
$\begin{pmatrix} x' \\ y' \end{pmatrix} = \begin{pmatrix} sx & 0 \\ 0 & sy \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}$

其中 $s x$ 和 $sy$ 是 x 和 y 方向的缩放因子。

剪切（Shearing）：剪切操作示意图像在某一方向上倾斜。
$\begin{pmatrix} x' \\ y' \end{pmatrix} = \begin{pmatrix} 1 & sh_x \\ sh_y & 1 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}$

其中 $sh_x$ 和 $sh_y$ 是剪切因子。

3.在 OpenCV 中的实现

在 OpenCV 中，可以使用 cv2.getAffineTransform() 或 cv2.warpAffine() 函数来实现仿射变换。

1. 使用 `cv2.getAffineTransform()`

这是通过三个点来确定一个仿射变换矩阵。

import cv2
import numpy as np

# 读取图像
image = cv2.imread('path_to_your_image.jpg')

# 定义三个点及其变换后的位置
pts1 = np.float32([[50, 50], [200, 50], [50, 200]])
pts2 = np.float32([[10, 100], [200, 50], [100, 250]])

# 计算仿射变换矩阵
M = cv2.getAffineTransform(pts1, pts2)

# 应用仿射变换
transformed_image = cv2.warpAffine(image, M, (image.shape[1], image.shape[0]))

# 显示结果
cv2.imshow('Transformed Image', transformed_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

2. 使用 `cv2.warpAffine()`

直接应用预计算的仿射变换矩阵。

import cv2
import numpy as np

# 读取图像
image = cv2.imread('path_to_your_image.jpg')

# 定义仿射变换矩阵
M = np.float32([[1, 0, 100], [0, 1, 50]])  # 平移操作

# 应用仿射变换
transformed_image = cv2.warpAffine(image, M, (image.shape[1], image.shape[0]))

# 显示结果
cv2.imshow('Transformed Image', transformed_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

4，总结

仿射变换是图像处理中的一种基本变换，能够实现图像的平移、旋转、缩放和剪切等操作。通过理解仿射变换的数学基础和在 OpenCV 中的实现方法，可以灵活地对图像进行各种几何变换。

小龙

关注

31
点赞
踩
10

收藏

觉得还不错? 一键收藏
0
评论
【 Python高级编程】什么是图像仿射变换

仿射变换是图像处理中的一种基本变换，能够实现图像的平移、旋转、缩放和剪切等操作。通过理解仿射变换的数学基础和在 OpenCV 中的实现方法，可以灵活地对图像进行各种几何变换。
复制链接

扫一扫