Python学习之路（玖）-图像识别的实现

香蕉可乐荷包蛋

于 2025-05-11 19:49:04 发布

阅读量522

点赞数 9

分类专栏： python 文章标签： python 学习开发语言

本文链接：https://blog.csdn.net/Jcrows/article/details/147877815

版权

python 专栏收录该内容

11 篇文章

订阅专栏

在 Python 中，图像识别对比通常涉及 图像相似度计算 或 目标检测与匹配。-浅看一下就行，具体功能代码，后期会逐步上传资源。

一、技术方案

1. 图像相似度计算

目标：计算两幅图像的相似度。
工具：
- OpenCV：图像处理基础库。
- 特征提取：SIFT、SURF、ORB。
- 深度学习：使用预训练模型（如 VGG、ResNet）提取特征。

2. 目标检测与匹配

目标：检测图像中的目标，并进行匹配。
工具：
- YOLO/Faster R-CNN：目标检测模型。
- OpenCV：特征匹配（如 BFMatcher、FLANN）。

3. 图像哈希

目标：通过哈希值快速比较图像。
工具：
- ImageHash：计算图像的感知哈希（如 pHash、dHash）。

二、实现步骤

1. 图像相似度计算

使用 OpenCV 或 深度学习模型 计算图像相似度。

示例代码（OpenCV + SIFT）：

import cv2

# 加载图像
img1 = cv2.imread('image1.jpg', cv2.IMREAD_GRAYSCALE)
img2 = cv2.imread('image2.jpg', cv2.IMREAD_GRAYSCALE)

# 初始化 SIFT 检测器
sift = cv2.SIFT_create()

# 检测关键点和描述符
keypoints1, descriptors1 = sift.detectAndCompute(img1, None)
keypoints2, descriptors2 = sift.detectAndCompute(img2, None)

# 使用 BFMatcher 进行匹配
bf = cv2.BFMatcher()
matches = bf.knnMatch(descriptors1, descriptors2, k=2)

# 过滤匹配点
good_matches = []
for m, n in matches:
    if m.distance < 0.75 * n.distance:
        good_matches.append(m)

# 计算相似度
similarity = len(good_matches) / min(len(keypoints1), len(keypoints2))
print("Similarity:", similarity)

示例代码（深度学习 + VGG）：

from tensorflow.keras.applications.vgg16 import VGG16, preprocess_input
from tensorflow.keras.preprocessing import image
from sklearn.metrics.pairwise import cosine_similarity
import numpy as np

# 加载预训练模型
model = VGG16(weights='imagenet', include_top=False, pooling='avg')

# 加载图像并预处理
def load_and_preprocess(img_path):
    img = image.load_img(img_path, target_size=(224, 224))
    img = image.img_to_array(img)
    img = np.expand_dims(img, axis=0)
    return preprocess_input(img)

img1 = load_and_preprocess('image1.jpg')
img2 = load_and_preprocess('image2.jpg')

# 提取特征
features1 = model.predict(img1).flatten()
features2 = model.predict(img2).flatten()

# 计算余弦相似度
similarity = cosine_similarity([features1], [features2])[0][0]
print("Similarity:", similarity)

2. 目标检测与匹配

使用 YOLO 检测目标，并使用 OpenCV 进行匹配。

示例代码（YOLO + OpenCV）：

from ultralytics import YOLO
import cv2

# 加载 YOLO 模型
model = YOLO('yolov8n.pt')

# 检测图像中的目标
results1 = model('image1.jpg')
results2 = model('image2.jpg')

# 提取检测结果
boxes1 = results1[0].boxes.xyxy.cpu().numpy()
boxes2 = results2[0].boxes.xyxy.cpu().numpy()

# 计算 IoU（交并比）
def calculate_iou(box1, box2):
    x1 = max(box1[0], box2[0])
    y1 = max(box1[1], box2[1])
    x2 = min(box1[2], box2[2])
    y2 = min(box1[3], box2[3])
    intersection = max(0, x2 - x1) * max(0, y2 - y1)
    area1 = (box1[2] - box1[0]) * (box1[3] - box1[1])
    area2 = (box2[2] - box2[0]) * (box2[3] - box2[1])
    return intersection / (area1 + area2 - intersection)

# 匹配目标
for box1 in boxes1:
    for box2 in boxes2:
        iou = calculate_iou(box1, box2)
        if iou > 0.5:  # 设置阈值
            print("Matched boxes with IoU:", iou)

3. 图像哈希

使用 ImageHash 计算图像的哈希值。

示例代码（ImageHash）：

from PIL import Image
import imagehash

# 加载图像
hash1 = imagehash.phash(Image.open('image1.jpg'))
hash2 = imagehash.phash(Image.open('image2.jpg'))

# 计算哈希距离
distance = hash1 - hash2
print("Hash distance:", distance)

三、优化建议

优化点	建议
特征提取	使用深度学习模型（如 VGG、ResNet）提取更高级的特征
匹配算法	使用 FLANN 替代 BFMatcher，提升匹配效率
目标检测	使用 YOLOv8 或 Faster R-CNN 提高检测精度
哈希算法	根据需求选择 pHash（感知哈希）或 dHash（差异哈希）
硬件加速	使用 GPU 加速深度学习模型推理