使用Python和计算机视觉技术构建花卉分类器：一个详细的图像识别工具指南

本文链接：https://blog.csdn.net/qq_38334677/article/details/132422073

第一部分：项目介绍与数据预处理

1. 项目介绍

随着机器学习和计算机视觉技术的进步，图像识别在众多领域中取得了巨大的成功。其中，花卉分类是一个独特且有趣的挑战。本教程旨在引导你使用Python和计算机视觉技术构建一个针对17种不同花卉的分类器。

2. 数据集概述

我们所使用的数据集包含了17种不同类别的花卉，每种类别都有80张图片。这为我们提供了一个相对均衡的多类分类任务，每个类别的数据量相对平衡。

3. 数据预处理

为了构建一个强大的图像识别工具，首先需要对数据进行适当的预处理。这样可以确保模型能够从原始图像中更好地学习特征。

import cv2
import os
import numpy as np

data_dir = './flowers_dataset'
categories = os.listdir(data_dir)
labels = [i for i in range(len(categories))]

label_dict = dict(zip(categories,labels))

img_size = 150
data = []
target = []

for category in categories:
    folder_path = os.path.join(data_dir, category)
    img_names = os.listdir(folder_path)
        
    for img_name in img_names:
        img_path = os.path.join(folder_path, img_name)
        img = cv2.imread(img_path)

        try:
            resized = cv2.resize(img, (img_size, img_size))
            # 将彩色图像转化为灰度图，这步可选
            # grayscale = cv2.cvtColor(resized, cv2.COLOR_BGR2GRAY)
            data.append(resized)
            target.append(label_dict[category])

        except Exception as e:
            print('Exception:', e)

data = np.array(data)/255.0
data = np.reshape(data,(data.shape[0], img_size, img_size, 3))
target = np.array(target)

上述代码首先导入所需的库，然后为数据集中的每种花卉定义了标签。然后，我们读取每张图片，调整其大小，将其添加到数据列表中，并将其相应的标签添加到目标列表中。最后，我们对数据进行归一化并调整其形状，使其适合模型训练。

为了获取更详细的数据预处理和数据增强过程，以及该项目的完整步骤，您可以下载完整项目。

至此，我们已经完成了数据预处理的步骤。

第二部分：模型构建与训练

4. 模型选择与构建

为了处理这种分类任务，我们将使用深度学习中的卷积神经网络 (CNN)。CNN 在图像识别任务中已被证明是非常有效的。

import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout

model = Sequential()

model.add(Conv2D(32, (3, 3), activation='relu', input_shape=data.shape[1:]))
model.add(MaxPooling2D(2,2))

model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(MaxPooling2D(2,2))

model.add(Conv2D(128, (3, 3), activation='relu'))
model.add(MaxPooling2D(2,2))

model.add(Flatten())
model.add(Dense(512, activation='relu'))
model.add(Dropout(0.5))
model.add(Dense(17, activation='softmax'))

model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])

model.summary()

此模型首先通过三个卷积层和最大池化层进行特征提取，然后通过全连接层进行分类。为了防止过拟合，我们还加入了一个 Dropout 层。

5. 模型训练

有了模型后，下一步是使用数据训练它。但在进行训练之前，我们还需将数据分为训练集和验证集。

from sklearn.model_selection import train_test_split

train_data, test_data, train_target, test_target = train_test_split(data, target, test_size=0.2, random_state=42)

history = model.fit(train_data, train_target, epochs=20, validation_split=0.2)

上述代码首先将数据划分为训练集和测试集，然后使用训练集进行模型训练，并使用 20% 的数据作为验证集。

6. 结果评估

训练完成后，我们需要评估模型的性能。评估通常包括查看训练过程中的损失和准确性，以及在测试集上评估模型。

import matplotlib.pyplot as plt

# 绘制损失曲线
plt.plot(history.history['loss'], label='Training Loss')
plt.plot(history.history['val_loss'], label='Validation Loss')
plt.legend()
plt.title('Training and Validation Loss')
plt.show()

# 绘制准确性曲线
plt.plot(history.history['accuracy'], label='Training Accuracy')
plt.plot(history.history['val_accuracy'], label='Validation Accuracy')
plt.legend()
plt.title('Training and Validation Accuracy')
plt.show()

# 在测试集上评估模型
test_loss, test_acc = model.evaluate(test_data, test_target)
print(f"Test accuracy: {test_acc*100:.2f}%")

这些图表和数字为我们提供了有关模型性能的关键信息，可帮助我们进一步优化模型。

第三部分：模型预测与项目总结

7. 使用模型进行预测

模型训练完成后，我们可以用它来预测新的、未知的花卉图片的类别。以下是一个简单的预测例子：

def predict_flower_category(image_path, model):
    img = cv2.imread(image_path)
    resized = cv2.resize(img, (img_size, img_size))
    normalized = resized / 255.0
    reshaped = np.reshape(normalized, (1, img_size, img_size, 3))
    prediction = model.predict(reshaped)
    return categories[np.argmax(prediction)]

# 测试预测功能
sample_image_path = "./flowers_dataset/rose/sample.jpg"
predicted_category = predict_flower_category(sample_image_path, model)
print(f"The predicted category for the sample image is: {predicted_category}")