Python项目--智能垃圾分类识别系统开发

天天进步2015

于 2025-04-15 15:44:46 发布

阅读量1.3k

点赞数 25

分类专栏： Python项目实战文章标签： python 分类开发语言

本文链接：https://blog.csdn.net/exlink2012/article/details/147251704

版权

Python项目实战专栏收录该内容

26 篇文章

订阅专栏

1. 项目概述

随着环保意识的提高和垃圾分类政策的推行，智能垃圾分类识别系统成为一个具有实际应用价值的项目。本文将详细介绍如何使用Python开发一个基于计算机视觉和深度学习的智能垃圾分类识别系统，该系统能够通过摄像头实时识别垃圾类型，并给出相应的分类建议。

1.1 项目背景

垃圾分类是环境保护的重要一环，但对于普通民众来说，正确分类各种垃圾仍然是一个挑战。智能垃圾分类识别系统可以帮助人们快速准确地识别垃圾类型，提高垃圾分类的效率和准确性。

1.2 项目目标

开发一个能够实时识别垃圾类型的计算机视觉系统
实现至少四类垃圾（可回收物、厨余垃圾、有害垃圾、其他垃圾）的准确分类
提供友好的用户界面，方便用户使用
构建可扩展的系统架构，支持未来功能扩展

1.3 技术栈

编程语言：Python 3.8+
深度学习框架：TensorFlow 2.x / PyTorch
计算机视觉库：OpenCV
Web框架：Flask
前端技术：HTML, CSS, JavaScript
数据库：SQLite / MySQL

2. 系统设计

2.1 系统架构

智能垃圾分类识别系统采用模块化设计，主要包含以下几个模块：

数据采集与预处理模块：负责图像采集、增强和预处理
图像识别模块：使用深度学习模型识别垃圾类型
分类决策模块：根据识别结果给出分类建议
用户界面模块：提供Web界面，实现人机交互
数据管理模块：管理垃圾分类数据和用户数据

系统架构图如下：

+------------------+     +------------------+     +------------------+
|  数据采集与预处理  | --> |    图像识别模块   | --> |   分类决策模块   |
+------------------+     +------------------+     +------------------+
                                                          |
                                                          v
+------------------+     +------------------+
|   数据管理模块    | <-- |   用户界面模块    |
+------------------+     +------------------+

2.2 数据流设计

用户通过摄像头或上传图片提供垃圾图像
系统对图像进行预处理（调整大小、归一化等）
预处理后的图像输入深度学习模型进行识别
模型输出垃圾类型的概率分布
系统根据概率分布给出分类建议
用户界面展示识别结果和分类建议
系统记录识别结果和用户反馈（可选）

3. 数据集准备

3.1 数据集收集

为了训练高质量的垃圾分类模型，我们需要收集大量的垃圾图像数据。数据来源可以包括：

公开数据集：如TrashNet、TACO等
自行采集的图像数据
网络爬虫获取的图像数据

3.2 数据集划分

将收集到的数据集按照以下比例划分：

训练集：70%
验证集：15%
测试集：15%

3.3 数据增强

为了提高模型的泛化能力，我们对训练数据进行增强处理，包括：

随机旋转
随机裁剪
亮度和对比度调整
水平翻转
添加噪声

数据增强示例代码：

def augment_image(image):
    """对图像进行数据增强"""
    # 随机旋转
    angle = random.uniform(-30, 30)
    image = rotate(image, angle)
    
    # 随机裁剪并调整回原始大小
    h, w = image.shape[:2]
    crop_percent = random.uniform(0.8, 1.0)
    crop_h, crop_w = int(h * crop_percent), int(w * crop_percent)
    top = random.randint(0, h - crop_h)
    left = random.randint(0, w - crop_w)
    image = image[top:top+crop_h, left:left+crop_w]
    image = cv2.resize(image, (w, h))
    
    # 亮度和对比度调整
    alpha = random.uniform(0.8, 1.2)  # 对比度
    beta = random.uniform(-10, 10)    # 亮度
    image = cv2.convertScaleAbs(image, alpha=alpha, beta=beta)
    
    # 水平翻转
    if random.random() > 0.5:
        image = cv2.flip(image, 1)
    
    return image

4. 模型设计与训练

4.1 模型选择

考虑到垃圾分类任务的特点和实际应用场景，我们选择以下几种深度学习模型进行尝试：

MobileNetV2：轻量级模型，适合部署在资源受限的设备上
ResNet50：经典的残差网络，性能稳定
EfficientNet：在准确率和效率之间取得良好平衡的模型

4.2 模型实现

以MobileNetV2为例，实现垃圾分类模型：

def create_model(num_classes=4):
    """创建基于MobileNetV2的垃圾分类模型"""
    # 加载预训练的MobileNetV2模型，不包括顶层
    base_model = tf.keras.applications.MobileNetV2(
        input_shape=(224, 224, 3),
        include_top=False,
        weights='imagenet'
    )
    
    # 冻结基础模型的层
    base_model.trainable = False
    
    # 添加全局平均池化层和分类层
    model = tf.keras.Sequential([
        base_model,
        tf.keras.layers.GlobalAveragePooling2D(),
        tf.keras.layers.Dense(128, activation='relu'),
        tf.keras.layers.Dropout(0.2),
        tf.keras.layers.Dense(num_classes, activation='softmax')
    ])
    
    # 编译模型
    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
        loss='categorical_crossentropy',
        metrics=['accuracy']
    )
    
    return model

4.3 模型训练

使用准备好的数据集对模型进行训练：

def train_model(model, train_data, val_data, epochs=20):
    """训练垃圾分类模型"""
    # 定义回调函数
    callbacks = [
        tf.keras.callbacks.EarlyStopping(
            monitor='val_loss', patience=5, restore_best_weights=True),
        tf.keras.callbacks.ReduceLROnPlateau(
            monitor='val_loss', factor=0.2, patience=3, min_lr=1e-6),
        tf.keras.callbacks.ModelCheckpoint(
            'garbage_classifier.h5', save_best_only=True, monitor='val_accuracy')
    ]
    
    # 训练模型
    history = model.fit(
        train_data,
        validation_data=val_data,
        epochs=epochs,
        callbacks=callbacks
    )
    
    return history, model

4.4 模型评估

对训练好的模型进行评估，计算准确率、精确率、召回率和F1分数：

def evaluate_model(model, test_data):
    """评估垃圾分类模型"""
    # 获取测试集预测结果
    y_pred_prob = model.predict(test_data)
    y_pred = np.argmax(y_pred_prob, axis=1)
    
    # 获取测试集真实标签
    y_true = np.concatenate([y for _, y in test_data], axis=0)
    y_true = np.argmax(y_true, axis=1)
    
    # 计算评估指标
    accuracy = accuracy_score(y_true, y_pred)
    precision = precision_score(y_true, y_pred, average='weighted')
    recall = recall_score(y_true, y_pred, average='weighted')
    f1 = f1_score(y_true, y_pred, average='weighted')
    
    # 打印评估结果
    print(f"Accuracy: {accuracy:.4f}")
    print(f"Precision: {precision:.4f}")
    print(f"Recall: {recall:.4f}")
    print(f"F1 Score: {f1:.4f}")
    
    # 绘制混淆矩阵
    cm = confusion_matrix(y_true, y_pred)
    plt.figure(figsize=(10, 8))
    sns.heatmap(cm, annot=True, fmt='d', cmap='Blues')
    plt.xlabel('Predicted')
    plt.ylabel('True')
    plt.title('Confusion Matrix')
    plt.show()
    
    return accuracy, precision, recall, f1

5. 系统实现

5.1 数据采集与预处理模块

# data_processor.py
import cv2
import numpy as np

class DataProcessor:
    def __init__(self, target_size=(224, 224)):
        self.target_size = target_size
        
    def preprocess_image(self, image):
        """预处理图像用于模型输入"""
        # 调整图像大小
        image = cv2.resize(image, self.target_size)
        
        # 转换颜色空间（如果需要）
        if len(image.shape) == 2:  # 灰度图像
            image = cv2.cvtColor(image, cv2.COLOR_GRAY2RGB)
        elif image.shape[2] == 4:  # RGBA图像
            image = cv2.cvtColor(image, cv2.COLOR_RGBA2RGB)
        
        # 归一化像素值
        image = image.astype(np.float32) / 255.0
        
        return image
    
    def capture_image(self, camera_id=0):
        """从摄像头捕获图像"""
        cap = cv2.VideoCapture(camera_id)
        ret, frame = cap.read()
        cap.release()
        
        if ret:
            return frame
        else:
            raise Exception("无法从摄像头捕获图像")

5.2 图像识别模块

# image_classifier.py
import tensorflow as tf
import numpy as np

class GarbageClassifier:
    def __init__(self, model_path, class_names):
        """初始化垃圾分类器"""
        self.model = tf.keras.models.load_model(model_path)
        self.class_names = class_names
    
    def predict(self, image):
        """预测图像中的垃圾类型"""
        # 确保图像形状正确
        if len(image.shape) == 3:
            image = np.expand_dims(image, axis=0)
        
        # 进行预测
        predictions = self.model.predict(image)
        
        # 获取最可能的类别及其概率
        class_idx = np.argmax(predictions[0])
        probability = predictions[0][class_idx]
        class_name = self.class_names[class_idx]
        
        # 获取所有类别的概率分布
        probabilities = {
            self.class_names[i]: float(predictions[0][i])
            for i in range(len(self.class_names))
        }
        
        return {
            'class_name': class_name,
            'probability': float(probability),
            'probabilities': probabilities
        }

5.3 分类决策模块

# decision_maker.py
class GarbageDecisionMaker:
    def __init__(self, threshold=0.7):
        """初始化决策模块"""
        self.threshold = threshold
        self.disposal_guide = {
            '可回收物': '请放入蓝色垃圾桶',
            '厨余垃圾': '请放入绿色垃圾桶',
            '有害垃圾': '请放入红色垃圾桶',
            '其他垃圾': '请放入灰色垃圾桶'
        }
    
    def make_decision(self, prediction):
        """根据预测结果做出决策"""
        class_name = prediction['class_name']
        probability = prediction['probability']
        
        if probability >= self.threshold:
            decision = {
                'category': class_name,
                'confidence': 'high' if probability > 0.9 else 'medium',
                'disposal_guide': self.disposal_guide.get(class_name, '无处理建议'),
                'probability': probability
            }
        else:
            # 置信度低，可能需要人工确认
            decision = {
                'category': '无法确定',
                'confidence': 'low',
                'disposal_guide': '请人工确认垃圾类型',
                'probability': probability,
                'possible_categories': sorted(
                    prediction['probabilities'].items(),
                    key=lambda x: x[1],
                    reverse=True
                )
            }
        
        return decision

5.4 用户界面模块

使用Flask框架实现Web界面：

# app.py
from flask import Flask, render_template, request, jsonify
import cv2
import numpy as np
import base64
from data_processor import DataProcessor
from image_classifier import GarbageClassifier
from decision_maker import GarbageDecisionMaker

app = Flask(__name__)

# 初始化各模块
data_processor = DataProcessor()
classifier = GarbageClassifier(
    'models/garbage_classifier.h5',
    ['可回收物', '厨余垃圾', '有害垃圾', '其他垃圾']
)
decision_maker = GarbageDecisionMaker()

@app.route('/')
def index():
    """主页"""
    return render_template('index.html')

@app.route('/classify', methods=['POST'])
def classify():
    """处理分类请求"""
    if 'image' not in request.files:
        return jsonify({'error': '没有上传图像'}), 400
    
    # 读取上传的图像
    file = request.files['image']
    img_bytes = file.read()
    nparr = np.frombuffer(img_bytes, np.uint8)
    image = cv2.imdecode(nparr, cv2.IMREAD_COLOR)
    
    # 预处理图像
    processed_image = data_processor.preprocess_image(image)
    
    # 进行预测
    prediction = classifier.predict(processed_image)
    
    # 做出决策
    decision = decision_maker.make_decision(prediction)
    
    # 返回结果
    return jsonify(decision)

@app.route('/capture', methods=['POST'])
def capture():
    """从摄像头捕获图像并分类"""
    try:
        # 捕获图像
        image = data_processor.capture_image()
        
        # 预处理图像
        processed_image = data_processor.preprocess_image(image)
        
        # 进行预测
        prediction = classifier.predict(processed_image)
        
        # 做出决策
        decision = decision_maker.make_decision(prediction)
        
        # 将图像转换为base64编码
        _, buffer = cv2.imencode('.jpg', image)
        img_base64 = base64.b64encode(buffer).decode('utf-8')
        
        # 返回结果
        result = {
            'decision': decision,
            'image': f'data:image/jpeg;base64,{img_base64}'
        }
        return jsonify(result)
    
    except Exception as e:
        return jsonify({'error': str(e)}), 500

if __name__ == '__main__':
    app.run(debug=True)

5.5 数据管理模块

# data_manager.py
import sqlite3
import json
import os
from datetime import datetime

class DataManager:
    def __init__(self, db_path='garbage_classification.db'):
        """初始化数据管理器"""
        self.db_path = db_path
        self._init_db()
    
    def _init_db(self):
        """初始化数据库"""
        conn = sqlite3.connect(self.db_path)
        cursor = conn.cursor()
        
        # 创建分类记录表
        cursor.execute('''
        CREATE TABLE IF NOT EXISTS classification_records (
            id INTEGER PRIMARY KEY AUTOINCREMENT,
            image_path TEXT,
            predicted_class TEXT,
            confidence REAL,
            user_feedback TEXT,
            timestamp DATETIME
        )
        ''')
        
        # 创建垃圾类型信息表
        cursor.execute('''
        CREATE TABLE IF NOT EXISTS garbage_types (
            id INTEGER PRIMARY KEY AUTOINCREMENT,
            name TEXT UNIQUE,
            description TEXT,
            disposal_guide TEXT,
            examples TEXT
        )
        ''')
        
        conn.commit()
        conn.close()
        
        # 初始化垃圾类型数据
        self._init_garbage_types()
    
    def _init_garbage_types(self):
        """初始化垃圾类型数据"""
        garbage_types = [
            {
                'name': '可回收物',
                'description': '适宜回收和资源利用的垃圾',
                'disposal_guide': '投放至蓝色垃圾桶',
                'examples': '纸张、塑料、金属、玻璃等'
            },
            {
                'name': '厨余垃圾',
                'description': '易腐烂的生物质生活垃圾',
                'disposal_guide': '投放至绿色垃圾桶',
                'examples': '剩菜剩饭、果皮、蔬菜等'
            },
            {
                'name': '有害垃圾',
                'description': '对人体健康或自然环境造成直接或潜在危害的垃圾',
                'disposal_guide': '投放至红色垃圾桶',
                'examples': '废电池、废灯管、废药品、废油漆等'
            },
            {
                'name': '其他垃圾',
                'description': '除可回收物、厨余垃圾、有害垃圾以外的其他生活垃圾',
                'disposal_guide': '投放至灰色垃圾桶',
                'examples': '卫生纸、尘土、陶瓷碎片等'
            }
        ]
        
        conn = sqlite3.connect(self.db_path)
        cursor = conn.cursor()
        
        for garbage_type in garbage_types:
            cursor.execute('''
            INSERT OR IGNORE INTO garbage_types (name, description, disposal_guide, examples)
            VALUES (?, ?, ?, ?)
            ''', (
                garbage_type['name'],
                garbage_type['description'],
                garbage_type['disposal_guide'],
                garbage_type['examples']
            ))
        
        conn.commit()
        conn.close()
    
    def save_classification_record(self, image_path, predicted_class, confidence, user_feedback=None):
        """保存分类记录"""
        conn = sqlite3.connect(self.db_path)
        cursor = conn.cursor()
        
        cursor.execute('''
        INSERT INTO classification_records 
        (image_path, predicted_class, confidence, user_feedback, timestamp)
        VALUES (?, ?, ?, ?, ?)
        ''', (
            image_path,
            predicted_class,
            confidence,
            user_feedback,
            datetime.now().strftime('%Y-%m-%d %H:%M:%S')
        ))
        
        conn.commit()
        conn.close()
    
    def get_garbage_type_info(self, name):
        """获取垃圾类型信息"""
        conn = sqlite3.connect(self.db_path)
        cursor = conn.cursor()
        
        cursor.execute('''
        SELECT name, description, disposal_guide, examples
        FROM garbage_types
        WHERE name = ?
        ''', (name,))
        
        result = cursor.fetchone()
        conn.close()
        
        if result:
            return {
                'name': result[0],
                'description': result[1],
                'disposal_guide': result[2],
                'examples': result[3]
            }
        else:
            return None
    
    def get_classification_statistics(self):
        """获取分类统计信息"""
        conn = sqlite3.connect(self.db_path)
        cursor = conn.cursor()
        
        # 获取各类别的分类次数
        cursor.execute('''
        SELECT predicted_class, COUNT(*) as count
        FROM classification_records
        GROUP BY predicted_class
        ''')
        
        class_counts = {row[0]: row[1] for row in cursor.fetchall()}
        
        # 获取总分类次数
        cursor.execute('SELECT COUNT(*) FROM classification_records')
        total_count = cursor.fetchone()[0]
        
        # 获取平均置信度
        cursor.execute('SELECT AVG(confidence) FROM classification_records')
        avg_confidence = cursor.fetchone()[0]
        
        conn.close()
        
        return {
            'class_counts': class_counts,
            'total_count': total_count,
            'avg_confidence': avg_confidence
        }

6. 系统部署

6.1 环境配置

创建requirements.txt文件，列出项目依赖：

tensorflow==2.8.0
opencv-python==4.5.5.64
numpy==1.22.3
Flask==2.1.1
scikit-learn==1.0.2
matplotlib==3.5.1
seaborn==0.11.2
Pillow==9.1.0

6.2 项目结构

garbage_classification_system/
├── app.py                  # Flask应用主文件
├── data_processor.py       # 数据处理模块
├── image_classifier.py     # 图像分类模块
├── decision_maker.py       # 决策模块
├── data_manager.py         # 数据管理模块
├── models/                 # 模型文件夹
│   └── garbage_classifier.h5  # 训练好的模型
├── static/                 # 静态文件
│   ├── css/
│   │   └── style.css
│   ├── js/
│   │   └── main.js
│   └── images/
├── templates/              # HTML模板
│   ├── index.html
│   └── result.html
├── data/                   # 数据文件夹
│   ├── train/
│   ├── val/
│   └── test/
├── notebooks/              # Jupyter笔记本
│   ├── data_preparation.ipynb
│   └── model_training.ipynb
├── utils/                  # 工具函数
│   ├── __init__.py
│   └── visualization.py
├── requirements.txt        # 项目依赖
└── README.md               # 项目说明

6.3 启动服务

# 安装依赖
pip install -r requirements.txt

# 启动Flask应用
python app.py

7. 系统测试与优化

7.1 功能测试

对系统的各个功能进行测试，包括：

图像上传功能
摄像头捕获功能
垃圾分类准确性
用户界面交互

7.2 性能优化

模型优化：
- 模型量化
- 模型剪枝
- 使用TensorFlow Lite转换模型
接口优化：
- 添加缓存机制
- 异步处理请求
- 批量处理图像
UI优化：
- 响应式设计
- 加载动画
- 结果可视化

7.3 系统评估

对系统进行全面评估，包括：

分类准确率
响应时间
用户体验
资源占用

8. 未来展望

8.1 功能扩展

多模态识别：结合图像和文本描述进行更准确的分类
垃圾成分分析：分析垃圾的材质和成分
回收价值评估：评估可回收物的回收价值
移动端应用：开发移动应用，方便用户随时使用

8.2 技术改进

模型迭代：使用更先进的深度学习模型
边缘计算：将模型部署到边缘设备上
联邦学习：利用多个设备的数据进行模型训练
增强现实：结合AR技术提供更直观的分类指导

9. 总结

本文详细介绍了如何使用Python开发一个智能垃圾分类识别系统，从系统设计、数据准备、模型训练到系统实现和部署，提供了完整的开发流程和代码示例。通过这个项目，我们不仅实现了垃圾的智能分类，也为环保事业贡献了一份力量。

参考资料

TensorFlow官方文档：https://www.tensorflow.org/
OpenCV官方文档：https://docs.opencv.org/
Flask官方文档：https://flask.palletsprojects.com/
《深度学习》，Ian Goodfellow等著
《Python计算机视觉编程》，Jan Erik Solem著