基于python实现深度学习的人脸识别含报告-CSDN博客

本文链接：https://blog.csdn.net/2501_90824910/article/details/147927220

基于python实现深度学习的人脸识别含报告

在这里插入图片描述

文章目录

基于Python的深度学习人脸识别系统实现报告

基于Python的深度学习人脸识别系统实现报告

一、项目概述

本项目使用深度学习技术实现了一个简单的人脸识别系统。该系统基于Python实现，主要使用了以下技术：

OpenCV：用于图像处理和人脸检测
face_recognition库：用于人脸特征提取和识别
深度学习中的卷积神经网络（CNN）：用于人脸特征编码

系统功能包括：

人脸检测和定位
人脸特征编码
人脸匹配和识别

二、开发环境

Python 3.8
OpenCV 4.5.5
face_recognition 1.3.0
numpy 1.21.5

三、系统设计

1. 系统架构

人脸图像输入 -> 人脸检测 -> 特征编码 -> 人脸匹配 -> 输出识别结果

2. 关键技术

(1) 人脸检测

使用OpenCV的Haar级联分类器进行人脸检测，确定图像中人脸的位置。

(2) 特征编码

使用face_recognition库的face_encodings方法对检测到的人脸进行特征编码。该方法内部使用了深度卷积神经网络来生成128维的人脸特征向量。

(3) 人脸匹配

通过计算两个人脸特征向量之间的欧氏距离来判断是否为同一人，通常当距离小于0.6时认为是同一个人。
在这里插入图片描述

四、代码实现

import face_recognition
import cv2
import os
import numpy as np

class FaceRecognitionSystem:
    def __init__(self):
        # 已知人脸数据库
        self.known_faces = []
        self.known_names = []
        
    def load_known_faces(self, directory):
        """加载已知人脸数据库"""
        for filename in os.listdir(directory):
            if filename.endswith(".jpg") or filename.endswith(".png"):
                name = os.path.splitext(filename)[0]
                image_path = os.path.join(directory, filename)
                image = face_recognition.load_image_file(image_path)
                
                # 提取人脸特征编码
                face_encoding = face_recognition.face_encodings(image)[0]
                
                self.known_faces.append(face_encoding)
                self.known_names.append(name)
    
    def recognize_face(self, unknown_image_path):
        """识别人脸"""
        # 加载待识别的图片
        unknown_image = face_recognition.load_image_file(unknown_image_path)
        
        # 检测图片中的人脸
        face_locations = face_recognition.face_locations(unknown_image)
        face_encodings = face_recognition.face_encodings(unknown_image, face_locations)
        
        # 将图片转换为BGR格式以便用OpenCV显示
        pil_image = cv2.cvtColor(unknown_image, cv2.COLOR_RGB2BGR)
        
        # 遍历检测到的每个人脸
        for (top, right, bottom, left), face_encoding in zip(face_locations, face_encodings):
            # 计算与已知人脸的匹配情况
            matches = face_recognition.compare_faces(self.known_faces, face_encoding)
            name = "Unknown"
            
            # 计算与已知人脸的距离
            face_distances = face_recognition.face_distance(self.known_faces, face_encoding)
            if len(face_distances) > 0:
                best_match_index = np.argmin(face_distances)
                if face_distances[best_match_index] < 0.6:  # 距离阈值
                    name = self.known_names[best_match_index]
            
            # 在图片上绘制人脸框和名字
            cv2.rectangle(pil_image, (left, top), (right, bottom), (0, 255, 0), 2)
            cv2.rectangle(pil_image, (left, bottom - 35), (right, bottom), (0, 255, 0), cv2.FILLED)
            cv2.putText(pil_image, name, (left + 6, bottom - 6), cv2.FONT_HERSHEY_DUPLEX, 0.8, (255, 255, 255), 1)
        
        # 显示结果
        cv2.imshow('Face Recognition Result', pil_image)
        cv2.waitKey(0)
        cv2.destroyAllWindows()
    
    def webcam_face_recognition(self):
        """实时摄像头人脸识别"""
        video_capture = cv2.VideoCapture(0)
        
        while True:
            ret, frame = video_capture.read()
            
            # 缩小帧大小以加快处理速度
            small_frame = cv2.resize(frame, (0, 0), fx=0.25, fy=0.25)
            
            # 转换颜色空间
            rgb_small_frame = cv2.cvtColor(small_frame, cv2.COLOR_BGR2RGB)
            
            # 检测人脸
            face_locations = face_recognition.face_locations(rgb_small_frame)
            face_encodings = face_recognition.face_encodings(rgb_small_frame, face_locations)
            
            face_names = []
            
            for face_encoding in face_encodings:
                matches = face_recognition.compare_faces(self.known_faces, face_encoding)
                name = "Unknown"
                
                face_distances = face_recognition.face_distance(self.known_faces, face_encoding)
                if len(face_distances) > 0:
                    best_match_index = np.argmin(face_distances)
                    if face_distances[best_match_index] < 0.6:
                        name = self.known_names[best_match_index]
                
                face_names.append(name)
            
            # 显示结果
            for (top, right, bottom, left), name in zip(face_locations, face_names):
                # 调整位置以适应缩放后的图像
                top *= 4
                right *= 4
                bottom *= 4
                left *= 4
                
                # 绘制人脸框
                cv2.rectangle(frame, (left, top), (right, bottom), (0, 255, 0), 2)
                
                # 绘制名字背景和文字
                cv2.rectangle(frame, (left, bottom - 35), (right, bottom), (0, 255, 0), cv2.FILLED)
                cv2.putText(frame, name, (left + 6, bottom - 6), 
                           cv2.FONT_HERSHEY_DUPLEX, 0.8, (255, 255, 255), 1)
            
            # 显示结果帧
            cv2.imshow('Webcam Face Recognition', frame)
            
            # 按q键退出
            if cv2.waitKey(1) & 0xFF == ord('q'):
                break
        
        # 释放资源
        video_capture.release()
        cv2.destroyAllWindows()

if __name__ == "__main__":
    frs = FaceRecognitionSystem()
    
    # 加载已知人脸数据库（请确保有对应的文件夹和图片）
    frs.load_known_faces("known_faces")
    
    # 测试单张图片识别
    frs.recognize_face("test_images/test1.jpg")
    
    # 开启摄像头实时识别
    # frs.webcam_face_recognition()