MFF-pytorch Project Tutorial

尚虹卿

Article link: https://blog.csdn.net/gitblog_00906/article/details/142805658

MFF-pytorch: Motion Fused Frames implementation in PyTorch, with code and pretrained models. Project repository: https://gitcode.com/gh_mirrors/mf/MFF-pytorch

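1. Project Overview

MFF-pytorch is a PyTorch implementation of Motion Fused Frames (MFFs), a data-level fusion strategy intended to improve hand gesture recognition accuracy. It implements the paper "Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition" and provides both code and pretrained models.

Main features

Data-level fusion: motion and color information are fused at the input to improve hand gesture recognition accuracy.
Pretrained models: pretrained models are provided so you can get started quickly and verify the results.
Multiple datasets: Jester, the NVIDIA Dynamic Hand Gesture dataset, and the ChaLearn LAP IsoGD dataset are supported.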
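
To make the idea of data-level fusion concrete, here is a minimal sketch in plain Python. It assumes each optical flow frame has 2 channels (horizontal and vertical flow) and the color frame has 3 channels (R, G, B), and that fusion simply stacks all channels into one input. The helper names `make_channel` and `fuse_frames` and the channel ordering are illustrative assumptions, not the project's actual code.

```python
# Hypothetical sketch: a frame is a list of 2-D channels, and each channel
# is a list of rows. Names and channel ordering are illustrative only.

def make_channel(height, width, value):
    """Build one height x width channel filled with a constant value."""
    return [[value] * width for _ in range(height)]


def fuse_frames(flow_frames, rgb_frame):
    """Stack the channels of all flow frames, then the RGB channels.

    flow_frames: list of frames, each with 2 channels (x and y flow).
    rgb_frame:   one frame with 3 channels (R, G, B).
    Returns a single fused frame as a flat list of channels.
    """
    fused = []
    for flow in flow_frames:
        if len(flow) != 2:
            raise ValueError("each optical flow frame needs 2 channels")
        fused.extend(flow)
    if len(rgb_frame) != 3:
        raise ValueError("the color frame needs 3 channels")
    fused.extend(rgb_frame)
    return fused


# 3 flow frames + 1 color frame, matching the 3f1c naming used later on
flows = [[make_channel(4, 4, 0.0), make_channel(4, 4, 0.0)] for _ in range(3)]
rgb = [make_channel(4, 4, 1.0) for _ in range(3)]
mff = fuse_frames(flows, rgb)
print(len(mff))  # 3 * 2 + 3 = 9 channels
```

Under these assumptions, a network consuming such a fused frame needs a first layer that accepts 9 input channels instead of the usual 3.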

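2. Quick Start

2.1 Environment Setup

First, make sure Python 3.7.4 and PyTorch 1.5.0 are installed. You can create a virtual environment with Conda and install the dependencies:

# Clone the project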
git clone https://github.com/okankop/MFF-pytorch.git
cd MFF-pytorch

# Create a virtual environment and install the dependencies
conda create -n MFF python=3.7.4
conda activate MFF
pip install -r requirements.txt
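
After the environment is set up, a quick sanity check can confirm that the interpreter version is recent enough and that PyTorch is discoverable. The sketch below uses only the standard library; the function name `check_environment` and the minimum version tuple are assumptions chosen for illustration.

```python
import importlib.util
import sys


def check_environment(min_python=(3, 7)):
    """Report the interpreter version and whether torch can be found."""
    return {
        "python": "%d.%d.%d" % sys.version_info[:3],
        "python_ok": sys.version_info[:2] >= min_python,
        # find_spec only locates the package; it does not import it
        "torch_installed": importlib.util.find_spec("torch") is not None,
    }


info = check_environment()
print(info)
```

Run it inside the activated environment; if `torch_installed` is false, the dependency installation step did not complete in this environment.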

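2.2 Data Preparation

Download the Jester dataset, the NVIDIA Dynamic Hand Gesture dataset, or the ChaLearn LAP IsoGD dataset and extract it into a single folder. Then run process_dataset.py to generate the index files for the training, validation, and test splits.

# Assume the dataset is located at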
~/MFF-pytorch/datasets/jester/

# Generate the index files
python process_dataset.py
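
The exact format written by process_dataset.py is not described here. As a rough illustration of what an index file for frame folders can look like, the sketch below emits one line per video in the form `folder frame_count class_index`. That line format, the function name `build_index_lines`, and the sample labels are all assumptions, so check the script itself for the real layout.

```python
import os
import tempfile


def build_index_lines(frames_root, labels, categories):
    """Return one 'folder frame_count class_index' line per video folder.

    frames_root: directory containing one sub-folder of frames per video.
    labels:      dict mapping folder name -> category name.
    categories:  ordered list of category names (position = class index).
    """
    class_index = {name: i for i, name in enumerate(categories)}
    lines = []
    for folder in sorted(labels):
        folder_path = os.path.join(frames_root, folder)
        if not os.path.isdir(folder_path):
            continue  # skip videos whose frames are missing
        num_frames = len(os.listdir(folder_path))
        lines.append("%s %d %d" % (folder, num_frames, class_index[labels[folder]]))
    return lines


# Demo on a throwaway directory with two tiny fake videos
with tempfile.TemporaryDirectory() as root:
    for name, count in (("1", 3), ("2", 5)):
        os.makedirs(os.path.join(root, name))
        for i in range(count):
            open(os.path.join(root, name, "%05d.jpg" % i), "w").close()
    demo_lines = build_index_lines(
        root,
        {"1": "Swiping Left", "2": "Swiping Right"},
        ["Swiping Left", "Swiping Right"],
    )
print(demo_lines)
```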

2.3 Model Training

The following command trains a 4-segment network that uses 3 optical flow frames and 1 color frame per segment (the 4-MFFs-3f1c architecture):

python main.py jester RGBFlow --arch BNInception --num_segments 4 \
--consensus_type MLP --num_motion 3 --batch-size 32
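
To see how `--num_segments 4` and `--num_motion 3` could translate into frame indices, here is a simplified sketch. It assumes the video is split into equal segments, the center frame of each segment is taken as the color frame, and the flow frames are the ones immediately before it. The project's real sampling may differ (for example, random offsets during training); `sample_segment_indices` and `motion_window` are hypothetical helpers.

```python
def sample_segment_indices(num_frames, num_segments):
    """Pick the center frame index of each equal-length segment."""
    if num_frames < num_segments:
        raise ValueError("video is shorter than the number of segments")
    segment_length = num_frames / num_segments
    return [int(segment_length * (i + 0.5)) for i in range(num_segments)]


def motion_window(anchor, num_motion):
    """Indices of the flow frames preceding the anchor, clamped at 0."""
    return [max(anchor - offset, 0) for offset in range(num_motion, 0, -1)]


anchors = sample_segment_indices(num_frames=36, num_segments=4)
print(anchors)                       # [4, 13, 22, 31]
print(motion_window(anchors[1], 3))  # [10, 11, 12]
```

Each anchor together with its motion window corresponds to one fused frame, so a 36-frame clip yields 4 such inputs under these assumptions.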

2.4 Model Testing

Test with a pretrained model:

python test_models.py jester RGBFlow pretrained_models/MFF_jester_RGBFlow_BNInception_segment4_3f1c_best.pth.tar \
--arch BNInception --consensus_type MLP --test_crops 1 --num_motion 3 --test_segments 4
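
Evaluation of a classifier like this is usually summarized as top-k accuracy. The following is a generic sketch of that metric in plain Python, not the code used by test_models.py; the function name `topk_accuracy` and the toy scores are made up for illustration.

```python
def topk_accuracy(scores, targets, k=1):
    """Fraction of samples whose true class is among the k highest scores."""
    if not targets or len(scores) != len(targets):
        raise ValueError("scores and targets must be non-empty and aligned")
    hits = 0
    for row, target in zip(scores, targets):
        # Class indices sorted from highest to lowest score
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if target in ranked[:k]:
            hits += 1
    return hits / len(targets)


scores = [[0.1, 0.7, 0.2], [0.5, 0.3, 0.2], [0.2, 0.3, 0.5]]
targets = [1, 1, 2]
top1 = topk_accuracy(scores, targets, k=1)  # 2 of 3 samples correct
top2 = topk_accuracy(scores, targets, k=2)  # all 3 within the top 2
print(top1, top2)
```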