

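1. Environment setup (required)

(1) VS2019 [installation not covered]
(2) Windows 10 [installation not covered]
(3) CUDA 10.2 [installation not covered]
    Make sure that running nvcc --version reports the installed CUDA release. [screenshot]
(4) Python 3.7 [installation not covered]
(5) PyTorch 1.8
(6) torchvision 0.9.0
(7) torchaudio 0.8.0
(8) cudatoolkit=11.1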
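The CUDA requirement above can also be checked from a script rather than by eye. The following is a minimal sketch, assuming `nvcc --version` prints a line containing `release X.Y`; the helper names `parse_nvcc_release` and `installed_nvcc_release` are hypothetical and not part of any toolkit.

```python
import re
import shutil
import subprocess


def parse_nvcc_release(output):
    """Return (major, minor) found after 'release' in nvcc output, or None."""
    match = re.search(r"release\s+(\d+)\.(\d+)", output)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def installed_nvcc_release():
    """Run nvcc if it is on PATH and return its release tuple, else None."""
    if shutil.which("nvcc") is None:
        return None
    result = subprocess.run(
        ["nvcc", "--version"],
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        universal_newlines=True,  # text mode, also works on Python 3.7
    )
    return parse_nvcc_release(result.stdout)


if __name__ == "__main__":
    print("nvcc release:", installed_nvcc_release())
```

A result of `None` means nvcc was not found on PATH or its output did not contain a release number.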

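2. Installation steps

2.1 Install PyTorch

(1) Activate the virtual environment

conda activate mmcv

(2) Install PyTorch

conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge

(3) After installation, verify it; the installation succeeded if the check prints the expected version information. [screenshot]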
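One way to run that verification is a short script. This is a sketch only, assuming it is run inside the activated environment; the function name `describe_torch` is hypothetical, and the values it reports depend on the machine.

```python
import importlib


def describe_torch():
    """Return a dict describing the PyTorch install, or None if torch is missing."""
    try:
        torch = importlib.import_module("torch")
    except ImportError:
        return None
    return {
        "torch": torch.__version__,
        "cuda_build": torch.version.cuda,  # CUDA version this torch build targets
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    info = describe_torch()
    if info is None:
        print("torch is not importable in this environment")
    else:
        print(info)
```

If `cuda_available` is False even though a GPU is present, the driver or the cudatoolkit build is the first thing to check.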


2.2 Install cocoapi


git clone


cd coco/PythonAPI
python setup.py build_ext --inplace

Error encountered: No attribute 'filelist'
Fix: pip install setuptools==59.5.0
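To confirm that the pin took effect, the installed setuptools version can be compared with 59.5.0. A minimal sketch follows; `version_tuple` and `is_pinned` are hypothetical helpers written for this illustration, and the comparison ignores non-numeric suffixes such as `.post1`.

```python
def version_tuple(version):
    """Convert '59.5.0' into (59, 5, 0), stopping at the first non-numeric part."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def is_pinned(installed, pinned="59.5.0"):
    """True when the installed version matches the pinned one numerically."""
    return version_tuple(installed) == version_tuple(pinned)


if __name__ == "__main__":
    import setuptools

    print(setuptools.__version__, "pinned:", is_pinned(setuptools.__version__))
```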

python setup.py build_ext install


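2.3 Install fvcore

(2) Build from source

python setup.py build --force develop

(3) After installation, verify it; the installation succeeded if the check matches the expected output. [screenshot]
    Note that all files generated by the build stay inside fvcore-master. Unless you cd into that root directory (SlowFast/fvcore), import fvcore will fail. [screenshot of the folder after building]


    Other libraries import the package as fvcore, so the import fails whenever the current directory is not that root. To fix this, place the whole fvcore-master folder in site-packages (either the site-packages of the corresponding conda env or the global one), name it fvcore-master, and copy just fvcore and fvcore.egg-info to the same level as fvcore-master. After that, import fvcore works from any location.
    Besides fvcore and fvcore.egg-info, the other folders under fvcore-master contain installation modules and other tools. If import fvcore reports that something is missing, first check whether that tool was left out of fvcore.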
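When an import works in one directory but not another, it helps to see which file Python would actually load. The sketch below uses only the standard library; `module_origin` is a hypothetical helper name, and the paths it prints depend on the local environment.

```python
import importlib.util
import site


def module_origin(name):
    """Return the file a top-level module would be imported from, or None."""
    try:
        spec = importlib.util.find_spec(name)
    except (ImportError, ValueError):
        return None
    if spec is None:
        return None
    if spec.origin is not None:
        return spec.origin
    # Namespace packages have no single file; report their search locations.
    locations = spec.submodule_search_locations
    return ", ".join(locations) if locations else None


if __name__ == "__main__":
    # getsitepackages may be missing in some older virtualenv setups.
    get_dirs = getattr(site, "getsitepackages", None)
    print("site-packages:", get_dirs() if get_dirs else "unknown")
    print("fvcore resolves to:", module_origin("fvcore"))
```

Running it once from the fvcore root and once from another directory shows whether the package is only found through the current working directory.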

2.4 Install detectron2



git clone

    A problem you may hit during the build: nvcc fatal : unknown option '-generate-dependencies-with-compile'
    Fix: edit site-packages\torch\include\torch\csrc\jit\runtime\argument_spec.h; the change is shown in the figure below. [figure]







call "C:\Program Files (x86)\Microsoft Visual Studio\2019\Community\VC\Auxiliary\Build\vcvarsall.bat" amd64 -vcvars_ver=14.29



cl -Bv


cd detectron2
python setup.py build --force develop

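    Besides detectron2 and detectron2.egg-info, the other folders under detectron2-master contain installation modules and other tools. If import detectron2 reports that something is missing, first check whether that tool was left out of detectron2.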

2.5 Install slowfast

git clone



python setup.py build develop

2.6 Install pytorchvideo
    Installing directly with pip install pytorchvideo is not recommended. I did that at first and ran into import problems; installing from source as follows avoids them.


git clone


cd pytorchvideo
pip install -e .

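    Note the dot after install -e; it must not be omitted. The figure below shows the installation instructions from the source repository. [figure]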

2.7 Update pytorch-image-models/timm
    Error: ImportError: cannot import name 'Mlp' from 'timm.models.layers'
    Fix: git clone the entire timm source repository, then copy the timm folder (only timm; see the folder indicated in the figure below) into AppData\Roaming\Python\Python37\site-packages\timm-0.1.20-py3.7.egg [figure]
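After replacing the folder, a quick check shows whether the name that failed can now be imported. This is a generic sketch, not timm-specific code; `has_attribute` is a hypothetical helper that simply imports a module and looks for an attribute.

```python
import importlib


def has_attribute(module_name, attr):
    """True if the module imports cleanly and exposes the given name."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return False
    return hasattr(module, attr)


if __name__ == "__main__":
    # The module and name come from the ImportError message above.
    print("Mlp available:", has_attribute("timm.models.layers", "Mlp"))
```

The same helper applies to any other "cannot import name" error by passing the module and name from the message.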


2.8 Install win32gui
    Error: ModuleNotFoundError: No module named 'win32con'
    Fix: pip install win32gui

3. Testing

3.1 Download the weights


3.2 Create the label file

{"bend/bow (at the waist)": 0, "crawl": 1, "crouch/kneel": 2, "dance": 3, "fall down": 4, "get up": 5, "jump/leap": 6, "lie/sleep": 7, "martial art": 8, "run/jog": 9, "sit": 10, "stand": 11, "swim": 12, "walk": 13, "answer phone": 14, "brush teeth": 15, "carry/hold (an object)": 16, "catch (an object)": 17, "chop": 18, "climb (e.g., a mountain)": 19, "clink glass": 20, "close (e.g., a door, a box)": 21, "cook": 22, "cut": 23, "dig": 24, "dress/put on clothing": 25, "drink": 26, "drive (e.g., a car, a truck)": 27, "eat": 28, "enter": 29, "exit": 30, "extract": 31, "fishing": 32, "hit (an object)": 33, "kick (an object)": 34, "lift/pick up": 35, "listen (e.g., to music)": 36, "open (e.g., a window, a car door)": 37, "paint": 38, "play board game": 39, "play musical instrument": 40, "play with pets": 41, "point to (an object)": 42, "press": 43, "pull (an object)": 44, "push (an object)": 45, "put down": 46, "read": 47, "ride (e.g., a bike, a car, a horse)": 48, "row boat": 49, "sail boat": 50, "shoot": 51, "shovel": 52, "smoke": 53, "stir": 54, "take a photo": 55, "text on/look at a cellphone": 56, "throw": 57, "touch (an object)": 58, "turn (e.g., a screwdriver)": 59, "watch (e.g., TV)": 60, "work on a computer": 61, "write": 62, "fight/hit (a person)": 63, "give/serve (an object) to (a person)": 64, "grab (a person)": 65, "hand clap": 66, "hand shake": 67, "hand wave": 68, "hug (a person)": 69, "kick (a person)": 70, "kiss (a person)": 71, "lift (a person)": 72, "listen to (a person)": 73, "play with kids": 74, "push (another person)": 75, "sing to (e.g., self, a person, a group)": 76, "take (an object) from (a person)": 77, "talk to (e.g., self, a person, a group)": 78, "watch (a person)": 79}
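Before saving a mapping like this to disk, a sanity check catches accidental gaps or duplicate ids. The sketch below assumes the ids should form a contiguous range starting at 0, as they do in the mapping above; whether SlowFast itself enforces this is not stated in this guide, and both function names are hypothetical.

```python
import json


def validate_label_map(label_map):
    """Check that ids are exactly 0..N-1 and return N; raise ValueError otherwise."""
    ids = sorted(label_map.values())
    if ids != list(range(len(ids))):
        raise ValueError("label ids must be unique and contiguous from 0")
    return len(ids)


def save_label_map(label_map, path):
    """Validate the mapping, then write it as JSON to the given path."""
    validate_label_map(label_map)
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(label_map, handle, ensure_ascii=False)


if __name__ == "__main__":
    sample = {"bend/bow (at the waist)": 0, "crawl": 1, "crouch/kneel": 2}
    print("classes:", validate_label_map(sample))
```

For the demo, the full mapping would be written to the path used as LABEL_FILE_PATH, for example test_data/action.json.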

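3.3 Edit the configuration file
● BATCH_SIZE >>> set it to 1 first to avoid running out of resources, then increase it gradually
● MODEL_VIS >>> comment out
● TOPK: 2 >>> comment out
● LABEL_FILE_PATH: "test_data/action.json" >>> set to the path where the label file is stored
● INPUT_VIDEO: "test_data/100.mp4" >>> add this field and set it to the path of the video to run detection on
● OUTPUT_FILE: "test_data/result.mp4" >>> add this field and set it to the path where the result video is saved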

TRAIN:
  ENABLE: False
  DATASET: ava
  CHECKPOINT_FILE_PATH: test_data/SLOWFAST_32x2_R101_50_50.pkl  # path to the pretrained model
DETECTION:
  ENABLE: True
  ALIGNED: False
AVA:
  BGR: False
  TEST_PREDICT_BOX_LISTS: ["person_box_67091280_iou90/ava_detection_val_boxes_and_labels.csv"]
SLOWFAST:
  ALPHA: 4
RESNET:
  DEPTH: 101
  TRANS_FUNC: bottleneck_transform
  STRIDE_1X1: False
  NUM_BLOCK_TEMP_KERNEL: [[3, 3], [4, 4], [6, 6], [3, 3]]
  SPATIAL_DILATIONS: [[1, 1], [1, 1], [1, 1], [2, 2]]
  SPATIAL_STRIDES: [[1, 1], [2, 2], [2, 2], [1, 1]]
NONLOCAL:
  LOCATION: [[[], []], [[], []], [[6, 13, 20], []], [[], []]]
  GROUP: [[1, 1], [1, 1], [1, 1], [1, 1]]
  INSTANTIATION: dot_product
  POOL: [[[2, 2, 2], [2, 2, 2]], [[2, 2, 2], [2, 2, 2]], [[2, 2, 2], [2, 2, 2]], [[2, 2, 2], [2, 2, 2]]]
MODEL:
  ARCH: slowfast
  MODEL_NAME: SlowFast
  LOSS_FUNC: bce
  HEAD_ACT: sigmoid
TEST:
  ENABLE: False
  DATASET: ava
# TENSORBOARD:
#   MODEL_VIS:
#     TOPK: 2
DEMO:
  ENABLE: True
  LABEL_FILE_PATH: "test_data/action.json"  # Add local label file path here.
  INPUT_VIDEO: "test_data/100.mp4"
  OUTPUT_FILE: "test_data/result.mp4"
  DETECTRON2_CFG: "COCO-Detection/faster_rcnn_R_50_FPN_3x.yaml"
  DETECTRON2_WEIGHTS: detectron2://COCO-Detection/faster_rcnn_R_50_FPN_3x/137849458/model_final_280758.pkl
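Most demo failures at this point come from a path in the config that does not exist. The sketch below checks the local input files before launching the demo; `missing_files` is a hypothetical helper, and the paths are the ones from the config above, assumed to be relative to the SlowFast root.

```python
import os


def missing_files(paths, base_dir="."):
    """Return the names whose path does not point to an existing file."""
    missing = []
    for name, rel_path in paths.items():
        if not os.path.isfile(os.path.join(base_dir, rel_path)):
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Values copied from the config; run this from the SlowFast root directory.
    demo_inputs = {
        "CHECKPOINT_FILE_PATH": "test_data/SLOWFAST_32x2_R101_50_50.pkl",
        "LABEL_FILE_PATH": "test_data/action.json",
        "INPUT_VIDEO": "test_data/100.mp4",
    }
    print("missing:", missing_files(demo_inputs))
```

OUTPUT_FILE is left out on purpose because it is created by the run rather than read by it.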

3.4 Run detection

conda activate mmcv
cd SlowFast
python tools/ --cfg demo/AVA/SLOWFAST_32x2_R101_50_50.yaml

RuntimeError: COCO-Detection/faster_rcnn_R_50_FPN_3x.yaml not available in Model Zoo!

    Fix: check whether site-packages\detectron2\model_zoo\configs\COCO-Detection\faster_rcnn_R_50_FPN_3x.yaml exists. If it does not, copy everything under detectron2-master/configs into site-packages\detectron2\model_zoo\configs and run the test again.
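The copy can be done in a file manager, or scripted so that it is repeatable. The following is a minimal sketch using only the standard library; `copy_tree` is a hypothetical helper, and the source and destination paths in the usage comment are placeholders that must be adjusted to the local install.

```python
import os
import shutil


def copy_tree(src_dir, dst_dir):
    """Copy every file under src_dir into dst_dir, keeping the sub-folder layout.

    Existing files with the same name are overwritten. Returns the file count.
    """
    copied = 0
    for root, _dirs, files in os.walk(src_dir):
        rel = os.path.relpath(root, src_dir)
        target = os.path.normpath(os.path.join(dst_dir, rel))
        os.makedirs(target, exist_ok=True)
        for name in files:
            shutil.copy2(os.path.join(root, name), os.path.join(target, name))
            copied += 1
    return copied


# Example call (placeholder paths, adjust before running):
# copy_tree(r"detectron2-master\configs",
#           r"<site-packages>\detectron2\model_zoo\configs")
```

A manual walk is used instead of `shutil.copytree` so the sketch also works on Python 3.7 when the destination folder already exists.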

