https://github.com/wei-tim/YOWO
PyTorch implementation of the article "You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization". The repositry contains code for real-time spatiotemporal action localization with PyTorch on AVA, UCF101-24 and JHMDB datasets!
Running on a text video
- You can run AVA pretrained model on any test video with the following code:
python test_video_ava.py --cfg cfg/ava.yaml