嘴型融合 wav2lip 升级版

迷途小书童的Note

于 2022-09-15 14:34:14 发布

阅读量9.5k

点赞数 5

文章标签：人工智能 tensorflow python pip 深度学习

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/djstavaV/article/details/126882601

版权

环境

windows 10 64bit
wav2lip-hq
pytorch 1.12.1+cu113

前言

前面的博文嘴型同步模型Wav2Lip，介绍了嘴型同步模型，本篇介绍的是 wav2lip 的高清版，在原有基础上，使用了超分辨率图像和人脸分割技术，来提升整体效果。

实践

首先，拉取源码

git clone https://github.com/Markfryazino/wav2lip-hq.git
cd wav2lip-hq

# 创建个新的虚拟环境
conda create -n wav2liphq python=3.8
conda activate wav2liphq

# 安装torch
pip3 install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu113

# 安装其它依赖库，将其中的torch、torchvision注释掉，前面已经安装了gpu版本
pip install -r requirements.txt

然后去下载模型，这里需要3个模型，第一个下载地址：https://drive.google.com/file/d/1aB-jqBikcZPJnFrJXWUEpvF2RFCuerSe/view?usp=sharing ，下载后拷贝到目录 checkpoints 下面；第二个模型是人脸的模型，下载地址：https://www.adrianbulat.com/downloads/python-fan/s3fd-619a316812.pth，下载后拷贝到 face_detection/detection/sfd 目录下，并重命名为 s3fd.pth；第三个是脸部的 segmentation 模型，下载地址：https://drive.google.com/open?id=154JgKpzCPW82qINcVieuPH3fZ2e0P812，拷贝到 checkpoints 目录下，并重命名为 face_segmentation.pth

最后，我们准备一个音频文件和一个视频文件来进行测试，执行命令

python.exe inference.py --checkpoint_path checkpoints\wav2lip_gan.pth --segmentation_path checkpoints\face_segmentation.pth --sr_path checkpoints\esrgan_yunying.pth --face test.mp4 --audio test.mp3 --outfile output.mp4

参考资料

https://github.com/Markfryazino/wav2lip-hq
https://github.com/zllrunning/face-parsing.PyTorch.git
https://github.com/xinntao/BasicSR.git
https://github.com/1adrianb/face-alignment
https://xugaoxiang.com/2021/03/05/wav2lip/

迷途小书童的Note

博客等级

码龄10年

425
原创

1175
点赞

5264
收藏

1426
粉丝

关注

私信

热门文章

分类专栏

PyQt5 14篇
Python实用模块推荐 9篇
Flask 8篇
Hexo博客教程 12篇
Python 26篇
人工智能 27篇
Linux 11篇
流媒体 3篇
Android 1篇

展开全部收起

最新评论

你想要的汽车ReID数据集
Peak@: 你好，你现在有没这个数据集
当YOLOv5碰上PyQt5 ...
zzw_N701: 视频的检测结果很糊
26.2k，收下这个FastAPI全栈模板！
远去的日子: 请问这个报错怎么解决啊？？ docker compose -f docker-compose.yml up -d [+] Running 4/4 ✔ Container full-stack-fastapi-template-db-1 Healthy 0.0s ✔ Container full-stack-fastapi-template-frontend-1 Running 0.0s ✔ Container full-stack-fastapi-template-adminer-1 Running 0.0s ✘ Container full-stack-fastapi-template-prestart-1 service "prestart" didn't complete successfully: exit 10.0s service "prestart" didn't complete successfully: exit 1
嘴型同步模型Wav2Lip
谨慎殷勤: 生成的结果分辨率很低怎么处理呀
手把手教你打造一个AI智能体
三只熊199: 前提是需要魔法吗？我上不了chatgpt 4o 是不是就不能生成了？

最新文章

目录

展开全部

收起

评论 7

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

打赏作者

迷途小书童的Note 请博主喝矿泉书！

¥1 ¥2 ¥4 ¥6 ¥10 ¥20

扫码支付：¥1

获取中

扫码支付

您的余额不足，请更换扫码支付或充值

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。