MacOS 下运行 GPT-SoVITS

wxl781227

已于 2024-08-16 21:06:29 修改

阅读量198

点赞数 3

文章标签： gpt gpt-sovits 声音克隆跨语言克隆效果极佳

于 2024-08-16 19:06:09 首次发布

本文链接：https://blog.csdn.net/wxl781227/article/details/141267339

版权

系统环境：

# 安装 ffmpeg
brew install ffmpeg
# 查看版本
ffmpeg -version

# 拉取项目代码
git clone --depth=1 https://github.com/RVC-Boss/GPT-SoVITS
cd GPT-SoVITS

# 安装好 Miniconda 之后，先创建一个虚拟环境：
conda create -n GPTSoVits python=3.9
conda activate GPTSoVits

# 安装依赖：
pip3 install -r requirements.txt
pip3 uninstall torch torchaudio
pip3 install --pre torch torchaudio --index-url https://download.pytorch.org/whl/nightly/cpu

# (可选)如果网络环境不好，可以考虑换源(比如清华源)：
pip3 install -i https://pypi.tuna.tsinghua.edu.cn/simple -r requirements.txt
pip3 uninstall torch torchaudio
pip3 install --pre torch torchaudio --index-url https://download.pytorch.org/whl/nightly/cpu

# 安装 huggingface-cli 用于和 huggingface hub 交互
pip3 install huggingface_hub
export HF_ENDPOINT=https://hf-mirror.com

# 下载模型, 由于模型文件较大，可能需要一段时间
# --local-dir-use-symlinks False 用于解决 macOS alias 文件的问题
# 会下载到 GPT_SoVITS/pretrained_models 文件夹下
huggingface-cli download --resume-download lj1995/GPT-SoVITS --local-dir GPT_SoVITS/pretrained_models --local-dir-use-symlinks False

#运行

python webui.py
#运行结果

Matplotlib is building the font cache; this may take a moment.

Downloading g2pw model...

Extracting g2pw model...

Running on local URL: http://0.0.0.0:9874

"/Users/wxl/anaconda3/envs/GPTSoVits/bin/python" tools/uvr5/webui.py "cpu" False 9873 False

操作简要说明：

1）UVR5界面不必须的，主要用于分离歌声与背景声音

2）指定待训练的声音文件路径：如：input/someone

3）依次执行声音拆分、降噪和ASR转写（自动标注）

4）标注界面也不是必须的（手工标注）

5）一键三连提取自监督特征和语义特征（第一步需要下载nltk_data，有可能会失败，要科学上网）

6）依次训练sovits和gpt模型

7）部署模型

8）推理

wxl781227

关注

3
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
MacOS 下运行 GPT-SoVITS

--local-dir-use-symlinks False 用于解决 macOS alias 文件的问题。# 安装 huggingface-cli 用于和 huggingface hub 交互。# 会下载到 GPT_SoVITS/pretrained_models 文件夹下。# 下载模型, 由于模型文件较大，可能需要一段时间。# 安装 ffmpeg。
复制链接

扫一扫