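1. Vocal separation
Option 1 (recommended): UVR 5.5
Option 2: RVC web (bundles both vocal separation and voice conversion, runs entirely from a web page, is feature-rich, and comes with video walkthroughs, so it suits beginners well)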
2. Setting up UVR 5.5
(vits-rvc) root@fangg-MS-7B78:/home/fangg/tts# conda create -n uvr python=3.9.1
Retrieving notices: ...working... done
Collecting package metadata (current_repodata.json): done
Solving environment: failed with repodata from current_repodata.json, will retry with next repodata source.
Collecting package metadata (repodata.json): done
Solving environment: done
==> WARNING: A newer version of conda exists. <==
current version: 23.3.1
latest version: 23.5.2
Please update conda by running
$ conda update -n base -c defaults conda
Or to minimize the number of packages updated during conda update use
conda install conda=23.5.2
## Package Plan ##
environment location: /home/fangg/tools/anacodna3/envs/uvr
added / updated specs:
- python=3.9.1
The following packages will be downloaded:
package | build
---------------------------|-----------------
pip-23.1.2 | py39h06a4308_0 2.5 MB
python-3.9.1 | hdb3f193_2 18.1 MB
setuptools-67.8.0 | py39h06a4308_0 1.0 MB
wheel-0.38.4 | py39h06a4308_0 64 KB
------------------------------------------------------------
Total: 21.7 MB
The following NEW packages will be INSTALLED:
_libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main
_openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu
ca-certificates pkgs/main/linux-64::ca-certificates-2023.05.30-h06a4308_0
ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.38-h1181459_1
libffi pkgs/main/linux-64::libffi-3.3-he6710b0_2
libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1
libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1
libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1
ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0
openssl pkgs/main/linux-64::openssl-1.1.1u-h7f8727e_0
pip pkgs/main/linux-64::pip-23.1.2-py39h06a4308_0
python pkgs/main/linux-64::python-3.9.1-hdb3f193_2
readline pkgs/main/linux-64::readline-8.2-h5eee18b_0
setuptools pkgs/main/linux-64::setuptools-67.8.0-py39h06a4308_0
sqlite pkgs/main/linux-64::sqlite-3.41.2-h5eee18b_0
tk pkgs/main/linux-64::tk-8.6.12-h1ccaba5_0
tzdata pkgs/main/noarch::tzdata-2023c-h04d1e81_0
wheel pkgs/main/linux-64::wheel-0.38.4-py39h06a4308_0
xz pkgs/main/linux-64::xz-5.4.2-h5eee18b_0
zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_0
Proceed ([y]/n)? y
Downloading and Extracting Packages
Preparing transaction: done
Verifying transaction: done
Executing transaction: done
#
# To activate this environment, use
#
# $ conda activate uvr
#
# To deactivate an active environment, use
#
# $ conda deactivate
(vits-rvc) root@fangg-MS-7B78:/home/fangg/tts# cd uvr-master
(vits-rvc) root@fangg-MS-7B78:/home/fangg/tts/uvr-master# conda activate uvr
(uvr) root@fangg-MS-7B78:/home/fangg/tts/uvr-master# pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116
Looking in indexes: https://pypi.org/simple, https://download.pytorch.org/whl/cu116
Collecting torch==1.13.1+cu116
Downloading https://download.pytorch.org/whl/cu116/torch-1.13.1%2Bcu116-cp39-cp39-linux_x86_64.whl (1977.9 MB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.0/2.0 GB 1.3 MB/s eta 0:00:00
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ReadTimeoutError("HTTPSConnectionPool(host='pypi.org', port=443): Read timed out. (read timeout=15)")': /simple/torchvision/
...
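The log above is cut off after the pypi.org timeout warning, so it does not show whether the install finished. A minimal sketch for checking the result from inside the uvr environment follows. The function name `describe_torch_install` is made up for this example; the torch attributes it reads (`torch.__version__`, `torch.version.cuda`, `torch.cuda.is_available()`) are standard PyTorch.

```python
import importlib.util
import sys


def describe_torch_install():
    """Report whether torch is importable and whether it can see a CUDA device.

    Hypothetical helper for this note, not part of UVR.
    """
    info = {
        "python": "%d.%d.%d" % sys.version_info[:3],
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "torch_version": None,
        "cuda_build": None,
        "cuda_available": False,
    }
    if info["torch_installed"]:
        # Imported lazily so the script still runs when torch is missing.
        import torch

        info["torch_version"] = torch.__version__
        # CUDA version the wheel was built against; None on CPU-only builds.
        info["cuda_build"] = torch.version.cuda
        info["cuda_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in describe_torch_install().items():
        print(f"{key}: {value}")
```

If the cu116 wheel from the command above installed correctly, torch_version should carry the +cu116 suffix. If cuda_available is False, that usually points to a driver problem or a CPU-only wheel rather than to UVR itself.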
(uvr) root@fangg-MS-7B78:/home/fangg/tts/uvr-master# pip install -r requirements.txt -i "https://pypi.tuna.tsinghua.edu.cn/simple/"
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple/
...
Successfully installed Cython-0.29.36 Dora-0.0.3 Pillow-9.3.0 PySoundFile-0.9.0.post1 PyYAML-6.0 aiohttp-3.8.4 aiosignal-1.3.1 altgraph-0.17.3 antlr4-python3-runtime-4.9.3 async-timeout-4.0.2 attrs-23.1.0 audioread-3.0.0 certifi-2022.12.7 cffi-1.15.1 coloredlogs-15.0.1 contourpy-1.1.0 cryptography-3.4.6 cycler-0.11.0 decorator-5.1.1 diffq-0.2.3 einops-0.6.0 flatbuffers-23.5.26 fonttools-4.41.0 frozenlist-1.4.0 fsspec-2023.6.0 future-0.18.2 humanfriendly-10.0 importlib-resources-6.0.0 joblib-1.3.1 julius-0.2.7 kiwisolver-1.4.4 kthread-0.2.3 librosa-0.9.2 lightning-utilities-0.9.0 llvmlite-0.39.1 matplotlib-3.7.2 mpmath-1.3.0 multidict-6.0.4 natsort-8.2.0 numba-0.56.4 numpy-1.23.4 omegaconf-2.2.3 onnx-1.14.0 onnxruntime-1.13.1 opencv-python-4.6.0.66 packaging-23.1 pandas-2.0.3 pip-22.2.2 platformdirs-3.9.1 playsound-1.3.0 pooch-1.7.0 protobuf-4.23.4 psutil-5.9.4 pycparser-2.21 pydub-0.25.1 pyglet-1.5.23 pyparsing-3.0.9 pyperclip-1.8.2 pyrubberband-0.3.0 python-dateutil-2.8.2 pytorch_lightning-2.0.0 pytz-2023.3 resampy-0.2.2 samplerate-0.1.0 scikit-learn-1.3.0 scipy-1.9.3 screeninfo-0.8.1 six-1.16.0 sklearn-0.0.post5 soundfile-0.11.0 soundstretch-1.2 sympy-1.12 threadpoolctl-3.2.0 torchmetrics-1.0.1 tqdm-4.65.0 tzdata-2023.3 urllib3-1.26.12 wget-3.2 yarl-1.9.2 zipp-3.16.2
WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv
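To confirm that the environment really matches the "Successfully installed" line, a few of the pinned versions can be compared against what is installed. This is only a sketch: the helper name `compare_versions` and the choice of which four packages to check are assumptions for illustration; the version numbers themselves are copied from the log above.

```python
from importlib import metadata

# A few versions copied from the "Successfully installed" line in the log.
# Which packages to check is an arbitrary choice for this example.
EXPECTED = {
    "librosa": "0.9.2",
    "numpy": "1.23.4",
    "onnxruntime": "1.13.1",
    "soundfile": "0.11.0",
}


def compare_versions(expected):
    """Return {package: (expected, installed_or_None)} for mismatches only."""
    mismatches = {}
    for name, wanted in expected.items():
        try:
            found = metadata.version(name)
        except metadata.PackageNotFoundError:
            found = None  # package is not installed in this environment
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    problems = compare_versions(EXPECTED)
    if not problems:
        print("all checked packages match the log")
    for name, (wanted, found) in problems.items():
        print(f"{name}: expected {wanted}, found {found}")
```

An empty result means the checked packages match the log; a `None` in the second position means the package is missing from the active environment.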
(uvr) root@fangg-MS-7B78:/home/fangg/tts/uvr-master# python UVR.py
playsound is relying on another python subprocess. Please use `pip install pygobject` if you want playsound to run more efficiently.
2023-07-18 07:25:41,200 - UVR BEGIN
No splash screen.
VR_Models
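1_HP-UVR keeps vocals (check Vocals Only; slight residual noise, barely usable)
2_HP-UVR keeps vocals (check Vocals Only; heavy electronic-sounding artifacts, unusable)
3_HP-Vocal-UVR keeps vocals (check Vocals Only; slight residual noise, barely usable)
4_HP-Vocal-UVR keeps vocals (check Vocals Only; heavy electronic-sounding artifacts, unusable)
6_HP-Karaoke-UVR keeps vocals (check Vocals Only; slight residual noise, barely usable)
9_HP2-UVR keeps vocals (check Vocals Only; slower to process, slight residual noise, barely usable)
12_SP-UVR-3B-44100 keeps vocals (check Vocals Only; fast to process, slight residual noise, barely usable)
16_SP-UVR-MID-44100-2 keeps vocals (check Vocals Only; fast to process, slight residual noise, barely usable)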
MDX_Net_Models
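UVR_MDXNET_3_9662 keeps vocals (check Vocals Only; fairly fast to process, slight residual noise, barely usable)
UVR-MDX-NET-Inst_3 keeps vocals (check Vocals Only; slight residual noise, barely usable)
UVR-MDX-NET-Inst_Main keeps vocals (check Vocals Only; relatively little noise, recommended)
UVR_MDXNET_Main keeps vocals (check Vocals Only; relatively little noise, clearer audio, strongly recommended)
UVR_MDXNET_9482 keeps vocals (check Vocals Only; fairly fast to process, faint traces of noise, barely usable)
kuielab_a_vocals keeps vocals (check Vocals Only; fairly fast to process, relatively little noise, recommended)
kuielab_b_vocals keeps vocals (check Vocals Only; fairly fast to process, relatively little noise, recommended)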
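Conclusion: UVR_MDXNET_Main, from MDX_Net_Models, produces the best vocal output (the audio is clear; it still carries a little noise, but the least of the models compared).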