Audio Vocal Separation

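This article shows how to use uvr 5.5 for vocal separation (rvc web is mentioned as an alternative that also handles voice conversion). It covers creating an environment with conda and installing the dependencies, then compares several vocal-processing models, including the HP-UVR family and MDX_Net_Models. Among the MDX_Net_Models, UVR_MDXNET_Main performed best: its vocal output was the clearest and had the least noise.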

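1. Vocal separation
Option 1 (recommended): uvr 5.5
Option 2 (bundles vocal separation and voice conversion, runs entirely from a web page, is feature-rich, and comes with video walkthroughs, so it suits beginners well): rvc web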

2. Working with uvr 5.5

(vits-rvc) root@fangg-MS-7B78:/home/fangg/tts# conda create -n uvr python=3.9.1
Retrieving notices: ...working... done
Collecting package metadata (current_repodata.json): done
Solving environment: failed with repodata from current_repodata.json, will retry with next repodata source.
Collecting package metadata (repodata.json): done
Solving environment: done


==> WARNING: A newer version of conda exists. <==
  current version: 23.3.1
  latest version: 23.5.2

Please update conda by running

    $ conda update -n base -c defaults conda

Or to minimize the number of packages updated during conda update use

     conda install conda=23.5.2



## Package Plan ##

  environment location: /home/fangg/tools/anacodna3/envs/uvr

  added / updated specs:
    - python=3.9.1


The following packages will be downloaded:

    package                    |            build
    ---------------------------|-----------------
    pip-23.1.2                 |   py39h06a4308_0         2.5 MB
    python-3.9.1               |       hdb3f193_2        18.1 MB
    setuptools-67.8.0          |   py39h06a4308_0         1.0 MB
    wheel-0.38.4               |   py39h06a4308_0          64 KB
    ------------------------------------------------------------
                                           Total:        21.7 MB

The following NEW packages will be INSTALLED:

  _libgcc_mutex      pkgs/main/linux-64::_libgcc_mutex-0.1-main 
  _openmp_mutex      pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 
  ca-certificates    pkgs/main/linux-64::ca-certificates-2023.05.30-h06a4308_0 
  ld_impl_linux-64   pkgs/main/linux-64::ld_impl_linux-64-2.38-h1181459_1 
  libffi             pkgs/main/linux-64::libffi-3.3-he6710b0_2 
  libgcc-ng          pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 
  libgomp            pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 
  libstdcxx-ng       pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 
  ncurses            pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 
  openssl            pkgs/main/linux-64::openssl-1.1.1u-h7f8727e_0 
  pip                pkgs/main/linux-64::pip-23.1.2-py39h06a4308_0 
  python             pkgs/main/linux-64::python-3.9.1-hdb3f193_2 
  readline           pkgs/main/linux-64::readline-8.2-h5eee18b_0 
  setuptools         pkgs/main/linux-64::setuptools-67.8.0-py39h06a4308_0 
  sqlite             pkgs/main/linux-64::sqlite-3.41.2-h5eee18b_0 
  tk                 pkgs/main/linux-64::tk-8.6.12-h1ccaba5_0 
  tzdata             pkgs/main/noarch::tzdata-2023c-h04d1e81_0 
  wheel              pkgs/main/linux-64::wheel-0.38.4-py39h06a4308_0 
  xz                 pkgs/main/linux-64::xz-5.4.2-h5eee18b_0 
  zlib               pkgs/main/linux-64::zlib-1.2.13-h5eee18b_0 


Proceed ([y]/n)? y


Downloading and Extracting Packages
                                                                                                                           
Preparing transaction: done                                                                                                
Verifying transaction: done                                                                                                
Executing transaction: done                                                                                                
#
# To activate this environment, use
#
#     $ conda activate uvr
#
# To deactivate an active environment, use
#
#     $ conda deactivate

(vits-rvc) root@fangg-MS-7B78:/home/fangg/tts# cd uvr-master
(vits-rvc) root@fangg-MS-7B78:/home/fangg/tts/uvr-master# conda activate uvr
(uvr) root@fangg-MS-7B78:/home/fangg/tts/uvr-master# pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116
Looking in indexes: https://pypi.org/simple, https://download.pytorch.org/whl/cu116
Collecting torch==1.13.1+cu116
  Downloading https://download.pytorch.org/whl/cu116/torch-1.13.1%2Bcu116-cp39-cp39-linux_x86_64.whl (1977.9 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.0/2.0 GB 1.3 MB/s eta 0:00:00
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ReadTimeoutError("HTTPSConnectionPool(host='pypi.org', port=443): Read timed out. (read timeout=15)")': /simple/torchvision/
...
(uvr) root@fangg-MS-7B78:/home/fangg/tts/uvr-master# pip install -r requirements.txt -i "https://pypi.tuna.tsinghua.edu.cn/simple/"
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple/
...
Successfully installed Cython-0.29.36 Dora-0.0.3 Pillow-9.3.0 PySoundFile-0.9.0.post1 PyYAML-6.0 aiohttp-3.8.4 aiosignal-1.3.1 altgraph-0.17.3 antlr4-python3-runtime-4.9.3 async-timeout-4.0.2 attrs-23.1.0 audioread-3.0.0 certifi-2022.12.7 cffi-1.15.1 coloredlogs-15.0.1 contourpy-1.1.0 cryptography-3.4.6 cycler-0.11.0 decorator-5.1.1 diffq-0.2.3 einops-0.6.0 flatbuffers-23.5.26 fonttools-4.41.0 frozenlist-1.4.0 fsspec-2023.6.0 future-0.18.2 humanfriendly-10.0 importlib-resources-6.0.0 joblib-1.3.1 julius-0.2.7 kiwisolver-1.4.4 kthread-0.2.3 librosa-0.9.2 lightning-utilities-0.9.0 llvmlite-0.39.1 matplotlib-3.7.2 mpmath-1.3.0 multidict-6.0.4 natsort-8.2.0 numba-0.56.4 numpy-1.23.4 omegaconf-2.2.3 onnx-1.14.0 onnxruntime-1.13.1 opencv-python-4.6.0.66 packaging-23.1 pandas-2.0.3 pip-22.2.2 platformdirs-3.9.1 playsound-1.3.0 pooch-1.7.0 protobuf-4.23.4 psutil-5.9.4 pycparser-2.21 pydub-0.25.1 pyglet-1.5.23 pyparsing-3.0.9 pyperclip-1.8.2 pyrubberband-0.3.0 python-dateutil-2.8.2 pytorch_lightning-2.0.0 pytz-2023.3 resampy-0.2.2 samplerate-0.1.0 scikit-learn-1.3.0 scipy-1.9.3 screeninfo-0.8.1 six-1.16.0 sklearn-0.0.post5 soundfile-0.11.0 soundstretch-1.2 sympy-1.12 threadpoolctl-3.2.0 torchmetrics-1.0.1 tqdm-4.65.0 tzdata-2023.3 urllib3-1.26.12 wget-3.2 yarl-1.9.2 zipp-3.16.2
WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv
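The torch install above hit a read timeout against pypi.org (read timeout=15), and the requirements install then switched to the Tsinghua mirror. As a rough sketch of the same idea from Python, the helper below builds a pip command that uses that mirror together with a longer socket timeout and more retries. The function names and the values 60 and 5 are illustrative assumptions, not settings from the article; `--timeout`, `--retries`, `-r`, and `-i` are standard pip options.

```python
import subprocess
import sys

# Mirror URL taken from the pip command in the transcript above.
TUNA_MIRROR = "https://pypi.tuna.tsinghua.edu.cn/simple/"


def build_pip_command(requirements="requirements.txt", index_url=TUNA_MIRROR,
                      timeout=60, retries=5):
    """Return the argv list for a pip install with a longer timeout.

    The timeout and retries defaults are assumed values for illustration.
    """
    return [
        sys.executable, "-m", "pip", "install",
        "--timeout", str(timeout),   # socket timeout in seconds
        "--retries", str(retries),   # connection attempts per request
        "-r", requirements,
        "-i", index_url,
    ]


def install(requirements="requirements.txt"):
    """Run pip in the current interpreter; raises CalledProcessError on failure."""
    subprocess.run(build_pip_command(requirements), check=True)


if __name__ == "__main__":
    # Print the command instead of running it, so it can be inspected first.
    print(" ".join(build_pip_command()))
```

Calling `install()` from inside the activated `uvr` environment would run the printed command with that environment's interpreter.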
(uvr) root@fangg-MS-7B78:/home/fangg/tts/uvr-master# python UVR.py 
playsound is relying on another python subprocess. Please use `pip install pygobject` if you want playsound to run more efficiently.
2023-07-18 07:25:41,200 - UVR BEGIN
No splash screen.
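If the GUI fails to start or processing is unexpectedly slow, a quick check of the environment can help narrow things down. The sketch below reports the Python version and, when torch is importable, its version and whether CUDA is visible. The function name `check_environment` is a hypothetical helper, not part of UVR.

```python
import importlib.util
import sys


def check_environment():
    """Collect basic facts about the current Python environment."""
    info = {
        "python": "%d.%d.%d" % sys.version_info[:3],
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "torch_version": None,
        "cuda_available": None,
    }
    if info["torch_installed"]:
        # Imported lazily so the script still runs when torch is missing.
        import torch
        info["torch_version"] = torch.__version__
        info["cuda_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```

In the `uvr` environment created above, one would expect the torch version to match the `1.13.1+cu116` wheel that was installed; whether CUDA is reported as available depends on the machine's driver and GPU.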
VR_Models

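1_HP-UVR keeps vocals (Vocals Only checked; slight noise, barely usable)
2_HP-UVR keeps vocals (Vocals Only checked; heavy electronic-sounding artifacts, unusable)
3_HP-Vocal-UVR keeps vocals (Vocals Only checked; slight noise, barely usable)
4_HP-Vocal-UVR keeps vocals (Vocals Only checked; heavy electronic-sounding artifacts, unusable)
6_HP-Karaoke-UVR keeps vocals (Vocals Only checked; slight noise, barely usable)
9_HP2-UVR keeps vocals (Vocals Only checked; longer processing time, slight noise, barely usable)
12_SP-UVR-3B-44100 keeps vocals (Vocals Only checked; short processing time, slight noise, barely usable)
16_SP-UVR-MID-44100-2 keeps vocals (Vocals Only checked; short processing time, slight noise, barely usable)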

MDX_Net_Models

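UVR_MDXNET_3_9662 keeps vocals (Vocals Only checked; fairly short processing time, slight noise, barely usable)
UVR-MDX-NET-Inst_3 keeps vocals (Vocals Only checked; slight noise, barely usable)
UVR-MDX-NET-Inst_Main keeps vocals (Vocals Only checked; little noise, recommended)
UVR_MDXNET_Main keeps vocals (Vocals Only checked; little noise, clearer audio, strongly recommended)
UVR_MDXNET_9482 keeps vocals (Vocals Only checked; fairly short processing time, faint traces of noise, barely usable)
kuielab_a_vocals keeps vocals (Vocals Only checked; fairly short processing time, little noise, recommended)
kuielab_b_vocals keeps vocals (Vocals Only checked; fairly short processing time, little noise, recommended)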

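Conclusion: of all the models tried, UVR_MDXNET_Main from MDX_Net_Models produced the best vocals (clear audio; it still contains a little noise, but less than any of the others).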
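The comparison above can be kept as a small lookup table so the shortlist is easy to query later. In this sketch the verdict labels are taken from the notes above, while the numeric ranks are an assumption introduced only so the verdicts can be ordered; they are not measured quality scores.

```python
# Ordinal ranks are an illustrative assumption, not measurements.
RANK = {
    "unusable": 0,
    "barely usable": 1,
    "recommended": 2,
    "strongly recommended": 3,
}

# model name -> (model family, verdict from the notes above)
NOTES = {
    "1_HP-UVR": ("VR", "barely usable"),
    "2_HP-UVR": ("VR", "unusable"),
    "3_HP-Vocal-UVR": ("VR", "barely usable"),
    "4_HP-Vocal-UVR": ("VR", "unusable"),
    "6_HP-Karaoke-UVR": ("VR", "barely usable"),
    "9_HP2-UVR": ("VR", "barely usable"),
    "12_SP-UVR-3B-44100": ("VR", "barely usable"),
    "16_SP-UVR-MID-44100-2": ("VR", "barely usable"),
    "UVR_MDXNET_3_9662": ("MDX-Net", "barely usable"),
    "UVR-MDX-NET-Inst_3": ("MDX-Net", "barely usable"),
    "UVR-MDX-NET-Inst_Main": ("MDX-Net", "recommended"),
    "UVR_MDXNET_Main": ("MDX-Net", "strongly recommended"),
    "UVR_MDXNET_9482": ("MDX-Net", "barely usable"),
    "kuielab_a_vocals": ("MDX-Net", "recommended"),
    "kuielab_b_vocals": ("MDX-Net", "recommended"),
}


def best_model(notes=NOTES):
    """Return the model name with the highest-ranked verdict."""
    return max(notes, key=lambda name: RANK[notes[name][1]])


def recommended(notes=NOTES):
    """Return models rated 'recommended' or better, in listing order."""
    return [name for name, (_, verdict) in notes.items()
            if RANK[verdict] >= RANK["recommended"]]


if __name__ == "__main__":
    print("best:", best_model())
    print("shortlist:", ", ".join(recommended()))
```

All four shortlisted models come from the MDX-Net family, which matches the conclusion above.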
