TorchVGGish 开源项目教程

最新推荐文章于 2024-08-13 08:02:19 发布

齐游菊Rosemary

最新推荐文章于 2024-08-13 08:02:19 发布

阅读量355

点赞数 3

本文链接：https://blog.csdn.net/gitblog_00013/article/details/141117068

版权

TorchVGGish 开源项目教程

torchvggish项目地址:https://gitcode.com/gh_mirrors/to/torchvggish

项目介绍

TorchVGGish 是一个基于 PyTorch 的 VGGish 特征嵌入前端，用于音频分类模型。该项目是 TensorFlow 的 VGGish 模型的 PyTorch 移植版本，其权重直接从 TensorFlow 模型移植而来，因此使用 TorchVGGish 创建的嵌入将与 TensorFlow 版本相同。

项目快速启动

安装

你可以通过以下两种方式之一安装 TorchVGGish：

从 PyPI 安装最新稳定版本：
```
pip install torchvggish
```

克隆仓库并安装：

git clone https://github.com/harritaylor/torchvggish.git
cd torchvggish
python3 -m venv env
source env/bin/activate
pip install -r requirements.txt

使用示例

以下是一个简单的示例，展示如何从示例 WAV 文件创建嵌入：

from torchvggish import vggish, vggish_input

# 初始化模型并下载权重
embedding_model = vggish()
embedding_model.eval()

# 加载示例 WAV 文件
example = vggish_input.wavfile_to_examples("example.wav")

# 创建嵌入
embeddings = embedding_model(example)