AudioSet-音频数据集

不务正业的猿

于 2020-09-16 18:18:24 发布

阅读量4.6k

点赞数 4

分类专栏：下载数据集文章标签： AudioSet 音频数据集数据集

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/ispeasant/article/details/108627673

版权

下载同时被 2 个专栏收录

198 篇文章 ¥29.90 ¥99.00

订阅专栏

169 篇文章

订阅专栏

原文：

AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds.

By releasing AudioSet, we hope to provide a common, realistic-scale evaluation task for audio event detection, as well as a starting point for a comprehensive vocabulary of sound events.

译：

AudioSet由632个音频事件类的扩展本体和2084320个人类标记的10秒声音片段集合组成。本体被指定为事件类别的层次图，涵盖了广泛的人类和动物声音、乐器和流派以及常见的日常环境声音。

通过发布AudioSet，我们希望能够为音频事件检测提供一个通用的、现实的尺度评估任务，以及一个全面的声音事件词汇表的起点。

大家可以到官网地址下载数据集，我自己也在百度网盘分享了一份。可关注本人公众号，回复“2020091601”获取下载链接。

评论 5

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

打赏作者

不务正业的猿 谢谢您的支持与鼓励！！！

¥1 ¥2 ¥4 ¥6 ¥10 ¥20

扫码支付：¥1

获取中

扫码支付

您的余额不足，请更换扫码支付或充值

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。