音乐情感分类

最新推荐文章于 2024-07-23 15:46:40 发布

Luven777

最新推荐文章于 2024-07-23 15:46:40 发布

阅读量1.3w

点赞数

分类专栏：语音处理文章标签： svm 情感算法

本文链接：https://blog.csdn.net/Luven777/article/details/80013840

版权

语音处理专栏收录该内容

0 篇文章 0 订阅

订阅专栏

音乐情感分类

Thayer模型
Thayer情感模型是二维情感模型
纵坐标表示的是能量维度，从“平静的”到“活力的”变化，反映的是主体的情感活跃程度；横坐标表示的是压力维度。
从“消极”到“积极”的变化，反映了主体的主观感受，从而将音乐情感划分为具有代表性的四类：兴奋的、愤怒的、悲伤的和放松的。

SVM算法
SVM(Support Vector Machine)指的是支持向量机，是常见的一种判别方法。在机器学习领域，是一个有监督的学习模型，通常用来进行模式识别、分类以及回归分析

LibSVM工具
LibSVM参数配置：
options:
-s svm_type : set type of SVM (default 0)
0 – C-SVC
1 – nu-SVC
2 – one-class SVM
3 – epsilon-SVR
4 – nu-SVR
svm类型配置，默认为C-SVC
-t kernel_type : set type of kernel function (default 2)
0 – linear: u’*v
1 – polynomial: (gamma*u’*v + coef0)^degree
2 – radial basis function: exp(-gamma*|u-v|^2)
3 – sigmoid: tanh(gamma*u’*v + coef0)
核函数配置默认是rbf径向基函数
下面是具体degree gamma 等参数的设置
-d degree : set degree in kernel function (default 3)
-g gamma : set gamma in kernel function (default 1/num_features)
-r coef0 : set coef0 in kernel function (default 0)
-c cost : set the parameter C of C-SVC, epsilon-SVR, and nu-SVR (default 1)
-n nu : set the parameter nu of nu-SVC, one-class SVM, and nu-SVR (default 0.5)
-p epsilon : set the epsilon in loss function of epsilon-SVR (default 0.1)
-m cachesize : set cache memory size in MB (default 100)
-e epsilon : set tolerance of termination criterion (default 0.001)
-h shrinking: whether to use the shrinking heuristics, 0 or 1 (default 1)
-b probability_estimates: whether to train a SVC or SVR model for probability estimates, 0 or 1 (default 0)
-wi weight: set the parameter C of class i to weight*C, for C-SVC (default 1)
The k in the -g option means the number of attributes in the input data.

JAudio工具
JAudio工具是语音特征提取工具，这里提取的特征表征音调，响度，音色，节奏等。
以下维度分别都用到了均值和标准差 mean and standard deviation
例：频谱变化度1维为均值，2维为标准差。
表征音调：1-2.频谱变化度 Spectral Variability
3-4.频谱峰值Peack Based Spectral Smoothness
表征响度：
5-6.短时能量均方根 Root Mean Square
7-8.低能量帧比率 Fraction of Low Energy Windows
表征音色：
9-10.过零率均值和标准差zero Crossings
11-12.频谱质心Spectral Centroid
13-38. MFCC MFCC 13维
39-58.LPC 10 维
59-60.频谱通量 Spectral Flux
61-62.频谱衰减点 Spectral Rolloff Point
63-64.频谱简洁度Compactness

表征节奏：
65-66.节拍强度Beat Sum
67-68.最强节拍Strongest Beat
69-70.最强节拍强度 Strength of Strongest Beat

参数维度及实验结果
这里使用到了表征音调，响度，音色，节奏等四大特征70维数据，用于作为音乐特征参数。实验使用到了四种情感各85首歌曲，总共340首歌曲，每10s一个片段，总共7140个片段，其中80%用于训练，20%用于预测。各准确率在80%以上。

参考文献：
中国民族音乐特征提取及分类技术的研究-孙科
 JAudio工具下载
 LibSVM工具下载

Luven777

关注

0
点赞
踩
26

收藏

觉得还不错? 一键收藏
0
评论
音乐情感分类

音乐情感分类Thayer模型 Thayer情感模型是二维情感模型纵坐标表示的是能量维度，从“平静的”到“活力的”变化，反映的是主体的情感活跃程度；横坐标表示的是压力维度。从“消极”到“积极”的变化，反映了主体的主观感受，从而将音乐情感划分为具有代表性的四类：兴奋的、愤怒的、悲伤的和放松的。
复制链接

扫一扫

专栏目录