语音识别与分类（三分类）

最新推荐文章于 2022-11-09 06:49:49 发布

岚DEMO

最新推荐文章于 2022-11-09 06:49:49 发布

阅读量9.5k

点赞数 2

本文链接：https://blog.csdn.net/c2c2c2aa/article/details/81782863

版权

该博客介绍了一个针对bed、cat、happy三个单词的语音识别项目，通过定义模型和填充数据，实现了0.95的模型准确率。github链接提供详细实现过程。

摘要由CSDN通过智能技术生成

目的：识别三个单词（bed，cat，happy）

github：https://github.com/yaokaishile/three-classification
一：导入需要的包

import librosa
import os
from sklearn.model_selection import train_test_split
from keras.utils import to_categorical
import numpy as np
from tqdm import tqdm

二、定义所需函数

def get_labels(path=DATA_PATH):
    labels = os.listdir(path)
    label_indices = np.arange(0, len(labels))
    return labels, label_indices, to_categorical(label_indices)


def wav2mfcc(file_path):
    wave, sr = librosa.load(file_path, mono=True, sr=None)

    mfcc = librosa.feature.mfcc(wave, sr=16000)

    return mfcc


def save_data_to_array(path=DATA_PATH):
    lab

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

岚DEMO

关注关注

2
点赞
踩
28

收藏

觉得还不错? 一键收藏
3
评论
语音识别与分类（三分类）

目的：识别三个单词（bed，cat，happy）github：https://github.com/yaokaishile/three-classification 一：导入需要的包import librosaimport osfrom sklearn.model_selection import train_test_splitfrom keras.utils import to...
复制链接

扫一扫