SpeechRecognition 开源项目教程

高鲁榕Jeremiah

于 2024-10-11 07:44:37 发布

阅读量1k

点赞数 12

本文链接：https://blog.csdn.net/gitblog_00135/article/details/142840878

版权

SpeechRecognition 开源项目教程

speech_recognition Speech recognition module for Python, supporting several engines and APIs, online and offline. 项目地址: https://gitcode.com/gh_mirrors/spee/speech_recognition

1. 项目介绍

SpeechRecognition 是一个用于 Python 的语音识别模块，支持多种引擎和 API，包括在线和离线模式。该项目由 fossasia 组织维护，旨在提供一个简单易用的接口，让开发者能够轻松地将语音识别功能集成到他们的应用程序中。

主要功能

多引擎支持: 支持 CMU Sphinx、Google Speech Recognition、Google Cloud Speech API、Wit.ai、Microsoft Azure Speech、Microsoft Bing Voice Recognition（已弃用）、Houndify API、IBM Speech to Text 和 Snowboy Hotword Detection。
在线和离线模式: 支持在线和离线语音识别。
跨平台: 适用于 Windows、Linux 和 macOS。

2. 项目快速启动

安装

首先，确保你已经安装了 Python 2.6+ 或 3.3+。然后，使用 pip 安装 SpeechRecognition 模块：

pip install SpeechRecognition

快速示例

以下是一个简单的示例，展示如何使用 SpeechRecognition 模块从麦克风捕获音频并进行语音识别：

import speech_recognition as sr

# 创建一个 Recognizer 实例
r = sr.Recognizer()

# 使用麦克风作为音频源
with sr.Microphone() as source:
    print("请说话...")
    # 调整麦克风的环境噪音阈值
    r.adjust_for_ambient_noise(source)
    # 捕获音频
    audio = r.listen(source)

try:
    # 使用 Google Web Speech API 进行语音识别
    print("你说的是: " + r.recognize_google(audio, language="zh-CN"))
except sr.UnknownValueError:
    print("无法识别语音")
except sr.RequestError as e:
    print(f"请求失败; {e}")