Speech Recognition in Python
Welcome to The Complete Beginner’s Guide to Speech Recognition in Python.
In this post, I will walk you through some hands-on exercises that will help you understand speech recognition and how machine learning is used in it. Speech recognition saves time by letting us speak instead of type. It also lets us communicate with our devices without writing a single line of code, which makes technology more accessible and easier to use. Speech recognition is a great example of machine learning applied in real life.
Another nice example of speech recognition is the Google Meet web application. Did you know that you can turn on subtitles from the settings? When you turn them on, a program in the background recognizes your speech and converts it to text in real time. It’s really impressive to see how fast it happens. Another cool feature of the Google Meet recognizer is that it also knows who is speaking. In this walkthrough, we will use Google’s Speech API. I can’t wait to show you how to build your own speech recognizer. Let’s get started!
Table of contents
Speech Recognition Libraries
Recognizer Class
Speech Recognition Functions
Audio Preprocessing
Bonus (Different Scenarios)
Speech Recognition Libraries
- CMU Sphinx
- Kaldi
- SpeechRecognition
- wav2letter++
“CMU Sphinx collects over 20 years of the CMU research. Some advantage of this library: CMUSphinx tools are designed specifically for low-resource platforms, flexible design, and focus on practical application development and not on research.” (
“Kaldi is a toolkit for speech recognition, intended for use by speech recognition researchers and professionals.” (
“Speech Recognition is a library for performing speech recognition, with support for several engines