本文翻译转载自:https://github.com/chidiwilliams/buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI’s Whisper.
- github : https://github.com/chidiwilliams/buzz
- 关于 whisper,可参考我的记录: https://blog.csdn.net/lovechris00/article/details/127402597
Features
- Real-time transcription and translation from your computer’s microphones to text (Demo)
- Import audio and video files and export transcripts to TXT, SRT, and VTT (Demo)
- Supports Whisper, Whisper.cpp, Whisper-compatible Hugging Face models, and the OpenAI Whisper API
- Available on Mac, Windows, and Linux
安装
给你的系统下载最新版本的 Buzz : https://github.com/chidiwilliams/buzz/releases/latest
Buzz 可以安装在 Mac, Windows, and Linux.
Mac (macOS 11.7 and later)
方式一:使用 brew
brew install --cask buzz
方式二:下载 Buzz-x.y.z.dmg
文件。
Windows
适用于 Windows 10 and later
下载运行 Buzz-x.y.z.exe
文件.
Linux
适用于 Ubuntu 20.04 and later
- 安装依赖:
sudo apt-get install libportaudio2
- 下载和提取
Buzz-x.y.z-unix.tar.gz
文件
如何使用
1、实时录制
To start a live recording:
- Select a recording task, language, quality, and microphone.
- Click Record.
Note: Transcribing audio using the default Whisper model is resource-intensive. If your computer is unable to keep up with real-time transcription, consider turning on GGML inference.
Field | Options | Default | Description |
---|---|---|---|
Task | “Transcribe”, “Translate” | “Transcribe” | “Transcribe” converts the input audio into text in the selected language, while “Translate” converts it into text in English. |
Language | See Whisper’s documentationfor the full list of supported languages | “Detect Language” | “Detect Language” will try to detect the spoken language in the audio based on the first few seconds. However, selecting a language is recommended (if known) as it will improve transcription quality in many cases. |
Quality | “Very Low”, “Low”, “Medium”, “High” | “Very Low” | The transcription quality determines the Whisper model used for transcription. “Very Low” uses the “tiny” model; “Low” uses the “base” model; “Medium” uses the “small” model; and “High” uses the “medium” model. The larger models produce higher-quality transcriptions, but require more system resources. See Whisper’s documentation for more information about the models. |
Microphone | [Available system microphones] | [Default system microphone] | Microphone for recording input audio. |
2、录制电脑音频
要录制计算机上应用程序播放的音频,您可以安装音频环回驱动程序(一种可以创建虚拟音频设备的程序)。
本指南的其余部分将在Mac上使用 BlackHole,但您可以为您的操作系统使用其他替代方案(请参阅LoopBeAudio、LoopBack和虚拟音频线)。
- BlackHole : https://github.com/ExistentialAudio/BlackHole
- LoopBeAudio: https://nerds.de/en/loopbeaudio.html
- Virtual Audio Cable : https://vac.muzychenko.net/en/
1、使用brew 安装BlackHole
BlackHole via Homebrew :
https://github.com/ExistentialAudio/BlackHole#option-2-install-via-homebrew
brew install blackhole-2ch
2、打开 Audio MIDI Setup
位于 /Applications/Utilities/Audio Midi Setup.app
3、点击左下角 +
图标,选择 Create Multi-Output Device
.
4、添加默认的 speaker 和 BlackHole 到多输出设备(multi-output device.)
5、选择此多输出设备作为扬声器(应用程序或系统范围),以便在BlackHole中播放音频。
6、 打开Buzz,选择BlackHole作为麦克风,然后像以前一样录制,以查看通过BlackHole播放的音频的 transcriptions。
导入文件
- Click Import Media File on the File menu (or the ‘+’ icon on the toolbar, or Command/Ctrl + O).
- 单击“文件”菜单上的“导入媒体文件”(或工具栏上的“+”图标,或快捷键:
Command/Ctrl + O
)。 - 选择文件
- 选择任务、语言和模型设置
- 点击运行
- 当转录状态显示“已完成”时,双击行(或选择行并单击
⤢
图标)打开转录。
Field | Options | Default | Description |
---|---|---|---|
Export As | “TXT”, “SRT”, “VTT” | “TXT” | Export file format |
Word-Level Timings | Off / On | Off | If checked, the transcription will generate a separate subtitle line for each word in the audio. Enabled only when “Export As” is set to “SRT” or “VTT”. |
查看 Live Recording section 来获取更多关于 task, language 和 quality 设置的信息。
Live Recording section : https://github.com/chidiwilliams/buzz#live-recording
从源码编译
1、安装依赖:Poetry
https://python-poetry.org/docs/#installing-with-the-official-installer
2、下载 repository
git clone --recurse-submodules https://github.com/chidiwilliams/buzz
3、安装项目依赖
poetry install
4、(可选) 使用 Whisper.cpp 推理,运行:
make buzz/whisper_cpp.py
5、(可选) 要编译翻译,运行:
make translation_mo
6、最后,运行应用程序
poetry run python main.py
或者使用下述代码编译
poetry run pyinstaller --noconfirm Buzz.spec
FAQ
1、模型保存地址: ~/.cache/whisper
Whisper.cpp
模型存储地址:
- macOS :
~/Library/Caches/Buzz
- Unix :
~/.cache/Buzz
- Windows :
C:\Users\<username>\AppData\Local\Buzz\Buzz\Cache
2、如果转录运行太慢,可以使用小一点的额 Whisper模型,或者使用 Whisper.cpp 模型。
2023-03-21