RuntimeSpeechRecognizer Usage Tutorial

尤歌泽Vigour

Published 2024-09-28 07:27:49

Article link: https://blog.csdn.net/gitblog_00959/article/details/142607546

RuntimeSpeechRecognizer: a cross-platform, real-time, offline speech recognition plugin for Unreal Engine, based on OpenAI's Whisper technology via whisper.cpp. Project page: https://gitcode.com/gh_mirrors/ru/RuntimeSpeechRecognizer

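1. Project Overview

RuntimeSpeechRecognizer is a cross-platform, real-time, offline speech recognition plugin for Unreal Engine, built on OpenAI's Whisper technology. It supports multiple model sizes, ranging from 75 MB to 2.9 GB, and runs on Windows, Mac, Linux, Android, and iOS. It needs no static libraries or external dependencies, which makes it easy to integrate into an Unreal Engine project.

2. Quick Start

2.1 Clone the project

First, clone the RuntimeSpeechRecognizer repository to your machine: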

git clone https://github.com/gtreshchev/RuntimeSpeechRecognizer.git

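2.2 Import the plugin

Copy the cloned folder into the Plugins directory of your Unreal Engine project (the directory that sits next to the project's .uproject file). If there is no Plugins directory yet, create it manually.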
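
The copy step can also be scripted. The following is a minimal sketch, assuming the plugin was cloned into a folder named RuntimeSpeechRecognizer; the function name `install_plugin` and the demo paths are hypothetical and only illustrate the directory layout described above.

```python
import shutil
import tempfile
from pathlib import Path


def install_plugin(plugin_src: Path, project_dir: Path) -> Path:
    """Copy a plugin folder into <project_dir>/Plugins, creating Plugins if needed."""
    plugins_dir = project_dir / "Plugins"
    plugins_dir.mkdir(parents=True, exist_ok=True)
    target = plugins_dir / plugin_src.name
    # dirs_exist_ok lets the copy be re-run to refresh an existing install.
    # The .git folder is skipped because the engine does not need repository history.
    shutil.copytree(
        plugin_src,
        target,
        dirs_exist_ok=True,
        ignore=shutil.ignore_patterns(".git"),
    )
    return target


if __name__ == "__main__":
    # Demo against throwaway directories (placeholder layout, not a real project).
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "RuntimeSpeechRecognizer"
        src.mkdir()
        (src / "RuntimeSpeechRecognizer.uplugin").write_text("{}")
        project = Path(tmp) / "MyProject"
        project.mkdir()
        print(install_plugin(src, project))
```

In a real project you would pass the path of the cloned repository and the folder that contains your .uproject file. The script requires Python 3.8 or later because of `dirs_exist_ok`.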

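2.3 Enable the plugin

1. Open your Unreal Engine project.
2. In the editor, go to Edit -> Plugins.
3. Find RuntimeSpeechRecognizer in the plugin list and tick its Enabled checkbox.
4. Restart the Unreal Engine editor to apply the change.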
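
Ticking the checkbox is recorded in the project's .uproject file, which is a JSON document with a "Plugins" array of entries carrying "Name" and "Enabled" fields. For automated setups, the same change can be made by script. This is a rough sketch rather than an official tool: the function name `enable_plugin` is hypothetical, and it assumes the plugin is registered under the name RuntimeSpeechRecognizer.

```python
import json
import tempfile
from pathlib import Path


def enable_plugin(uproject_path: Path, plugin_name: str) -> dict:
    """Mark plugin_name as enabled in the "Plugins" array of a .uproject file."""
    project = json.loads(uproject_path.read_text(encoding="utf-8"))
    plugins = project.setdefault("Plugins", [])
    for entry in plugins:
        if entry.get("Name") == plugin_name:
            entry["Enabled"] = True  # already listed: just flip the flag
            break
    else:
        # Not listed yet: append a new enabled entry.
        plugins.append({"Name": plugin_name, "Enabled": True})
    uproject_path.write_text(json.dumps(project, indent="\t"), encoding="utf-8")
    return project


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "MyProject.uproject"  # placeholder project file
        path.write_text(json.dumps({"FileVersion": 3, "Plugins": []}))
        print(enable_plugin(path, "RuntimeSpeechRecognizer")["Plugins"])
```

The editor still needs a restart after the file changes, exactly as in the manual steps.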

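2.4 Configure the plugin

In the Unreal Engine editor, go to Project Settings -> RuntimeSpeechRecognizer and choose the model size and the recognition language you need. Larger models are generally more accurate but take more disk space and more time to process audio.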
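
When deciding on a model size, a disk or package-size budget is often the limiting factor, especially on mobile. The sketch below picks the largest model tier that fits a budget. The tier names and sizes are assumptions: they are the approximate figures for the standard Whisper tiers as published for whisper.cpp, and the options and sizes shown in the plugin's settings may differ.

```python
from typing import Optional

# Approximate on-disk sizes in MB for the standard Whisper model tiers.
# These numbers are assumptions; check the sizes the plugin reports for
# the models it actually offers.
MODEL_SIZES_MB = {
    "tiny": 75,
    "base": 142,
    "small": 466,
    "medium": 1500,
    "large": 2900,
}


def largest_model_within(budget_mb: float) -> Optional[str]:
    """Return the name of the largest model that fits in budget_mb, or None."""
    fitting = [(size, name) for name, size in MODEL_SIZES_MB.items() if size <= budget_mb]
    # Tuples compare by size first, so max() yields the biggest fitting model.
    return max(fitting)[1] if fitting else None


if __name__ == "__main__":
    for budget in (50, 200, 1000, 4000):
        print(budget, "MB ->", largest_model_within(budget))
```

Size is only one criterion; recognition speed on the target device usually matters just as much for real-time use.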

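2.5 Use the plugin

In your Blueprint, use the nodes provided by RuntimeSpeechRecognizer to start and stop speech recognition. Start recognition once, in BeginPlay; starting it from Event Tick would restart it on every frame. The schematic below shows the idea. The node names are illustrative, so check the plugin's documentation for the exact nodes:

Event BeginPlay
    -> RuntimeSpeechRecognizer.StartRecognition

Event Tick
    -> RuntimeSpeechRecognizer.GetRecognizedText
        -> Print String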
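
To make the control flow concrete outside the editor, here is a small simulation of the "start once, poll every frame" pattern. It does not use the plugin at all: `FakeRecognizer`, `Actor`, and their methods are hypothetical stand-ins that only mirror the node names used in this tutorial.

```python
class FakeRecognizer:
    """Hypothetical stand-in for a recognizer; this is not the plugin's API."""

    def __init__(self, segments):
        self._pending = list(segments)
        self.start_calls = 0
        self.running = False

    def start_recognition(self):
        self.start_calls += 1
        self.running = True

    def get_recognized_text(self):
        # Return the next finished segment, or "" when nothing new is ready.
        if self.running and self._pending:
            return self._pending.pop(0)
        return ""


class Actor:
    def __init__(self, recognizer):
        self.recognizer = recognizer
        self.printed = []

    def begin_play(self):
        # Runs once: the right place to start recognition.
        self.recognizer.start_recognition()

    def tick(self):
        # Runs every frame: only poll, and skip empty results.
        text = self.recognizer.get_recognized_text()
        if text:
            self.printed.append(text)
            print(text)


def run(frames, segments):
    """Simulate one BeginPlay followed by the given number of ticks."""
    actor = Actor(FakeRecognizer(segments))
    actor.begin_play()
    for _ in range(frames):
        actor.tick()
    return actor


if __name__ == "__main__":
    run(frames=5, segments=["hello", "world"])
```

The point of the split is that `start_recognition` is called exactly once no matter how many frames run, while the per-frame work stays cheap. If the plugin exposes events or delegates for recognized text, binding to those is usually preferable to polling in Tick.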