Deepgram Python SDK 使用教程-CSDN博客

本文链接：https://blog.csdn.net/gitblog_00074/article/details/141879831

Deepgram Python SDK 使用教程

deepgram-python-sdkOfficial Python SDK for Deepgram's automated speech recognition APIs.项目地址:https://gitcode.com/gh_mirrors/de/deepgram-python-sdk

1、项目介绍

Deepgram Python SDK 是一个官方提供的 Python 库，用于访问 Deepgram 的自动语音识别（ASR）API。Deepgram 提供世界级的语音和语言 AI 模型，帮助开发者在其应用程序中集成高质量的语音识别功能。

2、项目快速启动

安装

首先，确保你已经安装了 Python 3.10 或更高版本。然后，通过以下命令安装 Deepgram Python SDK：

pip install deepgram-sdk

使用示例

以下是一个简单的示例，展示如何使用 Deepgram Python SDK 来转录音频文件：

from deepgram import Deepgram
import asyncio

DEEPGRAM_API_KEY = 'YOUR_API_KEY'
PATH_TO_FILE = 'some/file.wav'

async def main():
    # 初始化 Deepgram SDK
    deepgram = Deepgram(DEEPGRAM_API_KEY)
    
    # 打开文件并进行转录
    with open(PATH_TO_FILE, 'rb') as audio:
        source = {'buffer': audio, 'mimetype': 'audio/wav'}
        response = await deepgram.transcription.pre_recorded(source, {'punctuate': True})
        print(response['results']['channels'][0]['alternatives'][0]['transcript'])

asyncio.run(main())