Configure environment variables
1. abd environment
2. ffmpeg environment configuration
3. Download and configure torchaudio
Example code
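Before running the example, it can help to confirm that the setup steps above took effect. The following is a minimal sketch, not part of the original notes: check_environment is a hypothetical helper, and the lists of commands and modules it checks are assumptions based on the steps above and the imports in the example code. It only tests that ffmpeg is on PATH and that the Python packages can be found. It does not verify versions.

```python
import importlib.util
import shutil


def check_environment(commands=("ffmpeg",), modules=("torch", "torchaudio", "whisper")):
    """Return a dict mapping each requirement name to True (found) or False (missing).

    The default names are assumptions taken from the setup list; adjust as needed.
    """
    status = {}
    for cmd in commands:
        # shutil.which returns None when the executable is not on PATH.
        status[cmd] = shutil.which(cmd) is not None
    for mod in modules:
        # find_spec returns None when a top-level module is not installed.
        status[mod] = importlib.util.find_spec(mod) is not None
    return status


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```

If ffmpeg is reported as missing, the usual cause is that its directory was not added to the PATH environment variable, or that the terminal was not reopened after the change.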
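import torch
import whisper

if __name__ == "__main__":
    # Use the GPU when CUDA is available, otherwise fall back to the CPU.
    device = "cuda" if torch.cuda.is_available() else "cpu"
    # Load the 'medium' model and move it to the selected device.
    model = whisper.load_model('medium').to(device)
    # Transcribe the test audio as Chinese; fp16=False forces FP32 inference.
    result = model.transcribe('audio.mp3', language='Chinese', temperature=0.8, fp16=False)
    # result is a dict; the full transcript is stored under result["text"].
    print(result)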
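medium is the model size. Larger models are generally more accurate, but they are also slower and need more memory.
audio.mp3 is the test audio file.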
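Printing result directly dumps the whole dictionary. The dictionary returned by transcribe holds the full transcript under "text" and a list of timestamped pieces under "segments", so a small formatter makes the output easier to read. The sketch below is an illustration only: format_segments is a hypothetical helper, and the sample dictionary is made-up data that stands in for a real transcription result so the code runs without a model or an audio file.

```python
def format_segments(result):
    """Turn a Whisper-style result dict into '[start -> end] text' lines."""
    lines = []
    for seg in result.get("segments", []):
        # Each segment carries start/end times in seconds and its text.
        lines.append(f"[{seg['start']:.2f}s -> {seg['end']:.2f}s] {seg['text'].strip()}")
    return lines


if __name__ == "__main__":
    # Made-up sample in the same shape as a transcription result.
    sample = {
        "text": " hello world",
        "segments": [
            {"start": 0.0, "end": 1.5, "text": " hello"},
            {"start": 1.5, "end": 3.0, "text": " world"},
        ],
    }
    print(sample["text"].strip())
    for line in format_segments(sample):
        print(line)
```

In the example code, the same helper could be called as format_segments(result) in place of print(result).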