An Example of Azure Speaker Recognition API

最新推荐文章于 2023-01-10 22:00:00 发布

PatrickPan2018

最新推荐文章于 2023-01-10 22:00:00 发布

阅读量882

点赞数

本文链接：https://blog.csdn.net/ItachiUchiha/article/details/81046284

版权

Java source codes in GitHub: testAzureSpeakerRecognitionAPI

If you are looking for a demo, please click here.

Currently, the documents about Speaker Recognition API are not good enough. Besides, the sample codes of Java cannot be used directly.

Basically, there are three steps to use Speaker Recognition API.

Create profile
Create enrollment
Identification

To run the source codes, steps include:

Create Cognitive Services in Azure and copy Subscription key to "Authentication.java".
Run "CreateProfile.java" and copy profile id from response to "Authentication.java".
Run "CreateEnrollment.java" with specific audio file and save operation id in response header.

Run "GetOperationStatus.java" with specific operation id.

Create more than one profile and upload more audio files for each profile, run "Identification.java" with a new audio file. Similarly, save operation id in response header.
Again, run "GetOperationStatus.java" with specific operation id.

As for audio file, the format must meet the following requirements.

It took me nearly 2 hours to find a way to create a valid audio file.

Use default sound recorder to crate a audio file(WMA).

Use online service cloudconvert to convert wma to WAV.

After that, use Wav Sample Rate Converter to change sample format.

Finally, the duration of an audio file shouldn't be too long (between 30s and 40s) otherwise the size may be too large and exceptions will probably occur when creating erollment.

Cheers!

PatrickPan2018

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
An Example of Azure Speaker Recognition API

Java source codes in GitHub:testAzureSpeakerRecognitionAPIIf you are looking for a demo, please click here.Currently, the documents about Speaker Recognition API are not good enough.Besides, the...
复制链接

扫一扫