IOS开发之语音合成（科大讯飞）详解

最新推荐文章于 2022-01-22 08:53:37 发布

iteye_11495

最新推荐文章于 2022-01-22 08:53:37 发布

阅读量366

点赞数

文章标签：移动开发人工智能 xcode

转自：http://www.daxueit.com/article/4422.html

如何实现语音合成呢？

1、注册讯飞账号，申请APPID（注意选择IOS平台）

2、加载所需要的类库

3、导入所需要的类库文件头

4、调用申请的APPID以及所需函数，完成语音合成（需要参考官方给出的SDK文件）

详细步骤：

一、首先到科大讯飞官网注册账号(http://open.voicecloud.cn/)，并创建应用获取appid，下载sdk文件

二、代码实现api调用

1.先用xcode(我这里使用的是xcode 5.1)新建好一个项目，然后在项目添加要用的类库。其中有一个是讯飞语音的类库iflyMSC，在下载的sdk文件里有，导入就行了。导入的时候要注意把iflyMSC类库拷贝到你的工程目录里，不然后果很严重！

2.导完类库之后，在建好的工程里添加好要用的头文件。

MainViewController.h

 
        #import <UIKit/UIKit.h> 
       
        #import "iflyMSC/IFlySpeechSynthesizerDelegate.h"

MainViewController.m

 
        #import "MainViewController.h" 
       
        #import <QuartzCore/QuartzCore.h> 
       
        #import <AVFoundation/AVAudioSession.h> 
       
        #import <AudioToolbox/AudioSession.h> 
       
        #import "iflyMSC/IFlySpeechConstant.h" 
       
        #import "iflyMSC/IFlySpeechUtility.h" 
       
        #import "iflyMSC/IFlySpeechSynthesizer.h"

3.完成这些准备工作之后，接下来就是堆代码的工作了。为了方便，笔者只用了两个控件：一个UITextField、一个UIButton，然后给这两个控件分别做一个Outlet和Action连接。

MainViewController.h

 
        #import <UIKit/UIKit.h> 
       
        #import "iflyMSC/IFlySpeechSynthesizerDelegate.h" 
       
        //引入语音合成类 
       
        @ 
        class 
         IFlySpeechSynthesizer; 
       
        @ 
        class 
         IFlyDataUploader; 
       
        //注意要添加语音合成代理 
       
        @interface MainViewController : UIViewController<IFlySpeechSynthesizerDelegate> 
       
        //声明语音合成的对象 
       
        @property (nonatomic, strong) IFlySpeechSynthesizer *iFlySpeechSynthesizer; 
       
        @property (strong, nonatomic) IBOutlet UITextField *content; 
       
        - (IBAction)Start:(id)sender; 
       
        @end

MainViewController.m

 
        #import "MainViewController.h" 
       
        #import <QuartzCore/QuartzCore.h> 
       
        #import <AVFoundation/AVAudioSession.h> 
       
        #import <AudioToolbox/AudioSession.h> 
       
        #import "iflyMSC/IFlySpeechConstant.h" 
       
        #import "iflyMSC/IFlySpeechUtility.h" 
       
        #import "iflyMSC/IFlySpeechSynthesizer.h" 
       
        @interface MainViewController () 
       
        @end 
       
        @implementation MainViewController 
       
        - ( 
        void 
        )viewDidLoad 
       
        { 
       
        [super viewDidLoad]; 
       
        //通过appid连接讯飞语音服务器，把@"53b5560a"换成你申请的appid 
       
        NSString *initString = [[NSString alloc] initWithFormat:@ 
        "appid=%@,timeout=%@" 
        ,@ 
        "53b5560a" 
        ,@ 
        "20000" 
        ]; 
       
        //所有服务启动前，需要确保执行createUtility 
       
        [IFlySpeechUtility createUtility:initString]; 
       
        //创建合成对象，为单例模式 
       
        _iFlySpeechSynthesizer = [IFlySpeechSynthesizer sharedInstance]; 
       
        _iFlySpeechSynthesizer.delegate = self; 
       
        //设置语音合成的参数 
       
        //合成的语速,取值范围 0~100 
       
        [_iFlySpeechSynthesizer setParameter:@ 
        "50" 
         forKey:[IFlySpeechConstant SPEED]]; 
       
        //合成的音量;取值范围 0~100 
       
        [_iFlySpeechSynthesizer setParameter:@ 
        "50" 
         forKey:[IFlySpeechConstant VOLUME]]; 
       
        //发音人,默认为”xiaoyan”;可以设置的参数列表可参考个性化发音人列表 
       
        [_iFlySpeechSynthesizer setParameter:@ 
        "xiaoyan" 
         forKey:[IFlySpeechConstant VOICE_NAME]]; 
       
        //音频采样率,目前支持的采样率有 16000 和 8000 
       
        [_iFlySpeechSynthesizer setParameter:@ 
        "8000" 
         forKey:[IFlySpeechConstant SAMPLE_RATE]]; 
       
        asr_audio_path保存录音文件路径，如不再需要，设置value为nil表示取消，默认目录是documents 
       
        [_iFlySpeechSynthesizer setParameter: 
        "tts.pcm" 
         forKey:[IFlySpeechConstant TTS_AUDIO_PATH]]; 
       
        //隐藏键盘，点击空白处 
       
        UITapGestureRecognizer *tapGr = [[UITapGestureRecognizer alloc] initWithTarget:self action:@selector(viewTapped:)]; 
       
        tapGr.cancelsTouchesInView = NO; 
       
        [self.view addGestureRecognizer:tapGr];  
       
        } 
       
        -( 
        void 
        )viewTapped:(UITapGestureRecognizer*)tapGr 
       
        { 
       
        [self.content resignFirstResponder]; 
       
        } 
       
        - ( 
        void 
        )didReceiveMemoryWarning 
       
        { 
       
        [super didReceiveMemoryWarning]; 
       
        // Dispose of any resources that can be recreated. 
       
        } 
       
        - (IBAction)Start:(id)sender  
       
        { 
       
        //启动合成会话 
       
        [_iFlySpeechSynthesizer startSpeaking:self.content.text]; 
       
        } 
       
        #pragma mark - IFlySpeechSynthesizerDelegate 
       
        //开始播放 
       
        - ( 
        void 
        ) onSpeakBegin 
       
        { 
       
        } 
       
        //缓冲进度 
       
        - ( 
        void 
        ) onBufferProgress:( 
        int 
        ) progress message:(NSString *)msg 
       
        { 
       
        NSLog(@ 
        "bufferProgress:%d,message:%@" 
        ,progress,msg); 
       
        } 
       
        //播放进度 
       
        - ( 
        void 
        ) onSpeakProgress:( 
        int 
        ) progress 
       
        { 
       
        NSLog(@ 
        "play progress:%d" 
        ,progress); 
       
        } 
       
        //暂停播放 
       
        - ( 
        void 
        ) onSpeakPaused 
       
        { 
       
        } 
       
        //恢复播放 
       
        - ( 
        void 
        ) onSpeakResumed 
       
        { 
       
        } 
       
        //结束回调 
       
        - ( 
        void 
        ) onCompleted:(IFlySpeechError *) error 
       
        { 
       
        } 
       
        @end