- 博客(180)
- 资源 (18)
- 收藏
- 关注
原创 20190510 语音识别资源整理
语音处理课程推荐|Speech Processing(2019) 台师大Speech Processing。国立台湾师范大学的陈柏琳教授。http://berlin.csie.ntnu.edu.tw/Courses/Speech Processing/Speech Processing_Main_2019S.htm陈教授教学多年,主页上还有好多其他课程。http://berlin.csi...
2020-04-13 18:39:37 5482 4
翻译 Attention模型的调查报告(有翻译)An Attentive Survey of Attention Models
这是arxiv上19年4月的论文,花了一天时间翻译了一下,但是感觉对于我这样的小白来说,然并卵。下载地址:
2019-04-17 17:18:48 564
原创 macbook安装win10没有声音
macbook安装win10没有声音我的macbook是2015年款找到右下角声音图标,右键,选择声音,选择播放选项卡,我的播放选项卡是这个样子:扬声器、数字音频之类都可以点击右键—测试,测试哪个有声音就给它设置成默认设备就好了。我的一开始默认的是数字音频,但是测试没有声音,扬声器测试的时候就有声音,设成默认以后就好了。...
2021-03-11 21:01:30 8747
原创 oppoA57 连上电脑之后没反应
今天要给妈妈的手机做照片备份,结果oppoA57 连上电脑之后没反应,看不见手机的内存卡,找了一大圈办法,弄各种驱动都没有解决。后来才发现oppo手机自带了一个app叫文件管理,里面有个远程管理,打开服务之后,只要电脑和手机在一个网络下,电脑就可以看见手机里的资料了。买手机之后第一次用的时候删掉了一大堆oppo自带的软件,幸亏这个app没有删。...
2020-10-14 14:06:30 1284
原创 Library not loaded: @loader_path/libmex.dylib
这两天要跑一个asvspoof2017的baseline,matlab的代码,可是出现一个动态库无法加载的问题,搞了好久还请了高人帮忙,终于解决了我自己的问题忘了截图了,说mexmaci64这个文件无效,跟下面差不多:问题如下:Library not loaded: @loader_path/libmex.dylibReferenced from:/Users/usr/Documents/MATLAB/SFMedu2/denseMatch/priority_queue_1.0/pq_create.
2020-06-25 11:52:30 1555 1
原创 20200621--learning-to-fool-the-speaker-recognition-master 实验记录
出错1:RuntimeError: Detected that PyTorch and torchvision were compiled with different CUDA versions. PyTorch has CUDA Version=10.2 and torchvision has CUDA Version=10.1. Please reinstall the torchvision that matches your PyTorch install.解决办法:pip install t
2020-06-22 23:56:50 574
转载 pytorch 最简单示例
# 来自B站刘二大人import torchx_data = torch.Tensor([[1.0], [2.0], [3.0]])y_data = torch.Tensor([[2.0], [4.0], [6.0]])class LinearModel(torch.nn.Module): def __init__(self): super(LinearModel, self).__init__() self.linear = torch.nn.Line
2020-06-21 16:00:17 7047
翻译 icassp2020---XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGE
XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGEHao Lu1, Jianfeng Zhou2, Miao Zhao1, Wendian Lei3, Qingyang Hong∗1, Lin Li∗21School of Informatics, Xiamen University, China,厦门大学信息学院2School of Electronic...
2020-04-22 16:21:12 1729
翻译 icassp2020--TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES
TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCESKai Liu, Huan ZhouArtificial Intelligence Application Research Center, Huawei Technologies Shenzhen, PRCABSTRACT摘...
2020-04-22 16:18:37 1397
翻译 icassp2020会议时间安排
https://cmsworkshops.com/ICASSP2020/TechnicalProgram.asp主要是看看来了解下icassp的topic都有啥,原链接每个topic点进去还有paper列表。
2020-04-13 15:18:42 2532 4
翻译 ICASSP2020一些主题演讲
https://cmsworkshops.com/ICASSP2020/TechnicalProgram.asp@[TOC] 目录T-1: Machine Learning and Wireless Communicationsmobile communications and machine learning are two of the most exciting and rapidly...
2020-04-13 15:01:45 10519
原创 mac 安装pyaudio报错
pip install pyaudio报错说缺少:portaudio.h解决办法:pip install --global-option=‘build_ext’ --global-option=’-I/usr/local/include’ --global-option=’-L/usr/local/lib’ pyaudio参考:https://www.jianshu.com/p/7f81e...
2020-04-01 14:49:42 697 1
原创 linux如何只复制目录结构而不复制数据
find . -type d -exec mkdir -p /data/datasets/musan1/{} ;在当前目录下找类型为d的文件(即目录类型),然后执行后面的操作。当前目录是你要copy的文件夹,-p后面接的目的文件夹...
2020-03-27 15:07:02 4343
翻译 2016--AN EXTENSIBLE SPEAKER IDENTIFICATION SIDEKIT IN PYTHON
AN EXTENSIBLE SPEAKER IDENTIFICATION SIDEKIT IN PYTHONAnthony Larcher1, Kong Aik Lee2, Sylvain Meignier11LIUM - Universite ́ du Maine, France 法国 勒芒大学2Human Language Technology Department, Insti...
2020-03-13 19:00:34 996
翻译 2016--MatConvNet Convolutional Neural Networks for MATLAB
Abstract摘要MatConvNet is an implementation of Convolutional Neural Networks (CNNs) for MATLAB. The toolbox is designed with an emphasis on simplicity and flexibility. It exposes the building blocks o...
2020-03-13 18:59:21 933
翻译 AN OPEN-SOURCE SPEAKER GENDER DETECTION FRAMEWORK FOR MONITORING GENDER EQUALITY
AN OPEN-SOURCE SPEAKER GENDER DETECTION FRAMEWORK FOR MONITORING GENDER EQUALITY监测两性平等的开源说话人性别检测框架David Doukhan, Jean CarriveFrench National Institute of Audiovisual Paris, FranceFe ́licien Vallet...
2020-03-13 18:58:18 715
翻译 S4D: Speaker Diarization Toolkit in Python
S4D: Speaker Diarization Toolkit in Python1French National Audiovisual Institute (INA), Paris, France2Computer Science Laboratory of Le Mans University (LIUM - EA 4023), Le Mans, FranceSIDEKIT for ...
2020-03-13 18:57:28 884
翻译 2017--Speaker and Language Recognition and Characterization: Introduction to the CSL Special Issue
2017–Speaker and Language Recognition and Characterization: Introduction to the CSL Special IssueEduardo Lleida1, Luis Javier Rodriguez-Fuentes21 Aragon Institute for Engineering Research (I3A), Uni...
2020-03-13 18:56:37 2891
翻译 2019---Introduction to the special issue “Speaker and language characterization and recog
Introduction to the special issue “Speaker and language characterization and recognition: Voice modeling, conversion, synthesis and ethical aspects”“说话人和语言的特征和识别:声音建模、转换、合成和伦理方面”专题介绍Welcome to this ...
2020-03-13 18:56:00 579
翻译 An initial investigation on optimizing tandem speaker verification and countermeasure systems using
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning标题:利用强化学习优化串联说话人验证与对抗系统的初步研究作者: Anssi Kanervisto, Junichi Yamagishi链接:https...
2020-02-18 22:30:32 559
翻译 Emotion Recognition Using Speaker Cues
Emotion Recognition Using Speaker Cues标题:基于说话人线索的情感识别作者: Ismail Shahin链接:https://arxiv.org/abs/2002.03566This research aims at identifying the unknown emotion using speaker cues. In this study, we...
2020-02-18 22:29:54 182
翻译 Unsupervised training of neural mask-based beamforming
Unsupervised training of neural mask-based beamformingLukas Drude?, Jahn Heymann?, Reinhold Haeb-UmbachPaderborn University, Department of Communications Engineering, Paderborn, Germany{drude, he...
2020-02-12 23:22:01 254
翻译 2016--Analysis of the DNN-based SRE systems in multi-language conditions
This paper analyzes the behavior of our state-of-the-art Deep Neural Network/i-vector/PLDA-based speaker recognition systems in multi-language conditions. On the “Language Pack” of the PRISM set, we e...
2020-02-12 23:20:51 426
翻译 The LeVoice Far-field Speech Recognition System for VOiCES from a Distance Challenge 2019
The LeVoice Far-field Speech Recognition System for VOiCES from a Distance Challenge 2019Yulong Liang, Lin Yang, Xuyang Wang, Yingjie Li, Chen Jia, Junjie WangLenovo [email protected]重点在...
2020-02-12 22:54:55 457
翻译 Far-Field End-to-End Text-Dependent Speaker Verification based on Mixed Training Data with Transfer
Far-Field End-to-End Text-Dependent Speaker Verification based on Mixed Training Data with Transfer Learning and Enrollment Data AugmentationXiaoyi Qin1,2, Danwei Cai1, Ming Li11Data Science Resea...
2020-02-12 22:51:33 607
翻译 2019--Target Speaker Extraction for Multi-Talker Speaker Verification
Target Speaker Extraction for Multi-Talker Speaker VerificationWei Rao1, Chenglin Xu2,3, Eng Siong Chng2,3, Haizhou Li11Department of Electrical and Computer Engineering, National University of S...
2020-02-12 22:48:22 1746 2
原创 canon ip 1180 喷墨打印机 mac 驱动
下载地址就在这里:http://www.downcc.com/soft/30779.html好激动啊,家里这个老式打印机终于能用了,太开心了。这个打印机用win的话是自动装驱动的,直接就能用。可怜家里唯一的win已经慢的不行了,mac能用就太好啦,开心开心~~~~...
2020-02-07 12:57:14 988
翻译 Within-sample variability-invariant loss for robust speaker recognition under noisy environments
Within-sample variability-invariant loss for robust speaker recognition under noisy environments标题:样本内变异性-噪声环境下稳健说话人识别的不变损失作者: Danwei Cai, Ming Li备注:Accepted at ICASSP 2020链接:https://arxiv.org/abs...
2020-02-04 18:15:51 806
翻译 Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression
Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression标题:使用混合比特率opus压缩的多声道声学建模作者: Aparna Khare, Minhua Wu链接:https://arxiv.org/abs/2002.00122Recent literature has shown that a learned...
2020-02-04 16:35:11 184
翻译 DropClass and DropAdapt: Dropping classes for deep speaker representation learning
DropClass and DropAdapt: Dropping classes for deep speaker representation learning标题:DropClass和DropAdapt:用于深层说话人表示学习的丢弃类作者: Chau Luu, Steve Renals备注:Submitted to Speaker Odyssey 2020链接:https://arx...
2020-02-04 14:12:18 641
翻译 Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks
Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks标题:基于时域卷积递归神经网络的单通道语音增强作者: Jingdong Li, Changliang Li链接:https://arxiv.org/abs/2002.00319Jingdong Li∗ Hui Zha...
2020-02-04 13:27:43 733
原创 如何替换mac word中的换行符为空格
要是win版的word,直接替换就很方便,找到所有^p,然后替换为空格即可。但是mac版的word,直接找^p,根本找不到。试了好多次,终于发现解决办法了。mac版的word默认是显示段落标记的,需要先在word的偏好设置中,把显示段落标记的地方勾掉,在视图里,显示非打印字符,下面有个全部,把全部前面的默认的√,勾掉即可。回来再找^p,就找到了,替换为空格即可。...
2020-02-04 13:19:45 5410
翻译 Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification
Self-Attentive Speaker Embeddings for Text-Independent Speaker VerificationYingke Zhu1, Tom Ko2, David Snyder3, Brian Mak1, Daniel Povey31Department of Computer Science & EngineeringThe Hong Ko...
2020-02-04 01:13:33 1949 1
翻译 SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
SpecAugment: A Simple Data Augmentation Method for Automatic Speech RecognitionDaniel S. Park∗, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin D. Cubuk, Quoc V. LeGoogle Brain{danielsp...
2020-02-01 23:55:48 4021
原创 探索说话人识别数据集时要注意的问题
Note:In the speaker id community the words “train”, “test” and “development”are used in a different sense from in the speech recognition community. Inspeaker-id land, the “development” data is the...
2019-12-20 17:09:11 368
翻译 2018--Analysis of Length Normalization in End-to-End Speaker Verification System
Weicheng Cai2, Jinkun Chen2, Ming Li11Data Science Research Center, Duke Kunshan University, Kunshan, China2School of Electronics and Information Technology, Sun Yat-sen University, Guangzhou, China...
2019-12-20 16:33:21 376
翻译 2019-utterance-level end-to-end language identification using attention-based cnn-blstm--icassp 2019
Weicheng Cai1,2,Danwei Cai1, Shen Huang3and Ming Li1∗1Data Science Research Center, Duke Kunshan University, Kunshan, China2School of Electronics and Information Technology, Sun Yat-sen University, ...
2019-12-20 16:09:41 330
翻译 2019-SPEAKER RECOGNITION FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORS
SPEAKER RECOGNITION FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORSDavid Snyder , Daniel Garcia-Romero, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev KhudanpurCenter for Language and Speech Proce...
2019-12-20 12:35:37 496
语音识别大神dan-povery介绍kaldi的ppt.rar
2019-10-30
An Attentive Survey of Attention Models注意力模型调查报告(有翻译)
2019-04-17
王赟大神的全部论文
2019-04-17
A Neural Attention Model for Abstractive Sentence Summarization
2019-04-17
说话人识别数据集--Spoken Speaker Identification based on Gaussian Mixture Models-2
2019-03-22
说话人识别数据集--Spoken Speaker Identification based on Gaussian Mixture Models-1
2019-03-22
使用GMMs进行语音性别检测
2019-03-22
语音识别数据集-speech analytic--性别识别--Voice Gender Detection using GMMs-2
2019-03-22
语音识别数据集-speech analytic--性别识别--Voice Gender Detection using GMMs-1
2019-03-22
黄勇-知了课堂-flask-40-50课源代码及数据库文件
2018-12-11
knn算法--整理byGraceyan
2018-10-21
数字0到9和英文大小写字母手写识别训练集
2018-10-21
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人