Matlab实现Compow协议,基于i-vector的说话人识别

ivectormatlabmsrit-master

README.md

ivector

.gitignore

B-20091230-umt

Bahoke-20130721-qpx

Bareford-20101110-qkl

BruceCouper-20080305-owt

CNJ-20120418-fce

Caatbells-20111019-dqd

CheesyBreed-20080413-rna

ColinBeckingham-20091104-uhr

ColinBeckingham-20100125-nfg

Coren-20141121-bhk

Cristbal-20140516-yeu

DanR-20090223-fys

David-20140127-ifv

DavidL-20091116-kth

DavidSowerby-20130308-ycz

Decent-20110516-egq

Derek-20090226-atf

DermotColeman-20111125-uom

Fandark-20100822-acy

Firefox005-20100820-qku

Flowerbot-20111128-eio

GaylandGGump-20141207-jhc

Graham-20100520-guc

GusSCalabrese-20100331-jxn

H2CO3-20120122-dli

HansTheil-20111211-lth

MFCC

audspec.m

bark2hz.m

cep2spec.m

deltas.m

dolpc.m

fft2barkmx.m

fft2melmx.m

hz2bark.m

hz2mel.m

invaudspec.m

invmelfcc.m

invpostaud.m

invpowspec.m

ispecgram.m

lifter.m

lpc2cep.m

lpc2spec.m

mel2hz.m

melfcc.m

postaud.m

powspec.m

process_options.m

rastafilt.m

rastaplp.m

readhtk.m

spec2cep.m

MSRIT

SAD.m

audspec.m

bark2hz.m

cep2spec.m

cmvn.m

compute_bw_stats.m

compute_eer.m

cosscore.m

deltas.m

demo_gmm_ubm.m

demo_gmm_ubm_artificial.m

demo_ivector_plda.m

demo_ivector_plda_artificial.m

demo_mfcc.m

dolpc.m

energy.m

extract_ivector.m

fea_warping.m

featureExtract.m

fft2barkmx.m

fft2melmx.m

gmm_em.m

gplda_em.m

hamming.m

htkread.m

htkwrite.m

hz2bark.m

hz2mel.m

invaudspec.m

invmelfcc.m

invpostaud.m

invpowspec.m

ispecgram.m

lda.m

length_norm.m

lifter.m

lpc2cep.m

lpc2spec.m

main.m

main2.m

mapAdapt.m

mel2hz.m

melfcc.m

melfcc_demoimp.m

mfcc.m

postaud.m

powspec.m

process_options.m

rastafilt.m

rastaplp.m

readhtk.m

rm_dc_n_dither.m

sample.wav

score_gmm_trials.m

score_gplda_trials.m

spec2cep.m

train_tv_space.m

utilities.m

wcmvn.m

winconv.m

zerocross.m

Mike-20081010-igv

Recordings

Enrollment

enrollmentasdada.wav

enrollmentasdasd.wav

enrollmentnicole.wav

enrollmentpedro.wav

enrollmentpedro1.wav

enrollmentpedro12.wav

enrollmentpedrofin.wav

enrollmentpedromod.wav

enrollmentricardo.wav

enrollmentsasd.wav

Scoring

scoringnictest.wav

scoringnictest2.wav

scoringpedroscore.wav

scoringpedrotest.wav

scoringptest.wav

scoringricardotst.wav

scoringtester.wav

scoringtester1.wav

scoringtester3.wav

T.mat

V.mat

VOICEBOX

Contents.m

TareaLPC.docx

TareaLPC.m

Thumbs.db

activlev.m

activlevg.m

atan2sc.m

autolpc.m

axisenlarge.m

bark2frq.m

berk2prob.m

bitsprec.m

cblabel.m

ccwarpf.m

cent2frq.m

cep2pow.m

cholesky.m

choosenk.m

choosrnk.m

correlogram.m

distchar.m

distchpf.m

disteusq.m

distisar.m

distispf.m

distitar.m

distitpf.m

ditherq.m

dlyapsq.m

dualdiag.m

durbin.m

dypsa.m

enframe.m

entropy.m

erb2frq.m

estnoiseg.m

estnoisem.m

ewgrpdel.m

fig2emf.m

figbolden.m

filtbankm.m

filterbank.m

finishat.m

fopenmkd.m

frac2bin.m

fram2wav.m

frq2bark.m

frq2cent.m

frq2erb.m

frq2mel.m

frq2midi.m

func_lev_durb.m

func_pitch.m

func_vd_msf.m

func_vd_zc.m

fxpefac.m

fxrapt.m

gammabank.m

gausprod.m

gaussmix.m

gaussmixd.m

gaussmixg.m

gaussmixk.m

gaussmixm.m

gaussmixm_cart.m

gaussmixm_cart.mat

gaussmixp.m

gaussmixt.m

glotlf.m

glotros.m

gmmlpdf.m

histndim.m

hostipinfo.m

huffman.m

imagehomog.m

importsii.m

irdct.m

irfft.m

kmeanhar.m

kmeanlbg.m

lambda2rgb.m

lattice.m

ldatrace.m

lin2pcma.m

lin2pcmu.m

lognmpdf.m

logsum.m

lpc_analysis.m

lpcaa2ao.m

lpcaa2dl.m

lpcaa2rf.m

lpcao2rf.m

lpcar2am.m

lpcar2cc.m

lpcar2db.m

lpcar2ff.m

lpcar2fm.m

lpcar2im.m

lpcar2ls.m

lpcar2pf.m

lpcar2pp.m

lpcar2ra.m

lpcar2rf.m

lpcar2rr.m

lpcar2zz.m

lpcauto.m

lpcbwexp.m

lpccc2ar.m

lpccc2cc.m

lpccc2db.m

lpccc2ff.m

lpccc2pf.m

lpcconv.m

lpccovar.m

lpccw2zz.m

lpcdb2pf.m

lpcdl2aa.m

lpcff2pf.m

lpcfq2zz.m

lpcifilt.m

lpcim2ar.m

lpcis2rf.m

lpcla2rf.m

lpclo2rf.m

lpcls2ar.m

lpcpf2cc.m

lpcpf2ff.m

lpcpf2rr.m

lpcpp2cw.m

lpcpp2pz.m

lpcpz2zz.m

lpcra2pf.m

lpcra2pp.m

lpcrand.m

lpcrf2aa.m

lpcrf2ao.m

lpcrf2ar.m

lpcrf2is.m

lpcrf2la.m

lpcrf2lo.m

lpcrf2rr.m

lpcrr2am.m

lpcrr2ar.m

lpcss2zz.m

lpczz2ar.m

lpczz2cc.m

lpczz2ss.m

m2htmlpwd.m

maxfilt.m

maxgauss.m

meansqtf.m

mel2frq.m

melbankm.m

melcepst.m

midi2frq.m

minspane.m

mintrace.m

modspect.m

momfilt.m

mos2pesq.m

nearnonz.m

order10.png

order16.png

order4.png

overlapadd.m

pa10.png

pa16.png

pa4.png

pcma2lin.m

pcmu2lin.m

peak2dquad.m

permutes.m

pesq2mos.m

phon2sone.m

polygonarea.m

polygonwind.m

polygonxline.m

potsband.m

pow2cep.m

prob2berk.m

psycdigit.m

psycest.m

psycestu.m

psychofunc.m

qrabs.m

qrdivide.m

qrdotdiv.m

qrdotmult.m

qrmult.m

qrpermute.m

quadpeak.m

randfilt.m

randiscr.m

randvec.m

rdct.

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
MSR Identity Toolbox: A Matlab Toolbox for Speaker Recognition Research Version 1.0 Seyed Omid Sadjadi, Malcolm Slaney, and Larry Heck Microsoft Research, Conversational Systems Research Center (CSRC) s.omid.sadjadi@gmail.com, {mslaney,larry.heck}@microsoft.com This report serves as a user manual for the tools available in the Microsoft Research (MSR) Identity Toolbox. This toolbox contains a collection of Matlab tools and routines that can be used for research and development in speaker recognition. It provides researchers with a test bed for developing new front-end and back-end techniques, allowing replicable evaluation of new advancements. It will also help newcomers in the field by lowering the “barrier to entry”, enabling them to quickly build baseline systems for their experiments. Although the focus of this toolbox is on speaker recognition, it can also be used for other speech related applications such as language, dialect and accent identification. In recent years, the design of robust and effective speaker recognition algorithms has attracted significant research effort from academic and commercial institutions. Speaker recognition has evolved substantially over the past 40 years; from discrete vector quantization (VQ) based systems to adapted Gaussian mixture model (GMM) solutions, and more recently to factor analysis based Eigenvoice (i-vector) frameworks. The Identity Toolbox provides tools that implement both the conventional GMM-UBM and state-of-the-art i-vector based speaker recognition strategies. A speaker recognition system includes two primary components: a front-end and a back-end. The front-end transforms acoustic waveforms into more compact and less redundant representations called acoustic features. Cepstral features are most often used for speaker recognition. It is practical to only retain the high signal-to-noise ratio (SNR) regions of the waveform, therefore there is also a need for a speech activity detector (SAD) in the fr
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值