Kaldi’s PLDA implementation is based on [1], the so-called two-covariance PLDA by [2]. The authors derive clean update formulas for the EM training and document the derivation in a detailed comment in the source code. Here we add some explanations to make the derivation easier to follow.
A pdf version of this note can be found here
1. Background
Recall that PLDA assumes a two-stage generative process:
1) generate the class center according to
$$x \sim \mathcal{N}(\mu,\ \Phi_b),$$
2) then, generate the observed data by:
$$z \sim \mathcal{N}(x,\ \Phi_w).$$
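As a concrete illustration, here is a minimal numpy sketch of this two-stage sampling process. The dimension, covariances, and sample count are arbitrary toy choices, not anything taken from Kaldi.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 4                                  # feature dimension (arbitrary)
mu = np.zeros(dim)                       # global mean
Phi_b = np.diag([4.0, 3.0, 2.0, 1.0])    # between-class covariance (toy value)
Phi_w = 0.5 * np.eye(dim)                # within-class covariance (toy value)

# 1) draw a class center x ~ N(mu, Phi_b)
x = rng.multivariate_normal(mu, Phi_b)

# 2) draw n observed samples of that class, z ~ N(x, Phi_w)
n = 10
z = rng.multivariate_normal(x, Phi_w, size=n)   # shape (n, dim)
```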
Here, $\mu$ is estimated by the global mean value:
$$\mu = \frac{1}{N} \sum_{k=1}^{K} \sum_{i=1}^{n_k} z_{ki},$$
where $z_{ki}$ denotes the $i$-th sample of the $k$-th class, $n_k$ is the number of samples in class $k$, and $N = \sum_{k} n_k$ is the total number of samples.
Now let’s turn to the estimation of $\Phi_b$ and $\Phi_w$.
Note that, as $\mu$ is fixed, we can remove it from all samples. Hereafter, we assume all samples have been pre-processed by subtracting $\mu$ from them.
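In code, the mean estimate and the centering step amount to the following; `data` here is a toy stand-in for real per-class feature matrices.

```python
import numpy as np

rng = np.random.default_rng(0)
# toy data: a list of (n_k, dim) arrays, one per class
data = [rng.normal(size=(n_k, 4)) for n_k in (5, 8, 12)]

all_samples = np.vstack(data)            # every z_ki stacked: shape (N, dim)
mu_hat = all_samples.mean(axis=0)        # global mean over all N samples
data = [z_k - mu_hat for z_k in data]    # subtract mu from every sample
```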
The prior distribution of an arbitrary (centered) sample $z$ is:
$$z \sim \mathcal{N}(0,\ \Phi_b + \Phi_w).$$
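This follows directly from the generative model: a centered sample decomposes as
$$z = x + \epsilon, \qquad x \sim \mathcal{N}(0,\ \Phi_b),\quad \epsilon \sim \mathcal{N}(0,\ \Phi_w),$$
with $x$ and $\epsilon$ independent, so the covariances of the two Gaussian terms add.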
Let’s suppose the mean of a particular class is $m$, and that this class has $n$ examples. Then
$$m = \frac{1}{n} \sum_{i=1}^{n} z_i \sim \mathcal{N}\!\left(0,\ \Phi_b + \frac{\Phi_w}{n}\right),$$
i.e. $m$ is Gaussian-distributed with zero mean and covariance equal to the between-class covariance plus $1/n$ times the within-class covariance. Now, $m$ is observed (it is the average of the observed samples of the class).
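To see where the $\Phi_w/n$ term comes from, average the per-sample decomposition $z_i = x + \epsilon_i$ over the class:
$$m = \frac{1}{n}\sum_{i=1}^{n} z_i = x + \frac{1}{n}\sum_{i=1}^{n} \epsilon_i.$$
The averaged noise term is Gaussian with covariance $\frac{1}{n^2} \cdot n\,\Phi_w = \Phi_w/n$ and is independent of $x$, so the two covariances add.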
2. EM
We’re doing an E-M procedure where we treat $m$ as the sum of two hidden variables:
$$m = x + y,$$
where $x \sim \mathcal{N}(0,\ \Phi_b)$ is the class-center part and $y \sim \mathcal{N}(0,\ \Phi_w/n)$ is the averaged within-class noise.
The distribution of $x$ will contribute to the stats of $\Phi_b$, and that of $y$ to the stats of $\Phi_w$.
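To make the decomposition concrete, here is a rough numpy sketch of the per-class E-step it implies: given the observed class mean $m$, compute the Gaussian posteriors of $x$ and $y$, whose second moments are the stats that feed the $\Phi_b$ and $\Phi_w$ updates. This is only a sketch under the model above, not Kaldi’s actual implementation (which accumulates these stats over all classes in a simultaneously diagonalized basis); the function and variable names are made up for the example.

```python
import numpy as np

def split_class_mean(m, n, Phi_b, Phi_w):
    """Posteriors of x and y given the observed class mean m,
    where m = x + y, x ~ N(0, Phi_b), y ~ N(0, Phi_w / n)."""
    A, B = Phi_b, Phi_w / n
    K = A @ np.linalg.inv(A + B)    # gain mapping m to E[x | m]
    x_mean = K @ m                  # E[x | m]
    x_cov = A - K @ A               # Cov[x | m]
    y_mean = m - x_mean             # E[y | m], since y = m - x
    y_cov = x_cov                   # Cov[y | m] = Cov[x | m]
    return x_mean, x_cov, y_mean, y_cov

# Per class, E[x x^T | m] = x_cov + np.outer(x_mean, x_mean) is the stat that
# feeds the Phi_b update; the analogous second moment of y (scaled by n to
# undo the 1/n averaging) feeds the Phi_w update, together with the
# within-class scatter. The exact weighting is in Kaldi's plda.cc comments.
```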