Kullback–Leibler ($\mathrm{KL}$) loss
For discrete probability distributions $F(x)$ and $G(x)$ on $n$ support points $x_1,\dots,x_n$, the Kullback–Leibler ($\mathrm{KL}$) loss from $F(x)$ to $G(x)$ is defined[5] to be

$$\mathrm{KL}\{F(x)\|G(x)\} = \sum_{i=1}^{n} F(x_i)\log\frac{F(x_i)}{G(x_i)}.$$
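The discrete definition translates directly into code. Below is a minimal sketch (the function name and example vectors are illustrative, not from the text), assuming the two distributions are given as probability vectors over the same $n$ support points:

```python
import numpy as np

def kl_discrete(F, G):
    """KL loss from F to G: sum of F(x_i) * log(F(x_i) / G(x_i))."""
    F = np.asarray(F, dtype=float)
    G = np.asarray(G, dtype=float)
    # Terms with F(x_i) = 0 contribute nothing, by the usual
    # convention 0 * log(0 / g) = 0.
    mask = F > 0
    return float(np.sum(F[mask] * np.log(F[mask] / G[mask])))

print(kl_discrete([0.5, 0.3, 0.2], [0.4, 0.4, 0.2]))
```

Note that the loss is zero when the two vectors coincide and, as discussed below, positive otherwise.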
For distributions $F(x)$ and $G(x)$ of a continuous random variable, the Kullback–Leibler ($\mathrm{KL}$) loss is defined to be

$$\mathrm{KL}\{F(x)\|G(x)\} = \int_{-\infty}^{\infty} f(x)\log\frac{f(x)}{g(x)}\,dx$$
where $f(x)$ and $g(x)$ are the density functions of $F(x)$ and $G(x)$, respectively.
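For a continuous pair the integral can be evaluated numerically. A sketch, assuming two univariate normal densities (the parameters are illustrative); for this family a closed form is also known, which gives a check on the quadrature:

```python
import numpy as np
from scipy.integrate import quad

def normal_pdf(x, mu, sigma):
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

def kl_continuous(mu0, s0, mu1, s1):
    """KL{F||G} by numerical integration of f(x) * log(f(x)/g(x))."""
    def integrand(x):
        f = normal_pdf(x, mu0, s0)
        g = normal_pdf(x, mu1, s1)
        # Guard against underflow of f far in the tails: the term is 0 there.
        return f * np.log(f / g) if f > 0 else 0.0
    val, _ = quad(integrand, -np.inf, np.inf)
    return val

def kl_normal_closed(mu0, s0, mu1, s1):
    """Known closed form for two univariate normals, for comparison."""
    return np.log(s1 / s0) + (s0**2 + (mu0 - mu1) ** 2) / (2 * s1**2) - 0.5

print(kl_continuous(0.0, 1.0, 1.0, 2.0))
print(kl_normal_closed(0.0, 1.0, 1.0, 2.0))
```

The two values agree to quadrature precision, which is a useful sanity check when the closed form is available.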
The Kullback–Leibler loss is always non-negative, that is,

$$\mathrm{KL}\{F(x)\|G(x)\} \geqslant 0.$$
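Non-negativity can be illustrated empirically (a spot check, not a proof): draw random probability vectors and evaluate the discrete sum directly.

```python
import numpy as np

# Draw random pairs of probability mass functions from a Dirichlet
# distribution and record the KL loss for each pair.
rng = np.random.default_rng(0)
kls = []
for _ in range(1000):
    F = rng.dirichlet(np.ones(5))
    G = rng.dirichlet(np.ones(5))
    kls.append(float(np.sum(F * np.log(F / G))))

# Even the smallest observed value is non-negative.
print(min(kls))
```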
The Kullback–Leibler ($\mathrm{KL}$) loss $\mathrm{KL}\{F(x)\|G(x)\}$ is convex in the pair of probability mass functions $(f,g)$, i.e. if $(f_1,g_1)$ and $(f_2,g_2)$ are two pairs of probability mass functions, then

$$\mathrm{KL}\{\lambda f_{1}+(1-\lambda)f_{2}\,\|\,\lambda g_{1}+(1-\lambda)g_{2}\} \leq \lambda\,\mathrm{KL}(f_{1}\|g_{1})+(1-\lambda)\,\mathrm{KL}(f_{2}\|g_{2})$$

for $0\leq\lambda\leq 1$.
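The convexity inequality can likewise be checked numerically on random probability mass functions (again a spot check under illustrative inputs, not a proof):

```python
import numpy as np

rng = np.random.default_rng(1)

def kl(p, q):
    """Discrete KL loss between two strictly positive probability vectors."""
    return float(np.sum(p * np.log(p / q)))

# Two random pairs of probability mass functions on 4 points.
f1, g1 = rng.dirichlet(np.ones(4)), rng.dirichlet(np.ones(4))
f2, g2 = rng.dirichlet(np.ones(4)), rng.dirichlet(np.ones(4))

for lam in np.linspace(0.0, 1.0, 11):
    lhs = kl(lam * f1 + (1 - lam) * f2, lam * g1 + (1 - lam) * g2)
    rhs = lam * kl(f1, g1) + (1 - lam) * kl(f2, g2)
    # The mixture's KL loss never exceeds the mixed KL losses
    # (small tolerance for floating-point error).
    assert lhs <= rhs + 1e-12

print("convexity inequality holds for all sampled lambda")
```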
Example: multivariate normal distributions

Suppose that we have two multivariate normal distributions, with means $\mu_0, \mu_1$ and (nonsingular) covariance matrices $\Sigma_0, \Sigma_1$. If the two distributions have the same dimension $k$, then the Kullback–Leibler ($\mathrm{KL}$) loss between the distributions is as follows:

$$\mathrm{KL}(\mathcal{N}_{0}\|\mathcal{N}_{1}) = \frac{1}{2}\left\{\mathrm{tr}\left(\Sigma_{1}^{-1}\Sigma_{0}\right)+\left(\mu_{1}-\mu_{0}\right)^{\mathrm{T}}\Sigma_{1}^{-1}\left(\mu_{1}-\mu_{0}\right)-k+\log\left(\frac{\det\Sigma_{1}}{\det\Sigma_{0}}\right)\right\}.$$
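The closed form above is a direct translation into linear algebra. A sketch in numpy, with illustrative parameters (for small $k$; a Cholesky-based version would be preferred for large or ill-conditioned covariances):

```python
import numpy as np

def kl_mvn(mu0, S0, mu1, S1):
    """Closed-form KL loss between N(mu0, S0) and N(mu1, S1),
    term by term as in the formula above."""
    k = mu0.shape[0]
    S1_inv = np.linalg.inv(S1)
    diff = mu1 - mu0
    return 0.5 * (np.trace(S1_inv @ S0)              # tr(S1^{-1} S0)
                  + diff @ S1_inv @ diff             # quadratic term
                  - k                                # dimension
                  + np.log(np.linalg.det(S1) / np.linalg.det(S0)))

mu0, S0 = np.array([0.0, 0.0]), np.eye(2)
mu1, S1 = np.array([1.0, 0.0]), 2.0 * np.eye(2)
print(kl_mvn(mu0, S0, mu1, S1))
```

With these inputs the four terms are $\mathrm{tr}(0.5 I)=1$, quadratic term $0.5$, $-k=-2$, and $\log 4$, giving $\tfrac{1}{2}(\log 4 - 0.5)$.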