Information Theory: Proof of the Convexity of Relative Entropy

Proof of the Convexity of Relative Entropy

Definition of relative entropy:
$$D(p\|q) = \sum_{x\in X} p(x)\log_2\frac{p(x)}{q(x)}$$
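As a minimal sketch of the definition, the sum can be evaluated directly for distributions over a finite alphabet. The function name `kl_divergence` and the list-based representation are illustrative choices, and the conventions for zero probabilities ($0\log\frac{0}{q}=0$ and $p\log\frac{p}{0}=+\infty$) are the usual textbook ones, assumed here because the original post does not state them.

```python
import math

def kl_divergence(p, q):
    """Relative entropy D(p||q) in bits; p and q are lists over the same alphabet."""
    total = 0.0
    for px, qx in zip(p, q):
        if px == 0:
            continue            # assumed convention: 0 * log(0/q) = 0
        if qx == 0:
            return math.inf     # assumed convention: p * log(p/0) = +infinity
        total += px * math.log2(px / qx)
    return total

print(kl_divergence([0.5, 0.5], [0.5, 0.5]))  # identical distributions
print(kl_divergence([1.0, 0.0], [0.5, 0.5]))  # point mass against a fair coin
```

Identical distributions give zero, and a point mass measured against a fair coin gives one bit.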
To prove that relative entropy is a convex function of the pair $(p, q)$, we must show that for every $0 \le \lambda \le 1$ the following inequality holds:
$$D(\lambda p_1(x)+(1-\lambda)p_2(x)\,\|\,\lambda q_1(x)+(1-\lambda)q_2(x)) \le \lambda D(p_1(x)\|q_1(x))+(1-\lambda)D(p_2(x)\|q_2(x))$$
We bound the left-hand side using the log-sum inequality. By the definition of relative entropy, the left-hand side is
$$D(\lambda p_1(x)+(1-\lambda)p_2(x)\,\|\,\lambda q_1(x)+(1-\lambda)q_2(x)) = \sum_{x \in X}\bigl(\lambda p_1(x)+(1-\lambda)p_2(x)\bigr)\log_2\frac{\lambda p_1(x)+(1-\lambda)p_2(x)}{\lambda q_1(x)+(1-\lambda)q_2(x)}$$
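For each fixed $x$, treat $\lambda p_1(x)+(1-\lambda)p_2(x)$ as a sum of two terms $a_1 = \lambda p_1(x)$, $a_2 = (1-\lambda)p_2(x)$, and likewise $\lambda q_1(x)+(1-\lambda)q_2(x)$ as the sum of $b_1 = \lambda q_1(x)$, $b_2 = (1-\lambda)q_2(x)$. The log-sum inequality, $\bigl(\sum_i a_i\bigr)\log_2\frac{\sum_i a_i}{\sum_i b_i} \le \sum_i a_i\log_2\frac{a_i}{b_i}$ for nonnegative $a_i$, $b_i$, then gives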
$$\bigl(\lambda p_1(x)+(1-\lambda)p_2(x)\bigr)\log_2\frac{\lambda p_1(x)+(1-\lambda)p_2(x)}{\lambda q_1(x)+(1-\lambda)q_2(x)} \le \lambda p_1(x)\log_2\frac{\lambda p_1(x)}{\lambda q_1(x)}+(1-\lambda)p_2(x)\log_2\frac{(1-\lambda)p_2(x)}{(1-\lambda)q_2(x)}$$
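The pointwise bound above is the two-term case of the log-sum inequality, and it can be sanity-checked numerically. The sketch below is only an illustration: the helper `log_sum_sides` and the sample values for $\lambda$, $p_1(x)$, $p_2(x)$, $q_1(x)$, $q_2(x)$ are made up for the example and do not come from the original post.

```python
import math

def log_sum_sides(a, b):
    """Return (lhs, rhs) of the log-sum inequality for positive sequences a and b."""
    lhs = sum(a) * math.log2(sum(a) / sum(b))
    rhs = sum(ai * math.log2(ai / bi) for ai, bi in zip(a, b))
    return lhs, rhs

# Hypothetical values at one fixed symbol x.
lam = 0.3
p1x, p2x, q1x, q2x = 0.2, 0.7, 0.5, 0.1

a = [lam * p1x, (1 - lam) * p2x]   # a_1, a_2
b = [lam * q1x, (1 - lam) * q2x]   # b_1, b_2
lhs, rhs = log_sum_sides(a, b)
print(lhs <= rhs)  # True
```

Equality would hold only when $a_1/b_1 = a_2/b_2$, that is, when $p_1(x)/q_1(x) = p_2(x)/q_2(x)$.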
Summing both sides of this inequality over all $x \in X$ gives
$$\sum_{x\in X}\bigl(\lambda p_1(x)+(1-\lambda)p_2(x)\bigr)\log_2\frac{\lambda p_1(x)+(1-\lambda)p_2(x)}{\lambda q_1(x)+(1-\lambda)q_2(x)} \le \sum_{x\in X}\lambda p_1(x)\log_2\frac{\lambda p_1(x)}{\lambda q_1(x)}+\sum_{x\in X}(1-\lambda)p_2(x)\log_2\frac{(1-\lambda)p_2(x)}{(1-\lambda)q_2(x)}$$
The factors $\lambda$ and $1-\lambda$ cancel inside the logarithms, so the two sums on the right-hand side equal $\lambda D(p_1(x)\|q_1(x))$ and $(1-\lambda)D(p_2(x)\|q_2(x))$ respectively. Therefore
$$D(\lambda p_1(x)+(1-\lambda)p_2(x)\,\|\,\lambda q_1(x)+(1-\lambda)q_2(x)) \le \lambda D(p_1(x)\|q_1(x))+(1-\lambda)D(p_2(x)\|q_2(x))$$
holds, which completes the proof.
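The convexity inequality can also be spot-checked on random inputs. This is a numerical illustration under stated assumptions, not part of the proof: the helper names (`kl_bits`, `random_distribution`, `mix`, `convexity_gap`), the alphabet size of 4, the seed, and the floating-point tolerance are all arbitrary choices made for this sketch.

```python
import math
import random

def kl_bits(p, q):
    """D(p||q) in bits; assumes q(x) > 0 wherever p(x) > 0."""
    return sum(px * math.log2(px / qx) for px, qx in zip(p, q) if px > 0)

def random_distribution(n, rng):
    """Draw a strictly positive probability vector of length n."""
    weights = [rng.random() + 1e-3 for _ in range(n)]
    total = sum(weights)
    return [w / total for w in weights]

def mix(lam, u, v):
    """Convex combination lam*u + (1-lam)*v, taken componentwise."""
    return [lam * ui + (1 - lam) * vi for ui, vi in zip(u, v)]

def convexity_gap(lam, p1, q1, p2, q2):
    """Right-hand side minus left-hand side of the inequality; should be >= 0."""
    lhs = kl_bits(mix(lam, p1, p2), mix(lam, q1, q2))
    rhs = lam * kl_bits(p1, q1) + (1 - lam) * kl_bits(p2, q2)
    return rhs - lhs

rng = random.Random(0)
gaps = []
for _ in range(1000):
    p1, q1, p2, q2 = (random_distribution(4, rng) for _ in range(4))
    gaps.append(convexity_gap(rng.random(), p1, q1, p2, q2))

# Allow a tiny negative slack for floating-point rounding.
print(min(gaps) >= -1e-12)
```

A non-negative gap in every trial is consistent with the inequality; when both pairs coincide ($p_1 = p_2$, $q_1 = q_2$) the gap collapses to zero up to rounding.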
