The Proof of Hoeffding's Inequality

最新推荐文章于 2025-04-26 00:08:36 发布

NOOBIMP

最新推荐文章于 2025-04-26 00:08:36 发布

阅读量502

点赞数

文章标签：机器学习人工智能数据分析深度学习概率论

本文链接：https://blog.csdn.net/weixin_43917231/article/details/104262945

版权

Hoeffding’s Inequality:

Suppose $x_1,x_2,……x_N$ are independent random variables, and $x_i∈[a_i, b_i]$ , $i = 1, 2, \dots \dots, N$ ; $\bar{X}$ is the expirical mean of $x_1,x_2,……x_N$ , namely, $\bar{X}=\frac{1}{N}\sum_{i=1}^{N}{x_i}$ , then for any $t > 0$ , the following inequality holds：
$\bar{X}-E(\bar{X})\geq t ]\leq exp\left(-\frac{2N^2t^2}{\sum_{i=1}^{N}{(b_i-a_i)^2}} \right)$
$P[E(\bar{X})- \bar{X}\geq t ]\leq exp\left(-\frac{2N^2t^2}{\sum_{i=1}^{N}{(b_i-a_i)^2}} \right)$

Hoeffding’s lemma:

Suppose $x$ is an random variable, $x \in [a, b]$ , and $E (x) = 0$ , then for any $t > 0$ ，the following inequality holds：
$E(e^{tx})\leq exp\frac{t^2(b-a)^2}{8}$

We prove the lemma first:

Obviously, $f(x)=e^{tx}$ is a convex function, so for any $α \in [0, 1]$ , we have:
$f(αx_1+(1-α)x_2)\le αf(x_1)+(1-α)f(x_2)$
在这里插入图片描述
Let $α=\frac{b-x}{b-a}, \forall x∈[a,b]$ , then $αx_1+(1-α)x_2=x$ , so we have:
$e^{tx} \leq \frac{b-x}{b-a} e^{ta}+\frac{x-a}{b-a} e^{tb}, \forall x \in[a, b]$
take expectations on both sides, and because of $E (x) = 0$ , there is:
$E(e^{tx}) \leq \frac{b}{b-a} e^{ta}-\frac{a}{b-a} e^{tb}$
Try to simplify it.
Let $p=-\frac{a}{b-a}$ , then $right=(1-p)e^{-t(b-a)p}+pe^{-t(b-a)(p-1)}$ ,
let $h = t (b - a)$ , then $right=(1-p)e^{-hp}+pe^{h(1-p)}=e^{-hp}(1-p+pe^h)$ , note the function $L(h)=-hp+ln(1-p+pe^h),h>0$ , now we need to prove $\le \frac{t^{2}(b-a)^{2}}{8}=\frac{h^{2}}{8}$ .

Give two ways to evident this inequality above:

Construct function, $g(h)=L(h)-\frac{h^{2}}{8}=-hp+ln(1-p+pe^h)-\frac{h^{2}}{8},h>0$
Obviously, $g'(0)=0,g''(h)=\frac{(1-p) p e^{h}}{[(1-p)+(p e^{h})]^2}-\frac{1}{4} \le0$ , according to the monotonicity, $g(h)\le0$ , namely $\le \frac{h^{2}}{8}=\frac{t^{2}(b-a)^{2}}{8}$ .
Apply taylor expansion to function $g (h)$ , $g(h)=0+0+\frac{g''(0)}{2!}(h-0)^2+o(h^2)=\frac{1}{2}(\frac{(1-p) p }{[(1-p)+(p )]^2}-\frac{1}{4})h^2 +o(h^2)\le0$ , namely $\le \frac{h^{2}}{8}=\frac{t^{2}(b-a)^{2}}{8}$ .