2.1.1.
A = at least one child is a boy.

$$|A|=3,\qquad P=\frac{2}{3}$$
2.1.2.
First child is a boy.

$$P=\frac{1}{2}$$
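Both answers are easy to sanity-check with a Monte Carlo sketch (illustrative only; child sexes are modeled as independent fair coin flips, and the seed and sample size are arbitrary):

```python
import random

random.seed(0)
N = 100_000
families = [(random.choice("BG"), random.choice("BG")) for _ in range(N)]

# 2.1.1: condition on "at least one child is a boy"
at_least_one_boy = [f for f in families if "B" in f]
p_other_girl = sum("G" in f for f in at_least_one_boy) / len(at_least_one_boy)

# 2.1.2: condition on "the first child is a boy"
first_boy = [f for f in families if f[0] == "B"]
p_second_girl = sum(f[1] == "G" for f in first_boy) / len(first_boy)

print(p_other_girl)   # ≈ 2/3
print(p_second_girl)  # ≈ 1/2
```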
2.3:

$$\begin{aligned}
\mathrm{var}[x+y]&=E[(x+y)^2]-E^2[x+y]\\
&=E[x^2]+E[y^2]+2E[xy]-(E[x]+E[y])^2\\
&=\mathrm{var}[x]+\mathrm{var}[y]+2(E[xy]-E[x]E[y])\\
&=\mathrm{var}[x]+\mathrm{var}[y]+2\,\mathrm{cov}(x,y)
\end{aligned}$$
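The identity also holds exactly for sample moments (same normalization throughout), so it can be checked numerically. A minimal sketch with a made-up correlated pair (seed and sample size arbitrary):

```python
import random

random.seed(1)
N = 100_000
# made-up correlated pair: y = x + independent noise, so cov(x, y) > 0
xs = [random.gauss(0, 1) for _ in range(N)]
ys = [x + random.gauss(0, 1) for x in xs]

def mean(v):
    return sum(v) / len(v)

def var(v):
    m = mean(v)
    return sum((a - m) ** 2 for a in v) / len(v)

def cov(u, v):
    mu, mv = mean(u), mean(v)
    return sum((a - mu) * (b - mv) for a, b in zip(u, v)) / len(u)

lhs = var([x + y for x, y in zip(xs, ys)])
rhs = var(xs) + var(ys) + 2 * cov(xs, ys)
print(abs(lhs - rhs) < 1e-6)  # True: identity is exact up to float rounding
```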
2.4

$$P(ill\mid positive)=0.99,\qquad P(ill)=10^{-4}$$

The answer is 0.99.
Text example:

$$p(positive\mid ill)=0.8,\quad p(ill)=0.004,\quad p(positive\mid \neg ill)=0.1$$

$$p(positive)=p(positive\mid ill)\,p(ill)+p(positive\mid \neg ill)\,p(\neg ill)=0.8\times 0.004+0.1\times(1-0.004)=0.1028$$

$$p(ill\mid positive)=\frac{p(positive\mid ill)\,p(ill)}{p(positive)}=\frac{0.8\times 0.004}{0.1028}\approx 0.031$$
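The arithmetic of the worked example takes a few lines to verify (the numbers are the ones given above):

```python
# numbers from the worked example
p_pos_given_ill = 0.8      # sensitivity
p_ill = 0.004              # prevalence
p_pos_given_healthy = 0.1  # false-positive rate

# total probability of a positive test
p_pos = p_pos_given_ill * p_ill + p_pos_given_healthy * (1 - p_ill)
# Bayes' rule
p_ill_given_pos = p_pos_given_ill * p_ill / p_pos

print(round(p_pos, 4))            # 0.1028
print(round(p_ill_given_pos, 3))  # 0.031
```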
2.5
A = prize behind first picked door
B = prize behind final picked door (always switching)

$$P(A)=1/3,\qquad P(\neg A)=2/3$$

$$P(B)=P(B\mid A)P(A)+P(B\mid \neg A)P(\neg A)=0\cdot\frac{1}{3}+1\cdot\frac{2}{3}=\frac{2}{3}$$
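The 2/3 win rate for switching matches a direct simulation (a sketch assuming the standard Monty Hall rules: the host always opens a non-picked, non-prize door):

```python
import random

random.seed(0)
N = 100_000
switch_wins = 0
for _ in range(N):
    prize = random.randrange(3)
    pick = random.randrange(3)
    # the host opens a door that is neither the pick nor the prize
    opened = next(d for d in range(3) if d != pick and d != prize)
    # switching means taking the remaining closed door
    final = next(d for d in range(3) if d != pick and d != opened)
    switch_wins += final == prize

print(switch_wins / N)  # ≈ 2/3
```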
2.6
1.

$$P(H\mid e_1,e_2)=\frac{P(e_1,e_2\mid H)P(H)}{P(e_1,e_2)}$$

The answer is (ii).
2. With conditional independence,

$$P(e_1,e_2\mid H)=P(e_1\mid H)P(e_2\mid H)$$

so (i) and (ii) are sufficient. Since

$$P(e_1,e_2)=\sum_H P(H)P(e_1\mid H)P(e_2\mid H)$$

(iii) is also sufficient.
2.7
Example from Wikipedia: let $x$ and $y$ be independent fair bits, $x\sim \mathrm{Bernoulli}(1/2)$, $y\sim \mathrm{Bernoulli}(1/2)$, and $z=x\ \mathrm{xor}\ y$. Then $x,y,z$ are pairwise independent but not mutually independent.
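The claim can be verified by enumerating the four equally likely outcomes (a small sketch; indices 0, 1, 2 stand for $x$, $y$, $z$):

```python
from itertools import product

# the four equally likely outcomes of two fair bits x, y, with z = x xor y
outcomes = [(x, y, x ^ y) for x, y in product((0, 1), repeat=2)]

def p(pred):
    return sum(pred(o) for o in outcomes) / len(outcomes)

# pairwise independence: P(a=1, b=1) = P(a=1) P(b=1) for every pair
for i, j in [(0, 1), (0, 2), (1, 2)]:
    joint = p(lambda o: o[i] == 1 and o[j] == 1)
    assert joint == p(lambda o: o[i] == 1) * p(lambda o: o[j] == 1)

# but not mutually independent: P(x=1, y=1, z=1) is 0, not 1/8
triple = p(lambda o: o == (1, 1, 1))
print(triple)  # 0.0
```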
2.8

$$x\perp y\mid z \iff p(x,y\mid z)=g(x,z)h(y,z)$$

"$\Rightarrow$" is trivial: take $g(x,z)=p(x\mid z)$ and $h(y,z)=p(y\mid z)$.
Conversely, suppose $p(x,y\mid z)=g(x,z)h(y,z)$. Then

$$\begin{aligned}
p(x\mid z)&=\sum_y p(x,y\mid z)=g(x,z)\sum_y h(y,z)\\
p(y\mid z)&=\sum_x p(x,y\mid z)=h(y,z)\sum_x g(x,z)\\
1&=\sum_{x,y}p(x,y\mid z)=\sum_x g(x,z)\sum_y h(y,z)
\end{aligned}$$

Then

$$p(x\mid z)p(y\mid z)=g(x,z)h(y,z)\sum_x g(x,z)\sum_y h(y,z)=g(x,z)h(y,z)=p(x,y\mid z)$$
2.9
(i) true
(ii) false
2.10
With $y=1/x$ and $x\sim \mathrm{Ga}(a,b)$:

$$\begin{aligned}
p(y)&=p(x)\left|\frac{dx}{dy}\right|,\qquad \frac{dx}{dy}=-\frac{1}{y^2}\\
p(y)&=\frac{b^a}{\Gamma(a)}\left(\frac{1}{y}\right)^{a-1}e^{-b/y}\,\frac{1}{y^2}
=\frac{b^a}{\Gamma(a)}y^{-(a+1)}e^{-b/y}=\mathrm{IG}(y\mid a,b)
\end{aligned}$$
2.11
Switch to polar coordinates and integrate over $\theta$ first:

$$\begin{aligned}
Z^2&=\int_0^{2\pi}d\theta\int_0^{\infty}r\exp\left(-\frac{r^2}{2\sigma^2}\right)dr\\
&=2\pi\int_0^{\infty}r\exp\left(-\frac{r^2}{2\sigma^2}\right)dr\\
&=2\pi\left[-\sigma^2\exp\left(-\frac{r^2}{2\sigma^2}\right)\right]_0^{\infty}\\
&=2\pi\sigma^2
\end{aligned}$$

So $Z^2=2\pi\sigma^2$, then $Z=\sigma\sqrt{2\pi}$.
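The normalization constant can be checked by direct numerical integration (a rough Riemann-sum sketch; the value of $\sigma$ and the grid are arbitrary):

```python
from math import exp, pi, sqrt

# Riemann-sum check of the Gaussian normalization constant Z = sigma*sqrt(2*pi)
sigma = 1.5
dx = 0.001
Z = sum(exp(-(i * dx) ** 2 / (2 * sigma ** 2)) * dx for i in range(-20000, 20000))
print(abs(Z - sigma * sqrt(2 * pi)) < 1e-3)  # True
```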
2.12

$$\begin{aligned}
I(X,Y)&=\sum_{x,y}p(x,y)\log\frac{p(x,y)}{p(x)p(y)}\\
&=\sum_{x,y}p(x,y)\log\frac{p(x\mid y)}{p(x)}\\
&=\sum_{x,y}p(x,y)\left(\log p(x\mid y)-\log p(x)\right)\\
&=-H(x\mid y)-\sum_x \log p(x)\Big(\sum_y p(x,y)\Big)\\
&=-H(x\mid y)+H(x)
\end{aligned}$$
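The identity $I(X,Y)=H(x)-H(x\mid y)$ can be checked on any small discrete joint distribution (the 2×2 table below is made up purely for illustration):

```python
from math import log2

# a made-up 2x2 joint distribution over x, y
p = {(0, 0): 0.3, (0, 1): 0.2, (1, 0): 0.1, (1, 1): 0.4}

# marginals
px = {x: sum(v for (a, b), v in p.items() if a == x) for x in (0, 1)}
py = {y: sum(v for (a, b), v in p.items() if b == y) for y in (0, 1)}

# mutual information from the definition
I = sum(v * log2(v / (px[a] * py[b])) for (a, b), v in p.items())

# H(x) and H(x|y) = -sum p(x,y) log p(x|y)
Hx = -sum(v * log2(v) for v in px.values())
Hx_given_y = -sum(v * log2(v / py[b]) for (a, b), v in p.items())

print(abs(I - (Hx - Hx_given_y)) < 1e-9)  # True
```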
2.13

$$\begin{aligned}
I(X,Y)&=H(x)-H(x\mid y)\\
&=H(x)+H(y)-H(x,y)\\
&=\log 2\pi e\sigma^2-\frac{1}{2}\log\left((2\pi e)^2\sigma^4(1-\rho^2)\right)\\
&=-\frac{1}{2}\log(1-\rho^2)
\end{aligned}$$

For $\rho=0$:

$$I(x,y)=\log 2\pi e\sigma^2-\frac{1}{2}\log (2\pi e)^2\sigma^4=0$$

When $\mathrm{Cov}(x,y)=0$ the mutual information vanishes: knowing $x$ gives no information about $y$, and vice versa.

For $\rho=\pm 1$:

$$I(x,y)=\infty$$

All information conveyed by $x$ is shared with $y$: knowing $x$ determines the value of $y$, and vice versa.
2.14
(i) Obvious.
(ii) It is easy to prove the non-negativity of entropy, $H(x)\geq 0$. And $I(x,y)\geq 0$ is obvious from its formula, so

$$I(x,y)\geq 0\rightarrow r\geq 0$$

(iii) $I(x,y)=0$: $x$ and $y$ are independent.
(iv) $I(x,y)=1$: $x$ is fully determined by $y$.
2.15
θ
=
arg min
θ
K
L
(
P
e
m
p
∣
∣
q
(
θ
)
)
=
arg min
θ
E
(
P
e
m
p
log
P
e
m
p
q
(
θ
)
)
=
arg min
θ
E
(
P
e
m
p
(
log
P
e
m
p
−
log
q
(
θ
)
)
)
=
H
e
m
p
−
arg max
θ
E
(
P
e
m
p
log
q
(
θ
)
)
=
arg max
θ
E
(
P
e
m
p
log
q
(
θ
)
)
=
arg max
θ
∑
x
∈
D
a
t
a
s
e
t
log
q
(
x
;
θ
)
\begin{align} \theta=&\argmin_\theta{KL(P_{emp}||q(\theta))}\\ =&\argmin_\theta{E(P_{emp}\log \frac{P_{emp}}{q(\theta)})}\\ =&\argmin_\theta{E(P_{emp}(\log{P_{emp}}-\log{q(\theta)}) )}\\ =&H_{emp}-\argmax_\theta{E(P_{emp}\log{q(\theta)} )}\\ =&\argmax_\theta{E(P_{emp}\log{q(\theta)} )}\\ =&\argmax_\theta{\sum_{x\in Dataset}\log{q(x;\theta)}} \end{align}
θ======θargminKL(Pemp∣∣q(θ))θargminE(Pemplogq(θ)Pemp)θargminE(Pemp(logPemp−logq(θ)))Hemp−θargmaxE(Pemplogq(θ))θargmaxE(Pemplogq(θ))θargmaxx∈Dataset∑logq(x;θ)
2.16
pdf of the beta distribution:

$$\frac{x^{\alpha-1}(1-x)^{\beta-1}}{B(\alpha,\beta)}$$

mode:

$$\frac{d}{dx}\frac{x^{\alpha-1}(1-x)^{\beta-1}}{B(\alpha,\beta)}=0
\quad\Rightarrow\quad
x=\frac{\alpha-1}{\alpha+\beta-2}$$

moments:

$$E(x^N)=\frac{1}{B(\alpha,\beta)}\int_0^1 x^{\alpha+N-1}(1-x)^{\beta-1}dx=\frac{B(\alpha+N,\beta)}{B(\alpha,\beta)}$$

mean:

$$E(x)=\frac{B(\alpha+1,\beta)}{B(\alpha,\beta)}=\frac{\alpha}{\alpha+\beta}$$

var:

$$E(x^2)-E^2(x)=\frac{\alpha(\alpha+1)}{(\alpha+\beta)(\alpha+\beta+1)}-\frac{\alpha^2}{(\alpha+\beta)^2}=\frac{\alpha\beta}{(\alpha+\beta)^2(\alpha+\beta+1)}$$
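The mean and variance formulas can be verified numerically via $B(\alpha,\beta)=\Gamma(\alpha)\Gamma(\beta)/\Gamma(\alpha+\beta)$ (the values $\alpha=3$, $\beta=5$ are arbitrary):

```python
from math import gamma

def B(a, b):
    # Beta function via gamma functions
    return gamma(a) * gamma(b) / gamma(a + b)

a, b = 3.0, 5.0
mean = B(a + 1, b) / B(a, b)    # E(x)   = B(a+1, b) / B(a, b)
second = B(a + 2, b) / B(a, b)  # E(x^2) = B(a+2, b) / B(a, b)
var = second - mean ** 2

print(abs(mean - a / (a + b)) < 1e-9)                          # True
print(abs(var - a * b / ((a + b) ** 2 * (a + b + 1))) < 1e-9)  # True
```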
2.17
The leftmost point's coordinate is $f(x,y)=\min(x,y)$.

$$\begin{aligned}
p(f(x,y)=m)&=p(x=m,y\geq m)+p(x\geq m,y=m)=2(1-m)\\
E(m)&=\int_0^1 2m(1-m)\,dm\\
&=\int_0^1 (2m-2m^2)\,dm\\
&=\left.m^2-\frac{2}{3}m^3\right|_0^1\\
&=\frac{1}{3}
\end{aligned}$$
The problem can also be solved in 3-d coordinates: the solid is a cone with height 1 and base area 1, whose volume is 1/3.
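A quick simulation agrees with $E(m)=1/3$ (a sketch only; seed and sample size arbitrary):

```python
import random

random.seed(0)
N = 200_000
# minimum of two independent U(0,1) draws
m = [min(random.random(), random.random()) for _ in range(N)]
print(sum(m) / len(m))  # ≈ 1/3
```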