Markov Models
Can be viewed as a chain-like, infinite-length Bayes net.
Model components:
state: value of x at a given time
initial distribution
transition model
stationary assumption: the transition probabilities do not change over time
Markov property: past and future are independent given present
Note: in a second-order model, the next state is conditionally independent of the rest of the history given $x_{t-1}, x_{t-2}$.
The Mini-Forward Algorithm
We can sum out and marginalize. Storing the full joint over $j$ steps would take $O(d^j)$ space, so instead we apply the update iteratively:

$$p(w_{i+1})=\sum_{w_i}p(w_i)\,p(w_{i+1}\mid w_i)$$
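The iterative update above can be sketched in Python. The 2-state weather chain and its transition numbers below are hypothetical, chosen only for illustration:

```python
# Mini-forward algorithm: propagate a distribution through a Markov chain.
# Hypothetical 2-state weather chain; the probabilities are illustrative.
states = ["sun", "rain"]
T = {("sun", "sun"): 0.9, ("sun", "rain"): 0.1,
     ("rain", "sun"): 0.3, ("rain", "rain"): 0.7}  # T[(s, s')] = P(s' | s)

def mini_forward(p, steps):
    """Apply p(w_{i+1}) = sum_{w_i} p(w_i) p(w_{i+1} | w_i) repeatedly."""
    for _ in range(steps):
        p = {s2: sum(p[s1] * T[(s1, s2)] for s1 in states) for s2 in states}
    return p

p0 = {"sun": 1.0, "rain": 0.0}   # start certain it is sunny
p10 = mini_forward(p0, 10)       # after 10 steps, close to stationary
```

Each step costs $O(d^2)$ instead of materializing the $O(d^j)$ joint.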
Stationary distribution
The distribution we end up in is independent of the initial distribution; the influence of the initial distribution gets smaller and smaller over time. The stationary distribution can be found by solving the equation $P_\infty(X)=\sum_{x}P(X\mid x)\,P_\infty(x)$.
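For a small chain the equation can be solved in closed form. A sketch for a hypothetical 2-state chain, using the balance equation $\pi(a)\,P(b\mid a)=\pi(b)\,P(a\mid b)$ plus normalization:

```python
# Stationary distribution of a 2-state chain by solving pi = pi T directly.
# For states {a, b} with P(b|a) = p and P(a|b) = q, balance plus
# normalization gives pi(a) = q/(p+q), pi(b) = p/(p+q).
def stationary_two_state(p, q):
    """p = P(switch a->b), q = P(switch b->a); assumes p + q > 0."""
    return {"a": q / (p + q), "b": p / (p + q)}

pi = stationary_two_state(p=0.1, q=0.3)  # pi(a) = 0.75, pi(b) = 0.25
```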
Hidden Markov Models
Observing evidence at each timestep influences the belief distribution over states.
Model components:
$W_i$: state variable
$F_i$: evidence variable
initial distribution
transition model : stationary
emission probabilities: $P(E_t\mid W_t)$, stationary as well
Markov property:
$$w_{i+1}\perp \{w_0,w_1,\cdots,w_{i-1}\}\mid w_i$$
$$f_{i}\perp \{w_0,f_0,w_1,f_1,\cdots,w_{i-1},f_{i-1}\}\mid w_i$$
Property (joint factorization):
$$P(X_1,E_1,\cdots,X_t,E_t)=P(X_1)\,P(E_1\mid X_1)\prod_{i=2}^{t}P(X_i\mid X_{i-1})\,P(E_i\mid X_i)$$
belief distribution:
$$B(W_i)=p(w_i\mid f_1,\cdots,f_i)$$
$$B'(W_i)=p(w_i\mid f_1,\cdots,f_{i-1})$$
The Forward Algorithm
belief distribution: $B(X_t)$
time elapse update
$$B'(W_{i+1})=p(W_{i+1}\mid f_{1:i})=\sum_{w_i}p(W_{i+1}\mid w_i)\,B(w_i)$$
We can carry unnormalized values all the way to the end and normalize just once.
observation update
$$B(W_{i+1})\propto p(f_{i+1}\mid w_{i+1})\,B'(W_{i+1})$$
Filtering
$$B(X_t)=P(X_t\mid e_{1:t})$$
$$B'(X_t)=P(X_t\mid e_{1:t-1})$$
$$B'(X_{t+1})=\sum_{x_t}B(x_t)\,P(X_{t+1}\mid x_t)$$
$$B(X_{t+1})=P(X_{t+1}\mid e_{1:t+1})\propto P(X_{t+1},e_{t+1}\mid e_{1:t})=P(X_{t+1}\mid e_{1:t})\,P(e_{t+1}\mid X_{t+1})=B'(X_{t+1})\,P(e_{t+1}\mid X_{t+1})$$
$$B(X_{t+1})\propto P(e_{t+1}\mid X_{t+1})\sum_{x_t}B(x_t)\,P(X_{t+1}\mid x_t)$$
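The combined update can be written as a short filtering loop. This is a minimal sketch assuming a hypothetical umbrella-world HMM (hidden weather, observed umbrella); the transition and emission numbers are illustrative:

```python
# Filtering: alternate time-elapse and observation updates,
# normalizing after each observation.
states = ["sun", "rain"]
T = {("sun", "sun"): 0.9, ("sun", "rain"): 0.1,
     ("rain", "sun"): 0.3, ("rain", "rain"): 0.7}   # P(x' | x)
E = {("sun", True): 0.2, ("sun", False): 0.8,
     ("rain", True): 0.9, ("rain", False): 0.1}     # P(e | x), e = umbrella?

def filter_step(B, evidence):
    # time elapse: B'(X_{t+1}) = sum_{x_t} B(x_t) P(X_{t+1} | x_t)
    Bp = {s2: sum(B[s1] * T[(s1, s2)] for s1 in states) for s2 in states}
    # observation: B(X_{t+1}) proportional to P(e_{t+1} | X_{t+1}) B'(X_{t+1})
    unnorm = {s: E[(s, evidence)] * Bp[s] for s in states}
    z = sum(unnorm.values())
    return {s: v / z for s, v in unnorm.items()}

B = {"sun": 0.5, "rain": 0.5}
for e in [True, True, False]:   # umbrella, umbrella, no umbrella
    B = filter_step(B, e)
```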
Other tasks for HMMs
- filtering: $P(X_t\mid e_{1:t})$
- prediction: $P(X_{t+k}\mid e_{1:t})$
- smoothing: $P(X_{k}\mid e_{1:t})$ with $k<t$: compute the posterior distribution over a past state, given all evidence up to the present
- most likely explanation: $\arg\max_{x_{1:N}}p(x_{1:N}\mid e_{1:N})=\arg\max_{x_{1:N}}p(x_{1:N},e_{1:N})$: given a sequence of observations, find the sequence of states that is most likely to have generated those observations
Viterbi Algorithm
goal: compute
$$\arg\max_{x_{1:N}}p(x_{1:N}\mid e_{1:N})=\arg\max_{x_{1:N}}p(x_{1:N},e_{1:N})$$
Computing the joint distribution directly takes too much storage, so we use dynamic programming.
$$m_t[x_t]=\max_{x_{1:t-1}}p(x_{1:t},e_{1:t})$$
$$m_t[x_t]=\max_{x_{1:t-1}}p(e_t\mid x_t)\,p(x_t\mid x_{t-1})\,p(x_{1:t-1},e_{1:t-1})=p(e_t\mid x_t)\max_{x_{t-1}}p(x_t\mid x_{t-1})\max_{x_{1:t-2}}p(x_{1:t-1},e_{1:t-1})=p(e_t\mid x_t)\max_{x_{t-1}}p(x_t\mid x_{t-1})\,m_{t-1}[x_{t-1}]$$
polynomial space and time
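The recurrence can be sketched with back-pointers to recover the argmax sequence. The umbrella-style chain below is hypothetical; its numbers are illustrative only:

```python
# Viterbi sketch: m_t[x] = P(e_t | x) * max_{x'} P(x | x') * m_{t-1}[x'],
# keeping a back-pointer to the maximizing predecessor at each step.
states = ["sun", "rain"]
T = {("sun", "sun"): 0.9, ("sun", "rain"): 0.1,
     ("rain", "sun"): 0.3, ("rain", "rain"): 0.7}   # P(x' | x)
E = {("sun", True): 0.2, ("sun", False): 0.8,
     ("rain", True): 0.9, ("rain", False): 0.1}     # P(e | x)
prior = {"sun": 0.5, "rain": 0.5}

def viterbi(evidence):
    m = {s: prior[s] * E[(s, evidence[0])] for s in states}
    back = []                       # back[t][s] = best predecessor of s
    for e in evidence[1:]:
        ptr, new = {}, {}
        for s in states:
            best = max(states, key=lambda sp: m[sp] * T[(sp, s)])
            ptr[s] = best
            new[s] = E[(s, e)] * m[best] * T[(best, s)]
        back.append(ptr)
        m = new
    # backtrack from the best final state
    path = [max(states, key=lambda s: m[s])]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return list(reversed(path))
```

Time is $O(t\,d^2)$ and space $O(t\,d)$ for the back-pointers, rather than exponential in $t$.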
Particle filtering
Simulate the motion of a set of particles through a state graph to approximate the probability distribution. Store a list of $n$ particles, with $n \ll d$ (where $d$ is the size of the state domain), but still enough to estimate the distribution.
Particle filtering simulation
Used when $|X|$ is too large to store $B(X)$ explicitly, e.g. continuous state spaces.
initialization:
There is no fixed requirement; we can sample randomly, uniformly, or from the initial distribution, with $|N| \ll |X|$.
time elapse update:
update according to the transition model
observation update:
Weight each particle by $p(e_i\mid w_i)$ before resampling.
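The three steps can be sketched as a small loop. This reuses the hypothetical umbrella-world chain from earlier; all model numbers are illustrative:

```python
import random

# Particle filtering sketch: represent B(X) with n samples, n << |X|.
random.seed(0)  # fixed seed so the run is reproducible
states = ["sun", "rain"]
T = {("sun", "sun"): 0.9, ("sun", "rain"): 0.1,
     ("rain", "sun"): 0.3, ("rain", "rain"): 0.7}   # P(x' | x)
E = {("sun", True): 0.2, ("sun", False): 0.8,
     ("rain", True): 0.9, ("rain", False): 0.1}     # P(e | x)

def particle_filter_step(particles, evidence):
    # time elapse: move each particle by sampling from the transition model
    moved = [random.choices(states, weights=[T[(p, s)] for s in states])[0]
             for p in particles]
    # observation: weight each particle by p(e | x), then resample
    weights = [E[(x, evidence)] for x in moved]
    return random.choices(moved, weights=weights, k=len(particles))

particles = [random.choice(states) for _ in range(200)]  # initialization
for e in [True, True]:                                   # two umbrella sightings
    particles = particle_filter_step(particles, e)
est_rain = particles.count("rain") / len(particles)      # approximate B(rain)
```

The particle counts approximate the exact filtered belief; accuracy improves as the number of particles grows.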