MCMC算法之模拟退火（Simulated annealing）算法（Matlab代码）

最新推荐文章于 2021-02-01 17:59:43 发布

Eric2016_Lv

最新推荐文章于 2021-02-01 17:59:43 发布

阅读量3.8k

点赞数 1

分类专栏：机器学习算法 Matlab 数据挖掘

本文链接：https://blog.csdn.net/Eric2016_Lv/article/details/79701646

版权

机器学习同时被 3 个专栏收录

64 篇文章 19 订阅

订阅专栏

数据挖掘

26 篇文章 1 订阅

订阅专栏

算法

23 篇文章 0 订阅

订阅专栏

1. Introduction: Simulated annealing for global optimization：

Instead of wanting to approximate $p(x)$ , we want to find the global maximum. For example, if $p(x)$ is the likelihood or posterior distribution, we often want to compute the ML and maximum a posteriori (MAP) estimates. As mentioned earlier, we could run a Markov chain of invariant distribution $p(x)$ and estimate the global mode by

x^= a r g m i n x (i), i = 1, \dots, N p (x (i))

$\hat{x}=\mathrm{arg~min}_{x^{(i)},~i=1,\cdots,N}~p(x^{(i)})$
This method is inefficient because the random samples only rarely come from the vicinity of the mode. Unless the distribution has large probability mass around the mode, computing resources will be wasted exploring areas of no interest.
A more principled strategy is to adopt simulated annealing. This technique involves simulating a non-homogeneous Markov chain whose invariant distribution at iteration

i i $i$ is no longer equal to

p (x)

$p(x)$ , but to

p i (x) \propto p 1 / T i (x)

$p_i (x) ∝ p^{1/T_i}(x)$
where

Ti T i $T_i$ is a decreasing cooling schedule with

limi→∞Ti=0 l i m i → ∞ T i = 0 $\mathrm{lim}_{i→∞} T_i = 0$ .
The reason for doing this is that, under weak regularity assumptions on

p(x) p ( x ) $p(x)$ ,

p∞(x) p ∞ ( x ) $p^∞(x)$ is a probability density that concentrates itself on the set of global maxima of

p(x) p ( x ) $~p(x)$ .
Didi Lv: $p^∞(x)$ make the smaller modes become to zero, and the maximum mode becomes $∞$ .

2. Problem:

Running the Simulated annealing algorithm with a Gaussian proposal distribution $q(x^*| x(i )) = N(x(i ), 10)$ and a bimodal target distribution $p(x) ∝ 0.3~exp(−0.2x^2) +0.7~exp(−0.2(x − 10)^2)$ for 5000 iterations with $T_i = (C~ \mathrm{ln}(i +T_0))^{−1}$ , where $C$ and $T_0$ are problem-dependent. In our test, we set $C=1, ~ T_0=1$ .

3. Cases:

Case 1: General acceptance probability for Simulated annealing:

Acceptance probability: $A(x, x^) = \mathrm{min}\{1, \frac{p(x^)^{\frac{1}{T_i}}q(x | x^)}{p(x)^{\frac{1}{T_i}}q(x^|x)}\}$

Case 2: Metropolis algorithm for Simulated annealing:

Acceptance probability: $q(x^| x) = q(x| x^)\Rightarrow A(x, x^) = \mathrm{min}\{1, \frac{p(x^)^{\frac{1}{T_i}}}{p(x)^{\frac{1}{T_i}}}\}$

Case 3: Independent sampler for Simulated annealing:

Acceptance probability: $q(x^| x) = q(x^)\Rightarrow A(x, x^) = \mathrm{min}\{1, \frac{p(x^)^{\frac{1}{T_i}}q(x)}{p(x)^{\frac{1}{T_i}}q(x^*)}\}$

4. Pseudo code：

这里写图片描述

5. Matlab code:

% Metropolis(-Hastings) algorithm
% true (target) pdf is p(x) where we know it but can't sample data. 
% proposal (sample) pdf is q(x*|x)=N(x,10) where we can sample.
%% 
clc
clear; 

X(1)=0; 
N=5e3;
p = @(x) 0.3*exp(-0.2*x.^2) + 0.7*exp(-0.2*(x-10).^2); 
C = 1; T_0 = 2;
T = @(x) 1.0/(C*log(x+T_0));
dx=0.5; xx=-10:dx:20; fp=p(xx); plot(xx,fp) % plot the true p(x)
%% MH algorithm
sig=(10); 
for i=1:N-1
    u=rand;
    x=X(i); 
    xs=normrnd(x,sig); % new sample xs based on existing x from proposal pdf.
    pxs=p(xs);
    px=p(x); 
    qxs=normpdf(xs,x,sig);
    qx=normpdf(x,xs,sig); % get p,q.
     if u<min(1,pxs^(1/T(i))*qx/(px^(1/T(i))*qxs))  % case 1: pesudo code
%     if u<min(1,pxs^(1/T(i))/(px)^(1/T(i)))        % case 2: Metropolis algorithm
%     if u<min(1,pxs^(1/T(i))/qxs/(px^(1/T(i))/qx)) % case 3: independent sampler
        X(i+1)=xs;
    else
        X(i+1)=x; 
    end
end
% compare pdf of the simulation result with true pdf.
N0=1;  close all;figure; %N/5; 
nb=histc(X(N0+1:N),xx); 
bar(xx+dx/2,nb/(N-N0)/dx); % plot samples.
A=sum(fp)*dx; 
hold on; plot(xx,fp/A,'r') % compare.
% figure(2); plot(N0+1:N,X(N0+1:N)) % plot the traces of x.

% compare cdf with true cdf.
F1(1)=0;
F2(1)=0;
for i=2:length(xx) 
  F1(i)=F1(i-1)+nb(i)/(N-N0); 
  F2(i)=F2(i-1)+fp(i)*dx/A;
end

figure
plot(xx,[F1' F2'])
max(F1-F2) % this is the true possible measure of accuracy.