Preliminary
- Unbiased Estimator. Remember that estimators are random variables; an estimator is unbiased if its expected value equals the true value of the parameter being estimated. Regression gives a concrete example. Suppose you measure two variables x and y whose true (linear) relationship is y = 5*x + 2. Any sample you draw will contain noise, so you must estimate the true slope and intercept. Suppose you draw a thousand samples of (x, y) and compute the least squares estimators for each sample (assuming the noise is normally distributed). As you do that, you'll notice two things:
- All of your estimates are different (because the data is noisy)
- The mean of all of those estimates starts to converge on the true values (5 and 2)
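The two observations above can be checked with a quick simulation. This is a minimal sketch (the true values 5 and 2, the sample size, and the noise level are taken from or assumed for the example above): fit a least squares line to each of 1000 noisy samples and average the estimates.

```python
import numpy as np

rng = np.random.default_rng(0)
true_slope, true_intercept = 5.0, 2.0

slopes, intercepts = [], []
for _ in range(1000):  # 1000 independent samples (datasets)
    x = rng.uniform(0.0, 10.0, size=50)
    # y = 5*x + 2 plus normally distributed noise
    y = true_slope * x + true_intercept + rng.normal(0.0, 1.0, size=50)
    slope, intercept = np.polyfit(x, y, deg=1)  # least squares fit
    slopes.append(slope)
    intercepts.append(intercept)

# Each individual estimate differs, but the mean converges on (5, 2)
print(np.mean(slopes))      # close to 5
print(np.mean(intercepts))  # close to 2
```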
- Markov Chain. A Markov chain is a mathematical system that undergoes transitions from one state to another among a finite or countable number of possible states. It is a random process usually characterized as memoryless: the next state depends only on the current state, not on the sequence of events that preceded it. This specific kind of memorylessness is called the Markov property.
- Mixing: no matter which state (node) the random process starts from, it eventually stabilizes at a stationary distribution; this convergence is known as mixing.
- The mixing time of a Markov chain is the time until the chain is "close" to its steady-state distribution, i.e. the stationary distribution π.
- Rapid/fast mixing means that the mixing time grows at most polynomially in log(n), where n is the number of states of the chain. Tools for proving rapid mixing include arguments based on conductance and the method of coupling.
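Mixing is easy to see numerically. This is an illustrative sketch with a made-up 3-state transition matrix P (not from the source): iterating d ← dP from two very different starting distributions drives both to the same stationary distribution π, which satisfies πP = π.

```python
import numpy as np

# A small 3-state Markov chain; each row of P sums to 1
P = np.array([[0.5, 0.3, 0.2],
              [0.1, 0.6, 0.3],
              [0.2, 0.3, 0.5]])

# Two different initial distributions: start in state 0 vs. state 2
d1 = np.array([1.0, 0.0, 0.0])
d2 = np.array([0.0, 0.0, 1.0])

for _ in range(50):  # run the chain: d <- d P
    d1 = d1 @ P
    d2 = d2 @ P

print(d1)  # both converge to the same stationary distribution pi
print(d2)
print(d1 @ P)  # stationarity: pi P = pi
```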
Monte Carlo Methods
- Monte Carlo is the art of approximating an expectation by the sample mean of a function of simulated random variables. (Eric C. Anderson, 1999).
- Monte Carlo is about invoking laws of large numbers to approximate expectations.
- Monte Carlo methods (or Monte Carlo experiments) are a broad class of computational algorithms that rely on repeated random sampling (i.e., simulations) to obtain numerical results (in order to determine the properties of some phenomenon or behavior).
- Monte Carlo methods are mainly used in three distinct problems: optimization, numerical integration and generation of samples from a probability distribution.
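The definition above (Monte Carlo as approximating an expectation by a sample mean) can be sketched in a few lines. The choice of g(x) = x² and X ~ Uniform(0, 1) is an illustrative assumption, not from the source; the exact expectation is E[X²] = 1/3, so the sample mean should land near that value.

```python
import numpy as np

rng = np.random.default_rng(1)

# Approximate E[g(X)] for g(x) = x^2, X ~ Uniform(0, 1),
# by the sample mean of g over simulated draws.
# The exact value is 1/3.
n = 1_000_000
x = rng.uniform(0.0, 1.0, size=n)
estimate = np.mean(x ** 2)
print(estimate)  # close to 1/3
```

By the law of large numbers, the estimate converges to 1/3 as n grows; the error shrinks at the usual O(1/sqrt(n)) Monte Carlo rate.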
Importance Sampling
- Importance sampling is a Monte Carlo scheme that does not involve a Markov chain; every sample yields an unbiased estimator.
- It is used when one cannot sample from P but has a proposal distribution Q.
- Importance sampling is used for the purpose of numerical integration.
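A minimal sketch of the scheme described above: draw from a proposal Q, then reweight each sample by p(x)/q(x) to get an unbiased estimate of an expectation under the target P. The concrete choices here (target P = N(0, 1), proposal Q = N(0, 2), estimating E_P[X²] = 1) are illustrative assumptions, not from the source.

```python
import numpy as np

rng = np.random.default_rng(2)

def normal_pdf(x, mu, sigma):
    """Density of N(mu, sigma^2)."""
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

# Target P = N(0, 1); proposal Q = N(0, 2) that we can sample from
n = 200_000
x = rng.normal(0.0, 2.0, size=n)                    # draw from Q
w = normal_pdf(x, 0.0, 1.0) / normal_pdf(x, 0.0, 2.0)  # weights p(x)/q(x)

# Unbiased estimate of E_P[X^2] = 1 (the variance of P)
estimate = np.mean(w * x ** 2)
print(estimate)  # close to 1
```

The estimator is unbiased because E_Q[w(X) g(X)] = ∫ q(x) (p(x)/q(x)) g(x) dx = E_P[g(X)], which is why the method doubles as a numerical integration technique.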
References
- Eric C. Anderson, Monte Carlo Methods and Importance Sampling, Statistical Genetics, 1999.
- Bengio and Senecal, Quick training of probabilistic neural nets by importance sampling, 2003.
- Sampling: http://en.wikipedia.org/wiki/Sampling_(statistics)
- Monte Carlo method: http://en.wikipedia.org/wiki/Monte_Carlo_method