Estimation for the Discrete Uniform Distribution | Serial Number Estimation


天天学习的零柒贰幺


Tags: probability theory

Article link: https://blog.csdn.net/weixin_42294255/article/details/105169240


Serial Number Estimation


Parameter description

In formulas written in terms of $\theta$, $n$ is the sample size and $\hat{\theta}$ is the estimator of the actual maximum ID $\theta$.
In formulas written in terms of $M$, $k$ is the sample size and $N$ is the actual maximum ID; $M$ is the random variable for the maximum ID observed in a random sample.

Method 1: Probability of each sample

The estimator of the maximum value can also be derived by assuming that every ID is equally likely to be sampled, where $\theta$ is the actual maximum ID on a given day. Each ID $x \in \{1,\dots,\theta\}$ then has probability

$P(X=x)=\frac{1}{\theta}$

Method 2: Probability of maximum sample

According to Assumption 1, we treat the observed maximum ID as a random variable $M$ and denote by $m$ the maximum ID encountered on one specific day (i.e. $x_{n:n}$). Let $N$ be the actual maximum ID and $k$ the number of sampled ill cases. The probability mass function (PMF) of the maximum ID is then:

$P(M=m)=\frac{\binom{m-1}{k-1}}{\binom{N}{k}},\quad m=k,\dots,N$
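As a quick sanity check on this PMF, the Python sketch below evaluates it with exact fractions. It assumes that the $k$ IDs are distinct and drawn without replacement from $1,\dots,N$. The function name `max_id_pmf` and the values of `N` and `k` are illustrative and do not come from the original article.

```python
from fractions import Fraction
from math import comb


def max_id_pmf(m: int, k: int, N: int) -> Fraction:
    """P(M = m): probability that the largest of k distinct IDs drawn from 1..N equals m."""
    if m < k or m > N:
        # The maximum of k distinct IDs is at least k and at most N.
        return Fraction(0)
    # Choose the other k-1 IDs below m, out of all k-subsets of 1..N.
    return Fraction(comb(m - 1, k - 1), comb(N, k))


if __name__ == "__main__":
    N, k = 20, 4  # illustrative values only
    pmf = {m: max_id_pmf(m, k, N) for m in range(1, N + 1)}
    total = sum(pmf.values())
    mean = sum(m * p for m, p in pmf.items())
    print("sum of PMF:", total)
    print("E[M]:", mean, "vs k(N+1)/(k+1):", Fraction(k * (N + 1), k + 1))
```

The probabilities sum to one, and the mean agrees with the known closed form $E(M)=\frac{k(N+1)}{k+1}$.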

Point Estimate

Estimators intuited from discrete uniform distribution

Estimator 1: 2*Mean-1

Consider the continuous analogue of this problem, i.e. $UNIF(0,\theta)$, for which

$E(X)=\frac{\theta}{2},\quad Var(X)=\frac{\theta^2}{12}$

We consider the following estimator:

$\widehat{\theta}_1=\frac{2}{n}\sum^n_{i=1}X_i-1$ for the discrete distribution

$\widehat{\theta}_1=\frac{2}{n}\sum^n_{i=1}X_i$ for the continuous distribution

In the continuous case,

$E(\widehat{\theta}_1)=E(\frac{2}{n}\sum^n_{i=1}X_i)=2E(\overline{X})=\theta$

$Var(\widehat{\theta}_1)=\frac{4}{n^2}\sum_{i=1}^nVar(X_i)=\frac{4}{n^2}\sum_{i=1}^n\frac{\theta^2}{12}=\frac{\theta^2}{3n}$

In the discrete case $E(X)=\frac{\theta+1}{2}$, so subtracting 1 again gives $E(\widehat{\theta}_1)=2E(\overline{X})-1=\theta$.

Therefore $\widehat{\theta}_1$ is an unbiased estimator, with $Var(\widehat{\theta}_1)=\frac{\theta^2}{3n}$ in the continuous case.
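The discrete version of this estimator can be checked exactly for small cases. The sketch below enumerates every ordered sample and assumes i.i.d. draws (sampling with replacement) from $1,\dots,\theta$, matching the independence used in the variance derivation. The function name `estimator_1_moments` and the values of `theta` and `n` are illustrative. For the discrete uniform distribution $Var(X)=\frac{\theta^2-1}{12}$, so the variance to compare against is $\frac{\theta^2-1}{3n}$.

```python
from fractions import Fraction
from itertools import product
from typing import Tuple


def estimator_1_moments(theta: int, n: int) -> Tuple[Fraction, Fraction]:
    """Exact mean and variance of 2*mean - 1 over all n-tuples drawn i.i.d. from 1..theta."""
    outcomes = list(product(range(1, theta + 1), repeat=n))
    weight = Fraction(1, len(outcomes))  # every ordered tuple is equally likely
    values = [Fraction(2 * sum(xs), n) - 1 for xs in outcomes]
    mean = sum(v * weight for v in values)
    var = sum((v - mean) ** 2 * weight for v in values)
    return mean, var


if __name__ == "__main__":
    theta, n = 6, 3  # illustrative values only; enumeration grows as theta**n
    mean, var = estimator_1_moments(theta, n)
    print("E[theta_hat_1]:", mean, "(target:", theta, ")")
    print("Var[theta_hat_1]:", var, "vs (theta^2 - 1)/(3n):", Fraction(theta**2 - 1, 3 * n))
```

Exhaustive enumeration is only practical for small `theta` and `n`; a Monte Carlo simulation would be the usual choice for larger cases.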

Estimator 2: Max + Avg GAP

Consider another way to improve on the MLE (the sample maximum): use an averaging approach to estimate the gap between the maximum and the upper limit:

$\widehat{\theta}_2 = X_{n:n}+\frac{1}{n-1}\sum_{i>j}(X_i-X_j-1)$ (discrete case)

$\widehat{\theta}_2 = X_{n:n}+\frac{1}{n-1}\sum_{i>j}(X_i-X_j)$ (continuous case)

Calculate the expected value and variance to determine whether this estimator is biased. For the continuous case, taking $E(X_i-X_j)=0$ for identically distributed $X_i$ and $X_j$:

$E(\widehat{\theta}_2) = E(X_{n:n}) + \frac{1}{n-1}\sum_{i>j}E(X_i-X_j) = \frac{n\theta}{n+1}$

$Var(\widehat{\theta}_2) = \frac{n\theta^2}{(n+1)(n-1)(n+2)}$

Therefore, $\widehat{\theta}_2$ is a biased estimator.

Estimator 3: Min+max estimator

The maximum sample ID is the observation closest to the upper limit, and we can add more information to it. Intuitively, we first consider the minimum sample ID plus the maximum sample ID:

$\widehat{\theta}_3=X_{1:n}+X_{n:n}$

For the maximum:

$F_{X_{n:n}}(x) =[F_X(x)]^n=\frac{x^n}{\theta^n},\quad f_{X_{n:n}}(x)=n\frac{x^{n-1}}{\theta^n}$

$E[X_{n:n}]=\int_0^\theta x\,n\frac{x^{n-1}}{\theta^n}dx=\frac{n}{n+1}\theta$

$E[X_{n:n}^2] =\int_0^\theta x^2\,n\frac{x^{n-1}}{\theta^n}dx=\frac{n}{n+2}\theta^2$

For the minimum:

$F_{X_{1:n}}(x) =1-[1-F_X(x)]^n=1-(\frac{\theta-x}{\theta})^n,\quad f_{X_{1:n}}(x)=\frac{n (\theta-x)^{n-1}}{\theta^n}$

$E[X_{1:n}]=\int_0^\theta x\frac{n (\theta-x)^{n-1}}{\theta^n}dx=\frac{1}{n+1}\theta$

$E[X_{1:n}^2] =\int_0^\theta x^2\frac{n (\theta-x)^{n-1}}{\theta^n}dx=\frac{2}{(n+1)(n+2)}\theta^2$

Hence the estimator is unbiased:

$E(\widehat{\theta}_3)=E(X_{1:n})+E(X_{n:n})=\theta$

For the variance:

$Var(\widehat{\theta}_3)=Var(X_{1:n})+Var(X_{n:n})+2Cov(X_{1:n},X_{n:n})$

$Var(X_{1:n})=\frac{2}{(n+1)(n+2)}\theta^2-(\frac{1}{n+1}\theta)^2=\frac{n\theta^2}{(n+1)^2(n+2)}$

$Var(X_{n:n})=\frac{n}{n+2}\theta^2-(\frac{n}{n+1}\theta)^2=\frac{n\theta^2}{(n+1)^2(n+2)}$

The joint density of the order statistics $U_{i:n}$ and $U_{j:n}$ ($i<j$) of $UNIF(0,1)$ is

$f_{U_{i:n},U_{j:n}}(u,v)=n!\frac{u^{i-1}}{(i-1)!}\frac{(v-u)^{j-i-1}}{(j-i-1)!}\frac{(1-v)^{n-j}}{(n-j)!},\quad 0<u<v<1$

which gives

$Cov(U_{i:n},U_{j:n})=\frac{i(n-j+1)}{(n+1)^2(n+2)}$

Setting $i=1$, $j=n$ and rescaling by $\theta^2$ gives $Cov(X_{1:n},X_{n:n})=\frac{\theta^2}{(n+1)^2(n+2)}$, so

$Var(\widehat{\theta}_3)=\frac{2n\theta^2}{(n+1)^2(n+2)}+\frac{2\theta^2}{(n+1)^2(n+2)}=\frac{2\theta^2}{(n+1)(n+2)}$
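A small Monte Carlo run is one way to sanity-check the moments of the min+max estimator in the continuous case. The sketch below assumes $X_i \sim UNIF(0,\theta)$ i.i.d. and compares the simulated mean and variance with $\theta$ and with $\frac{2\theta^2}{(n+1)(n+2)}$, the value obtained by combining $Var(X_{1:n})=Var(X_{n:n})=\frac{n\theta^2}{(n+1)^2(n+2)}$ with $Cov(X_{1:n},X_{n:n})=\frac{\theta^2}{(n+1)^2(n+2)}$. The function name, seed, and parameter values are illustrative only.

```python
import random
from typing import Tuple


def simulate_min_plus_max(theta: float, n: int, trials: int, seed: int = 0) -> Tuple[float, float]:
    """Monte Carlo mean and variance of X_{1:n} + X_{n:n} for X_i ~ UNIF(0, theta)."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same numbers
    estimates = []
    for _ in range(trials):
        sample = [rng.uniform(0.0, theta) for _ in range(n)]
        estimates.append(min(sample) + max(sample))
    mean = sum(estimates) / trials
    var = sum((e - mean) ** 2 for e in estimates) / trials
    return mean, var


if __name__ == "__main__":
    theta, n = 1.0, 5  # illustrative values only
    mean, var = simulate_min_plus_max(theta, n, trials=100_000)
    print("simulated mean:", mean, "target:", theta)
    print("simulated variance:", var, "closed form:", 2 * theta**2 / ((n + 1) * (n + 2)))
```

The simulated values should land close to the closed forms, with agreement tightening as `trials` grows.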
