Contents
- Lecture 1 Intro
- Lecture 2 Estimation
- Lecture 3 Confidence intervals and hypothesis testing
- Lecture 4 Exponential Family
- Lecture 5 Normal Distribution
- Lecture 6 Missing Data
- Lecture 7 Markov Chains
- Lecture 8 Time Series
- Lecture 9 Linear Regression Models
- Lecture 10 Generalized Linear Models
- Lecture 11 Survival Data
- Lecture 12 Nonparametric Regression
- Lecture 13 Generalized Additive Models
Lecture 1 Intro
- LLN (Law of large numbers)
- CLT (Central Limit Theorem)
- CMT (Continuous Mapping Theorem)
- ST (Slutsky’s Theorem)
- Delta Method
Lecture 2 Estimation
Estimation:
- unbiased
- consistent
- Accuracy of an estimator: $MSE(\hat{\theta}) = Var(\hat{\theta}) + bias(\hat{\theta}, \theta)^2 = Var(\hat{\theta}) + [E(\hat{\theta}) - \theta]^2$ (see the simulation sketch after this list)
- Relative efficiency
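As a quick check of the MSE decomposition, here is a minimal simulation sketch (not from the lecture; the normal data and the divide-by-$n$ variance estimator are assumptions chosen for illustration) verifying numerically that $MSE \approx Var + bias^2$:

```python
import numpy as np

rng = np.random.default_rng(0)
n, sigma2, reps = 20, 4.0, 100_000

# Biased (divide-by-n) variance estimator applied to N(0, sigma^2) samples
x = rng.normal(0.0, np.sqrt(sigma2), size=(reps, n))
est = np.mean((x - x.mean(axis=1, keepdims=True)) ** 2, axis=1)

mse = np.mean((est - sigma2) ** 2)   # E[(theta_hat - theta)^2]
var = np.var(est)                    # Var(theta_hat)
bias = np.mean(est) - sigma2         # E[theta_hat] - theta

print(mse, var + bias ** 2)          # the two numbers should nearly agree
```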
Two estimation methods
Method of Moments
- Theorem
Maximum Likelihood Estimator
- Fisher information
- Theorem (consistent, unbiased)
Optimality in estimation
- Cramer-Rao lower bound
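Since the notes only list the topics, here is a small hedged sketch (the exponential model, sample size, and rate are assumptions made for illustration) of an MLE and a comparison of its empirical variance with the Cramer-Rao lower bound $\lambda^2/n$:

```python
import numpy as np

rng = np.random.default_rng(1)
lam, n, reps = 2.0, 500, 20_000

# MLE of the exponential rate is 1 / sample mean
samples = rng.exponential(scale=1.0 / lam, size=(reps, n))
mle = 1.0 / samples.mean(axis=1)

crlb = lam ** 2 / n        # Cramer-Rao lower bound (Fisher information is n / lambda^2)
print(np.var(mle), crlb)   # empirical variance is close to (slightly above) the bound
```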
Lecture 3 Confidence intervals and hypothesis testing
Lecture 4 Exponential Family
Lecture 5 Normal Distribution
Lecture 6 Missing Data
Lecture 7 Markov Chains
Lecture 8 Time Series
Measure of dependence
(Auto) Covariance function:
$\gamma(s, t) = cov(Y_t, Y_s) = E[(Y_t - \mu_t)(Y_s - \mu_s)]$
(Auto) Correlation function:
$\rho(s, t) = cor(Y_t, Y_s) = \frac{\gamma(s, t)}{\sqrt{\gamma(s, s)\,\gamma(t, t)}}$
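A minimal numpy sketch (not from the notes; the helper name sample_acf and the lag cutoff are assumptions for illustration) of the sample versions $\hat\gamma(h)$ and $\hat\rho(h)$ of these quantities:

```python
import numpy as np

def sample_acf(y, max_lag=20):
    """Sample autocovariance and autocorrelation up to max_lag."""
    y = np.asarray(y, dtype=float)
    n, mu = len(y), y.mean()
    gamma = np.array([np.sum((y[h:] - mu) * (y[:n - h] - mu)) / n
                      for h in range(max_lag + 1)])
    return gamma, gamma / gamma[0]

rng = np.random.default_rng(0)
gamma_hat, rho_hat = sample_acf(rng.normal(size=1000))
print(rho_hat[:5])   # for white noise, rho_hat(h) is near 0 for h >= 1
```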
Stationarity
Definition: for any finite set of time points $t_1, \dots, t_k$, the joint distribution of $(Y_{t_1+s}, \dots, Y_{t_k+s})$ does not depend on the shift $s$.
Relaxed version: second-order (weak) stationarity
- $E(Y_t) = \mu$ for all $t$
- $cov(Y_s, Y_{s+t})$ does not depend on $s$
- $\rho_t = cor(Y_0, Y_t)$
White Noise
Definition: A stochastic process $\{Y_t\}$ is called white noise if its elements are uncorrelated, with mean $E(Y_t) = 0$ and $Var(Y_t) = \sigma^2$.
- $\rho_t = 0$ for $t \neq 0$
Autoregressive models
$AR(1)$:
$Y_t - \mu = \alpha (Y_{t-1} - \mu) + \epsilon_t$
The white noise $\epsilon_t$ is independent of $\dots, Y_{t-2}, Y_{t-1}$. It is also called the innovation, as it adds something new to the process. Without innovations, $Y_t$ would just be a scaled version of $Y_{t-1}$.
- $Var(Y_t) = \alpha^2 Var(Y_{t-1}) + \sigma^2 \Rightarrow \gamma_0 = \alpha^2 \gamma_0 + \sigma^2$, i.e. $\gamma_0 = \sigma^2 / (1 - \alpha^2)$
- $\{Y_t\}$ is stationary $\iff |\alpha| < 1$
- AR(1) is a Markov process.
- The only nonzero partial autocorrelation is $\rho'_1 = \alpha$.
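A short simulation sketch of an AR(1) process (the parameter values are assumptions for illustration), checking the stationary variance $\gamma_0 = \sigma^2/(1-\alpha^2)$ and the lag-1 correlation $\rho_1 = \alpha$:

```python
import numpy as np

rng = np.random.default_rng(0)
alpha, sigma, mu, n = 0.7, 1.0, 5.0, 200_000

y = np.empty(n)
y[0] = mu + rng.normal(0, sigma / np.sqrt(1 - alpha ** 2))  # start in the stationary distribution
for t in range(1, n):
    y[t] = mu + alpha * (y[t - 1] - mu) + rng.normal(0, sigma)

print(np.var(y), sigma ** 2 / (1 - alpha ** 2))   # gamma_0 vs sigma^2 / (1 - alpha^2)
print(np.corrcoef(y[:-1], y[1:])[0, 1])           # lag-1 autocorrelation, approximately alpha
```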
$AR(p)$:
$Y_t - \mu = \sum_{j=1}^p \alpha_j (Y_{t-j} - \mu) + \epsilon_t$
Moving average models
$MA(q)$:
$Y_t - \mu = \sum_{j=1}^q \beta_j \epsilon_{t-j} + \epsilon_t$
- $E(Y_t) = \mu$ and $Var(Y_t) = \sigma^2 (1 + \beta_1^2 + \dots + \beta_q^2)$ for all $t$.
- This process is stationary and such that $\rho_t = 0$ for $t > q$.
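A minimal sketch of the autocorrelation cutoff (the MA(2) coefficients are assumed for illustration): the sample autocorrelation should be near zero beyond lag $q = 2$.

```python
import numpy as np

rng = np.random.default_rng(0)
beta, sigma, mu, n = np.array([0.6, 0.3]), 1.0, 0.0, 200_000

# MA(2): Y_t = mu + eps_t + beta_1 * eps_{t-1} + beta_2 * eps_{t-2}
eps = rng.normal(0, sigma, size=n + 2)
y = mu + eps[2:] + beta[0] * eps[1:-1] + beta[1] * eps[:-2]

# sample autocorrelations at lags 1..4: the first two are nonzero, the rest are near 0
for h in range(1, 5):
    print(h, np.corrcoef(y[:-h], y[h:])[0, 1])
```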
ARMA models
$ARMA(p, q)$:
$Y_t - \mu = \sum_{j=1}^p \alpha_j (Y_{t-j} - \mu) + \sum_{j=1}^q \beta_j \epsilon_{t-j} + \epsilon_t$
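A hedged sketch of simulating and fitting an ARMA(1, 1) model; it assumes the statsmodels package is available and uses its ARIMA class with $d = 0$ as the ARMA fit (the parameter values are made up for the example):

```python
import numpy as np
from statsmodels.tsa.arima.model import ARIMA

rng = np.random.default_rng(0)
alpha, beta1, sigma, n = 0.6, 0.4, 1.0, 5000

# Simulate ARMA(1,1) with mu = 0: Y_t = alpha * Y_{t-1} + beta1 * eps_{t-1} + eps_t
eps = rng.normal(0, sigma, size=n)
y = np.zeros(n)
for t in range(1, n):
    y[t] = alpha * y[t - 1] + beta1 * eps[t - 1] + eps[t]

fit = ARIMA(y, order=(1, 0, 1)).fit()   # ARMA(p, q) corresponds to ARIMA(p, 0, q)
print(fit.params)                       # AR and MA estimates should be close to alpha and beta1
```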
Lecture 9 Linear Regression Models
Statistical model
$Y_i = \alpha + \beta_1 X_{i1} + \beta_2 X_{i2} + \dots + \beta_d X_{id} + \epsilon_i, \quad i = 1, 2, \dots, n$
- The noise terms $\epsilon_i$ are i.i.d. with $E(\epsilon_i) = 0$, $Var(\epsilon_i) = \sigma^2$, and independent of $X_{ij}$ for $j = 1, \dots, d$.
- Provided $n \geq p$ and $X$ has full rank, i.e. $rank(X) = 1 + d = p$, we have a closed-form solution:
$\hat{\beta} = (X^TX)^{-1}X^TY$ (see the numpy sketch after this list)
- $\hat{\beta}$ is unbiased, since
  $\hat{\beta} = (X^TX)^{-1}X^T(X\beta + \epsilon) = \beta + (X^TX)^{-1} X^T \epsilon$,
  so $E(\hat{\beta}) = \beta$.
- Statistical properties:
  - Consistency: as $n \to \infty$, $\hat{\beta} \to \beta$.
  - Asymptotic normality: $\sqrt{n}(\hat{\beta} - \beta) \to N(0, \sigma^2 Q^{-1})$, where $Q = E(XX^T)$.
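A minimal numpy sketch of the closed-form solution $\hat\beta = (X^TX)^{-1}X^TY$ (not from the notes; the design matrix and coefficient values are assumptions for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 500, 3
beta_true = np.array([1.0, 2.0, -1.0, 0.5])   # intercept alpha plus d slopes

# n x p design with an intercept column, p = 1 + d
X = np.column_stack([np.ones(n), rng.normal(size=(n, d))])
y = X @ beta_true + rng.normal(0, 1.0, size=n)

# (X^T X)^{-1} X^T Y, solved without forming an explicit inverse
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
print(beta_hat)   # close to beta_true
```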
Normal Linear model
Assume that the errors $\epsilon_i$ in the model are i.i.d. $N(0, \sigma^2)$.
- $Y_i \mid X_i = x_i \sim N(x_i^T\beta, \sigma^2)$
- $\hat{\beta}$ is also the MLE of $\beta$
- $E(\hat{\beta}) = \beta$
- $Var(\hat{\beta}) = E[(\beta + (X^TX)^{-1} X^T \epsilon - \beta)(\beta + (X^TX)^{-1} X^T \epsilon - \beta)^T] = \sigma^2 (X^TX)^{-1}$
- $\hat{\sigma}^2 = \frac{1}{n-p} \|Y - X\hat{\beta}\|^2$
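Continuing the hedged OLS sketch above (the helper name ols_with_se is made up for illustration), the variance estimate $\hat\sigma^2 = \|Y - X\hat\beta\|^2/(n-p)$ and the estimated covariance $\hat\sigma^2 (X^TX)^{-1}$ of $\hat\beta$ can be computed as:

```python
import numpy as np

def ols_with_se(X, y):
    """Closed-form OLS with the residual variance estimate and standard errors."""
    n, p = X.shape
    beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
    resid = y - X @ beta_hat
    sigma2_hat = resid @ resid / (n - p)             # ||Y - X beta_hat||^2 / (n - p)
    cov_beta = sigma2_hat * np.linalg.inv(X.T @ X)   # estimated Var(beta_hat)
    return beta_hat, sigma2_hat, np.sqrt(np.diag(cov_beta))
```

For example, with the X and y from the previous sketch, `beta_hat, sigma2_hat, se = ols_with_se(X, y)` gives $\hat\sigma^2$ close to the true noise variance.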