Modeling and Parameter Estimation of Autoregressive (AR) Models in Python

Parameter Estimation for the AR(p) Model

Suppose $\{X_t\}$ satisfies $X_t = a_1 X_{t-1} + \cdots + a_p X_{t-p} + \epsilon_t$, where $\{\epsilon_t\}$ is an i.i.d. white-noise sequence with $E\epsilon_t^2 = \sigma^2$. Given a sample $X_1, X_2, \cdots, X_n$ from $\{X_t\}$, we want to estimate $a_1, a_2, \cdots, a_p$ and $\sigma^2$.

Creating an AR(2) process

$x_t = 0.6\,x_{t-1} - 0.75\,x_{t-2} + \epsilon_t$

import matplotlib.pyplot as plt
import numpy as np
from statsmodels.graphics.tsaplots import plot_acf, plot_pacf

n = 5000
mean = 0
std = 1
lag = 20

np.random.seed(0)
x_t = list(np.random.normal(mean, std, size=2))  # two initial values
epsilon_t = np.random.normal(mean, std, size=n)  # generate the white noise
for i in range(2, n+2):
    x_t.append(0.6 * x_t[i-1] - 0.75 * x_t[i-2] + epsilon_t[i-2])
# plot the simulated AR(2) series
plt.plot(x_t)
plt.title("AR(2) process")
plt.show()

# plot the autocorrelation (ACF) and partial autocorrelation (PACF) functions
plot_acf(x=x_t, lags=lag, title="ACF x_t")
plot_pacf(x=x_t, lags=lag, title="PACF x_t")
plt.show()
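For an AR(2) process the PACF should cut off after lag 2, which the plots above suggest visually. This can also be checked numerically: the sample PACF at lag $k$ equals the last coefficient of an AR($k$) model fitted by Yule-Walker. A minimal self-contained numpy sketch (it re-simulates the series rather than reusing `x_t`):

```python
import numpy as np

def pacf_via_yule_walker(x, max_lag):
    """Sample PACF: the lag-k value is the last coefficient of an AR(k)
    model obtained by solving the order-k Yule-Walker equations."""
    x = np.asarray(x, dtype=float)
    n, x_bar = len(x), np.mean(x)
    # sample autocovariances gamma_0 .. gamma_{max_lag}
    gamma = np.array([np.sum((x[k:] - x_bar) * (x[:n-k] - x_bar)) / n
                      for k in range(max_lag + 1)])
    pacf = np.empty(max_lag + 1)
    pacf[0] = 1.0
    for k in range(1, max_lag + 1):
        Gamma_k = np.array([[gamma[abs(i - j)] for j in range(k)] for i in range(k)])
        a_hat = np.linalg.solve(Gamma_k, gamma[1:k+1])
        pacf[k] = a_hat[-1]  # partial autocorrelation at lag k
    return pacf

# simulate x_t = 0.6 x_{t-1} - 0.75 x_{t-2} + eps_t
np.random.seed(0)
eps = np.random.normal(size=5000)
x = [0.0, 0.0]
for t in range(2, 5002):
    x.append(0.6 * x[t-1] - 0.75 * x[t-2] + eps[t-2])

pacf = pacf_via_yule_walker(x, 5)
print(np.round(pacf, 3))
```

The lag-2 value should land near the true coefficient $-0.75$, while lags 3 and beyond should be close to zero.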
  • Use the statsmodels package to estimate the parameters, for comparison with the methods below
# use statsmodels to fit this series
from statsmodels.tsa.arima.model import ARIMA
arma_mod20 = ARIMA(x_t, order=(2, 0, 0)).fit()
arma_mod20.summary()

Output: (fitted-model summary table, shown as an image in the original post)
Here `const` is the estimate of the sample mean of the series:

np.mean(x_t)

-0.01320250123146827

The remaining parameters can be estimated by any of the three methods below.

  1. Yule-Walker estimation
    From the Yule-Walker equations: $\vec b_p = \vec\Gamma_p \vec\alpha$
    where
    $$\vec b_p = \begin{bmatrix} \gamma_1 \\ \gamma_2 \\ \vdots \\ \gamma_p \end{bmatrix}, \quad \vec\Gamma_p = \begin{bmatrix} \gamma_0 & \gamma_1 & \cdots & \gamma_{p-1} \\ \gamma_1 & \gamma_0 & \cdots & \gamma_{p-2} \\ \vdots & \vdots & \ddots & \vdots \\ \gamma_{p-1} & \gamma_{p-2} & \cdots & \gamma_0 \end{bmatrix}, \quad \vec\alpha = \begin{bmatrix} a_1 \\ a_2 \\ \vdots \\ a_p \end{bmatrix}$$
    Here $\gamma_k$ is the autocovariance function, which can be replaced by the sample autocovariance $\hat\gamma_k$. The Yule-Walker estimates of $a_1, a_2, \cdots, a_p$ and $\sigma^2$ are then
    $$\begin{bmatrix} \hat a_1 \\ \hat a_2 \\ \vdots \\ \hat a_p \end{bmatrix} = \begin{bmatrix} \hat\gamma_0 & \hat\gamma_1 & \cdots & \hat\gamma_{p-1} \\ \hat\gamma_1 & \hat\gamma_0 & \cdots & \hat\gamma_{p-2} \\ \vdots & \vdots & \ddots & \vdots \\ \hat\gamma_{p-1} & \hat\gamma_{p-2} & \cdots & \hat\gamma_0 \end{bmatrix}^{-1} \begin{bmatrix} \hat\gamma_1 \\ \hat\gamma_2 \\ \vdots \\ \hat\gamma_p \end{bmatrix}, \quad \hat\sigma^2 = \hat\gamma_0 - \hat a_1\hat\gamma_1 - \cdots - \hat a_p\hat\gamma_p$$
    The autocovariance function is
    $$\gamma(h) = \text{cov}(X(t), X(t-h)) = E[X(t)-\mu_t][X(t-h)-\mu_{t-h}] = E[X(t)X(t-h)] - \mu_t\mu_{t-h}$$
def inverse_yule_walker_ar(x, p):
    """
    Direct inverse solution of the Yule-Walker equations.
    @param x: the dataset, a 1-D numpy array
    @param p: lags, p in AR(p)
    @return: a 1-D numpy array phi of shape p; phi[i] is the coefficient of x_{t-i-1} in an AR(p) model
    """
    x = np.asarray(x)
    # compute the sample autocovariance function gamma_0 .. gamma_p
    x_bar = np.mean(x)
    covp = []
    for i in range(p+1):
        lenx = len(x)-i
        xt = x[i:]
        xtp = x[:lenx]
        covp.append(np.sum((xt-x_bar)*(xtp-x_bar))/len(x))
    # build the p x p matrix Gamma_p with entries gamma_{|j-k|}
    R_p = np.zeros([p, p])
    for j in range(p):
        for k in range(p):
            R_p[j, k] = covp[np.abs(j-k)]
    # solve phi = Gamma_p^{-1} b_p
    phi = np.linalg.inv(R_p).dot(np.array(covp[1:]))
    return phi
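As a quick sanity check, the Yule-Walker solve can be applied to a freshly simulated AR(2) series. A minimal self-contained sketch (re-simulating the series rather than reusing `x_t` above, and writing out the order-2 system explicitly):

```python
import numpy as np

# simulate the AR(2) process x_t = 0.6 x_{t-1} - 0.75 x_{t-2} + eps_t
np.random.seed(0)
n = 5000
eps = np.random.normal(size=n)
x = [0.0, 0.0]
for t in range(2, n + 2):
    x.append(0.6 * x[t-1] - 0.75 * x[t-2] + eps[t-2])
x = np.asarray(x)

# sample autocovariances gamma_0, gamma_1, gamma_2
x_bar = x.mean()
gamma = [np.sum((x[k:] - x_bar) * (x[:len(x)-k] - x_bar)) / len(x)
         for k in range(3)]

# Yule-Walker: solve Gamma_2 a = b_2, then estimate sigma^2
Gamma = np.array([[gamma[0], gamma[1]],
                  [gamma[1], gamma[0]]])
b = np.array([gamma[1], gamma[2]])
a_hat = np.linalg.solve(Gamma, b)
sigma2_hat = gamma[0] - a_hat @ b
print(a_hat, sigma2_hat)  # should be close to (0.6, -0.75) and 1
```

statsmodels also ships a ready-made solver, `statsmodels.regression.linear_model.yule_walker`, which returns the coefficients and noise standard deviation directly.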
  2. Least-squares estimation
    Write $X_t = a_1 X_{t-1} + \cdots + a_p X_{t-p} + \epsilon_t$ for $t = p+1, p+2, \cdots, n$:
    $$\begin{bmatrix} X_{p+1} \\ X_{p+2} \\ \vdots \\ X_n \end{bmatrix} = \begin{bmatrix} X_p & X_{p-1} & \cdots & X_1 \\ X_{p+1} & X_p & \cdots & X_2 \\ \vdots & \vdots & \ddots & \vdots \\ X_{n-1} & X_{n-2} & \cdots & X_{n-p} \end{bmatrix} \begin{bmatrix} a_1 \\ a_2 \\ \vdots \\ a_p \end{bmatrix} + \begin{bmatrix} \epsilon_{p+1} \\ \epsilon_{p+2} \\ \vdots \\ \epsilon_n \end{bmatrix}$$
    In matrix form: $\vec Y = \vec X \vec\alpha + \vec\epsilon$
    Applying ordinary least squares from regression analysis:
    $$\vec{\hat\alpha} = (\vec X^T \vec X)^{-1} \vec X^T \vec Y$$
    $$\hat\sigma^2 = \frac{1}{n-p} \vec Y^T [\vec I - \vec X(\vec X^T \vec X)^{-1} \vec X^T] \vec Y$$
def OLS_ar(x, p):
    """
    Ordinary least-squares estimate for an AR(p) model.
    @param x: the dataset, a 1-D numpy array
    @param p: lags, p in AR(p)
    @return: a 1-D numpy array phi of shape p; phi[i] is the coefficient of x_{t-i-1} in an AR(p) model
    """
    x = np.asarray(x)
    # construct Y and the (n-p) x p design matrix X of lagged values
    lenx = len(x)-p
    Y = x[p:]
    X = np.zeros((lenx, p))
    for i in range(lenx):
        for j in range(p):
            X[i, j] = x[p+i-1-j]  # lag j+1 of observation Y[i]
    # phi = (X^T X)^{-1} X^T Y
    return np.dot(np.dot(np.linalg.inv(np.dot(X.T, X)), X.T), Y)
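The least-squares route can be checked the same way. A minimal self-contained sketch (re-simulating the AR(2) series; `np.linalg.lstsq` is used here instead of the explicit inverse, which is the numerically preferable way to solve the normal equations):

```python
import numpy as np

# simulate x_t = 0.6 x_{t-1} - 0.75 x_{t-2} + eps_t
np.random.seed(1)
n = 5000
eps = np.random.normal(size=n)
x = np.zeros(n + 2)
for t in range(2, n + 2):
    x[t] = 0.6 * x[t-1] - 0.75 * x[t-2] + eps[t-2]

p = 2
Y = x[p:]
# design matrix: column j holds lag j+1 of the response
X = np.column_stack([x[p-1-j:len(x)-1-j] for j in range(p)])
a_hat, *_ = np.linalg.lstsq(X, Y, rcond=None)
resid = Y - X @ a_hat
sigma2_hat = resid @ resid / (len(Y) - p)
print(a_hat, sigma2_hat)  # should be close to (0.6, -0.75) and 1
```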
  3. Maximum likelihood estimation
    If $\{\epsilon_t\}$ is Gaussian white noise, then $X = (X_1, X_2, \cdots, X_n)^T \sim N_n(\boldsymbol 0, \boldsymbol\Sigma)$, and the log-likelihood is
    $$\begin{aligned} \ln L(\boldsymbol\alpha, \sigma^2) &= \ln\frac{1}{(2\pi)^{n/2}|\boldsymbol\Sigma|^{1/2}}\,\text{e}^{-\frac{1}{2}\boldsymbol x^T\boldsymbol\Sigma^{-1}\boldsymbol x} \\ &= -\frac{n}{2}\ln(2\pi) - \frac{1}{2}\ln|\boldsymbol\Sigma| - \frac{1}{2}\boldsymbol x^T\boldsymbol\Sigma^{-1}\boldsymbol x \end{aligned}$$
    Here $\boldsymbol\Sigma$ is the covariance matrix of $X$, an $n \times n$ positive-definite symmetric matrix, and $|\boldsymbol\Sigma|$ is its determinant.
    The detailed derivation is omitted here; see https://blog.csdn.net/sunbobosun56801/article/details/99753664
    The conclusion is that the maximum likelihood estimate agrees with the least-squares estimate.
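The agreement can be illustrated numerically: for fixed $\sigma^2$, the OLS coefficients maximize the conditional Gaussian log-likelihood (which conditions on the first $p$ observations), so perturbing them can only lower it. A minimal sketch, using the conditional likelihood for simplicity rather than the exact likelihood above:

```python
import numpy as np

def cond_loglik(x, a, sigma2):
    """Conditional Gaussian log-likelihood of an AR(p) model,
    conditioning on the first p observations."""
    p = len(a)
    resid = x[p:] - sum(a[j] * x[p-1-j:len(x)-1-j] for j in range(p))
    m = len(resid)
    return -m/2 * np.log(2*np.pi*sigma2) - resid @ resid / (2*sigma2)

# simulate x_t = 0.6 x_{t-1} - 0.75 x_{t-2} + eps_t
np.random.seed(2)
n = 5000
eps = np.random.normal(size=n)
x = np.zeros(n + 2)
for t in range(2, n + 2):
    x[t] = 0.6 * x[t-1] - 0.75 * x[t-2] + eps[t-2]

# OLS fit of the AR(2) coefficients
Y = x[2:]
X = np.column_stack([x[1:-1], x[:-2]])
a_ols, *_ = np.linalg.lstsq(X, Y, rcond=None)
s2 = np.mean((Y - X @ a_ols)**2)

ll_ols = cond_loglik(x, a_ols, s2)
ll_pert = cond_loglik(x, a_ols + np.array([0.05, -0.05]), s2)
print(ll_ols > ll_pert)  # prints True: OLS maximizes the conditional likelihood
```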

References

[1] 孙祝岭 (ed.), 时间序列与多元统计分析 (Time Series and Multivariate Statistical Analysis), 2016.
[2] https://github.com/charlieblue17/timeseries2018
