时序分析 (11) - 时序平滑(下)

最新推荐文章于 2023-05-30 10:15:53 发布

Magic Ktwc37

最新推荐文章于 2023-05-30 10:15:53 发布

阅读量898

点赞数 1

分类专栏：时序分析文章标签： Holt Winter Smoothing python R EWMA Damped方法

本文链接：https://blog.csdn.net/weixin_43171270/article/details/102987023

版权

时序分析专栏收录该内容

50 篇文章 327 订阅

订阅专栏

时序分析 (11)

时序平滑(下)

在本篇文章中的前两部分，我们已经介绍了多种平滑技术，重点讲解了在时序分析和预测中占有重要地位的指数平滑技术。在最后一篇中，我们需要进入实践部分。我们将比较多种平滑技术对四指数数据的平滑效果。

我们只对国内股票和香港股票的净值数据进行平滑

SMA
窗口为30天

data = indexs_sub[['国内股票','香港股票']].copy()

data['gg_30_ma'] = pd.rolling_mean(data['国内股票'],30)
data['xg_30_ma'] = pd.rolling_mean(data['香港股票'],30)
data['gg_60_ma'] = pd.rolling_mean(data['国内股票'],60)
data['xg_60_ma'] = pd.rolling_mean(data['香港股票'],60)
data['gg_90_ma'] = pd.rolling_mean(data['国内股票'],90)
data['xg_90_ma'] = pd.rolling_mean(data['香港股票'],90)

data[['国内股票','gg_30_ma','gg_60_ma','gg_90_ma']].plot(figsize=(16,8))

在这里插入图片描述

data[['香港股票','xg_30_ma','xg_60_ma','xg_90_ma']].plot(figsize=(16,8))

在这里插入图片描述
看上去，效果还算不错。很明显的是，滑动窗口越小，平滑结果越接近原时序。

下面我们计算一下窗口为30天的SMA与原时序数据的RMSE，MAE。

mse_gg = mean_squared_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_30_ma'])
rmse_gg = np.sqrt(mse_gg)

mae_gg = mean_absolute_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_30_ma'])
mse_xg = mean_squared_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_30_ma'])

rmse_xg = np.sqrt(mse_xg)
mae_xg = mean_absolute_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_30_ma'])

res = pd.DataFrame([[mse_gg,rmse_gg,mae_gg],[mse_xg,rmse_xg,mae_xg]],
index=['国内股票','香港股票'],
columns=['SMA30_MSE','SMA30_RMSE','SMA30_MAE']).T
res

在这里插入图片描述

EWMA
在需要更加精确的情景下，一般值都需要进行回归估算，我们这里采用通用的方法来简单估算
α = 2/(1 + span)

data['gg_ewma_30'] = data[['国内股票']].ewm(span=30).mean()
data['gg_ewma_60'] = data[['国内股票']].ewm(span=60).mean()
data['gg_ewma_90'] = data[['国内股票']].ewm(span=90).mean()
data['xg_ewma_30'] = data[['香港股票']].ewm(span=30).mean()
data['xg_ewma_60'] = data[['香港股票']].ewm(span=60).mean()
data['xg_ewma_90'] = data[['香港股票']].ewm(span=90).mean()

data[['国内股票','gg_ewma_30','gg_ewma_60','gg_ewma_90']].plot(figsize=(16,8))

在这里插入图片描述

data[['香港股票','xg_ewma_30','xg_ewma_60','xg_ewma_90']].plot(figsize=(16,8))

在这里插入图片描述
计算EWMA30天的RMSE,MAE

mse_gg_ewma = mean_squared_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_ewma_30'])
rmse_gg_ewma = np.sqrt(mse_gg_ewma)
mae_gg_ewma = mean_absolute_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_ewma_30'])
mse_xg_ewma = mean_squared_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_ewma_30'])
rmse_xg_ewma = np.sqrt(mse_xg_ewma)
mae_xg_ewma = mean_absolute_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_ewma_30'])

res_ewma = pd.DataFrame([[mse_gg_ewma,rmse_gg_ewma,mae_gg_ewma],[mse_xg_ewma,rmse_xg_ewma,mae_xg_ewm
         index=['国内股票','香港股票'],
         columns=['EWMA30_MSE','EWMA30_RMSE','EWMA30_MAE']).T
res_ewma

在这里插入图片描述

res = pd.concat([res,res_ewma])
res

在这里插入图片描述
相比于SMA，EWMA的RMSE和MAE都有所下降。

Holt’s Linear Trend method

data.to_csv('data/index4_data.csv',encoding='utf-8')

def plotseasonal(res, axes, title,color='steelblue' ):
    res.observed.plot(ax=axes[0], legend=False, title=title, color=color)
    axes[0].set_ylabel('Observed')
    res.trend.plot(ax=axes[1], legend=False, color=color)
    axes[1].set_ylabel('Trend')
    res.seasonal.plot(ax=axes[2], color=color, legend=False)
    axes[2].set_ylabel('Seasonal')
    res.resid.plot(ax=axes[3], color=color, legend=False)
    axes[3].set_ylabel('Residual')

gg_hl = sm.tsa.seasonal_decompose(data['国内股票'],freq=30)
xg_hl = sm.tsa.seasonal_decompose(data['香港股票'],freq=30)
fig, axes = plt.subplots(ncols=2, nrows=4, sharex=True, figsize=(16,8))
plotseasonal(gg_hl, axes[:,0],'国内股票')
plotseasonal(xg_hl, axes[:,1],'香港股票', color='orange')
plt.tight_layout()

在这里插入图片描述

from statsmodels.tsa.holtwinters import ExponentialSmoothing,Holt,SimpleExpSmoothing,HoltWintersResult
fit_HL_gg = Holt(np.asarray(data['国内股票'])).fit(smoothing_level = 0.3,smoothing_slope = 0.1)
data['gg_HL']=fit_HL_gg.predict(start=0,end=data.shape[0]-1)
fit_HL_xg = Holt(np.asarray(data['香港股票'])).fit(smoothing_level = 0.3,smoothing_slope = 0.1)
data['xg_HL']=fit_HL_xg.predict(start=0,end=data.shape[0]-1)

Holt Exponential

fit_HE_gg = Holt(np.asarray(data['国内股票']),exponential=True).fit(smoothing_level = 0.3,smoothing_
data['gg_HE']=fit_HE_gg.predict(start=0,end=data.shape[0]-1)

fit_HE_xg = Holt(np.asarray(data['香港股票']),exponential=True).fit(smoothing_level = 0.3,smoothing_
data['xg_HE']=fit_HE_xg.predict(start=0,end=data.shape[0]-1)

Damped Trend

fit_DT_gg = Holt(np.asarray(data['国内股票']),damped=True).fit(smoothing_level = 0.3,smoothing_slope
data['gg_DT']=fit_DT_gg.predict(start=0,end=data.shape[0]-1)

fit_DT_xg = Holt(np.asarray(data['香港股票']),damped=True).fit(smoothing_level = 0.3,smoothing_slope
data['xg_DT']=fit_DT_xg.predict(start=0,end=data.shape[0]-1)

data[['香港股票','xg_HL','xg_HE','xg_DT']].plot(figsize=(16,8),title="香港股票")

在这里插入图片描述
图中显示的是样本内(in sample)拟合结果，很明显是过拟合了。
下面计算MSE,RMSE,MAE

mse_gg_HL = mean_squared_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_HL'])
rmse_gg_HL = np.sqrt(mse_gg_HL)
mae_gg_HL = mean_absolute_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_HL'])
mse_gg_HE = mean_squared_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_HE'])
rmse_gg_HE = np.sqrt(mse_gg_HE)
mae_gg_HE = mean_absolute_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_HE'])
mse_gg_DT = mean_squared_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_DT'])
rmse_gg_DT = np.sqrt(mse_gg_DT)
mae_gg_DT = mean_absolute_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_DT'])

mse_xg_HL = mean_squared_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_HL'])
rmse_xg_HL = np.sqrt(mse_xg_HL)
mae_xg_HL = mean_absolute_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_HL'])
mse_xg_HE = mean_squared_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_HE'])
rmse_xg_HE = np.sqrt(mse_xg_HE)
mae_xg_HE = mean_absolute_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_HE'])
mse_xg_DT = mean_squared_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_DT'])
rmse_xg_DT = np.sqrt(mse_xg_DT)
mae_xg_DT = mean_absolute_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_DT'])

res_HL = pd.DataFrame([[mse_gg_HL,rmse_gg_HL,mae_gg_HL],[mse_xg_HL,rmse_xg_HL,mae_xg_HL]],
               index=['国内股票','香港股票'],
               columns=['HL_MSE','HL_RMSE','HL_MAE']).T
res_HE = pd.DataFrame([[mse_gg_HE,rmse_gg_HE,mae_gg_HE],[mse_xg_HE,rmse_xg_HE,mae_xg_HE]],
               index=['国内股票','香港股票'],
               columns=['HE_MSE','HE_RMSE','HE_MAE']).T
res_DT = pd.DataFrame([[mse_gg_DT,rmse_gg_DT,mae_gg_DT],[mse_xg_DT,rmse_xg_DT,mae_xg_DT]],
               index=['国内股票','香港股票'],
               columns=['DT_MSE','DT_RMSE','DT_MAE']).T

res = pd.concat([res,pd.concat([res_HL,res_HE,res_DT])])
res

在这里插入图片描述

Holt Winters

fit_hw_gg = ExponentialSmoothing(np.asarray(data['国内股票']) ,seasonal_periods=30 ,trend='add', sea
fit_hw_xg = ExponentialSmoothing(np.asarray(data['香港股票']) ,seasonal_periods=30 ,trend='add', sea

data['gg_HW']=fit_hw_gg.predict(start=0,end=data.shape[0]-1)
data['xg_HW']=fit_hw_xg.predict(start=0,end=data.shape[0]-1)

data[['国内股票','gg_HW']].plot(figsize=(16,8),title="国内股票 Holt Winter")

在这里插入图片描述

data[['香港股票','xg_HW']].plot(figsize=(16,8),title="香港股票 Holt Winter")

在这里插入图片描述

mse_gg_HW = mean_squared_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_HW'])
rmse_gg_HW = np.sqrt(mse_gg_HW)
mae_gg_HW = mean_absolute_error(data.iloc[30:]['国内股票'],data.iloc[30:]['gg_HW'])
mse_xg_HW = mean_squared_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_HW'])
rmse_xg_HW = np.sqrt(mse_xg_HW)
mae_xg_HW = mean_absolute_error(data.iloc[30:]['香港股票'],data.iloc[30:]['xg_HW'])

res_HW = pd.DataFrame([[mse_gg_HW,rmse_gg_HW,mae_gg_HW],[mse_xg_HW,rmse_xg_HW,mae_xg_HW]],
             index=['国内股票','香港股票'],
             columns=['HW_MSE','HW_RMSE','HW_MAE']).T
res_HW

在这里插入图片描述

res = pd.concat([res,res_HW])
res

在这里插入图片描述
Holt Winter 获得了最好的RMSE和MAE。

总结：

这一次我们使用Python实现了多种指数平滑模型，其中有些实现方法并不是非常精细，主要是参数还缺乏严格的回归测算和验证。这仅仅是一个演示而已。

附录：采用R代码来实现

library(forecast)
DataR <- read.csv(file="data/index4_data.csv", header=TRUE, sep=",",encoding="UTF-8")

fit_HL=holt(as.ts(DataR$国内股票),alpha=0.8,beta=0.2,initial="simple",h=5)
fit_ET=holt(as.ts(DataR$国内股票),alpha=0.8,beta=0.2,initial="simple",exponential=TRUE,h=5)
fit_DT=holt(as.ts(DataR$国内股票), alpha=0.8, beta=0.2, damped=TRUE, initial="optimal", h=5)

plot(as.ts(DataR$国内股票), ylab="国内股票", xlab="Year")
lines(fitted(fit_HL), col="blue")
lines(fitted(fit_ET), col="red")
lines(fitted(fit_DT), col="green")

lines(fit_HL$mean, col="blue", type="o")
lines(fit_ET$mean, col="red", type="o")
lines(fit_DT$mean, col="green", type="o")
legend("topleft", lty=1, col=c("black","blue","red","green"),
     c("Data","Holt's linear trend","Exponential trend","Additive damped trend"))

在这里插入图片描述

df = ts.union(fit_HL$fitted,fit_ET$fitted,fit_DT$fitted,dframe=TRUE)

fit_HL=holt(as.ts(DataR$香港股票),alpha=0.8,beta=0.2,initial="simple",h=5)
fit_ET=holt(as.ts(DataR$香港股票),alpha=0.8,beta=0.2,initial="simple",exponential=TRUE,h=5)
fit_DT=holt(as.ts(DataR$香港股票), alpha=0.8, beta=0.2, damped=TRUE, initial="optimal", h=5)

plot(as.ts(DataR$香港股票), ylab="香港股票", xlab="Year")
lines(fitted(fit_HL), col="blue")
lines(fitted(fit_ET), col="red")
lines(fitted(fit_DT), col="green")
lines(fit_HL$mean, col="blue", type="o")
lines(fit_ET$mean, col="red", type="o")
lines(fit_DT$mean, col="green", type="o")
legend("topleft", lty=1, col=c("black","blue","red","green"),
   c("Data","Holt's linear trend","Exponential trend","Additive damped trend"))

在这里插入图片描述

df1 = ts.union(fit_HL$fitted,fit_ET$fitted,fit_DT$fitted,dframe=TRUE)

Holt Winter Smoothing

fit1 <- hw(ts(DataR$国内股票,frequency=7),seasonal="additive")
fit2 <- hw(ts(DataR$国内股票,frequency=7),seasonal="multiplicative")

plot(ts(DataR$国内股票,frequency=7),ylab="国内股票Holt Winter",
type="o", xlab="Week")
lines(fitted(fit1), col="red", lty=2)
lines(fitted(fit2), col="green", lty=2)
lines(fit1$mean, type="o", col="red")
lines(fit2$mean, type="o", col="green")
legend("topleft",lty=1, pch=1, col=1:3,
    c("data","Holt Winters' Additive","Holt Winters' Multiplicative"))

在这里插入图片描述

colnames(df) <- c('gg_HL','gg_ET','gg_DT')
colnames(df1) <- c('xg_HL','xg_ET','xg_DT')

Innovation State Space Model
指定ETS(ANN)模型，就是加性错误、无趋势和无周期

fit_ets_gg <- ets(as.ts(DataR$国内股票), model="ANN")
plot(forecast(fit_ets_gg, h=3), ylab="国内股票")
fit_ets_gg$par

alpha
0.999899642674678
l
2625.83228814494
在这里插入图片描述

fit_model_ets <- ets(as.ts(DataR$国内股票))
summary(fit_model_ets)

在这里插入图片描述

write.csv(cbind(DataR,df,df1),file="data/index_new.csv",fileEncoding="UTF-8",quote=FALSE)

Magic Ktwc37

关注

1
点赞
踩
4

收藏

觉得还不错? 一键收藏
0
评论
时序分析 (11) - 时序平滑(下)

时序分析 (11)时序平滑(下)    在本篇文章中的前两部分，我们已经介绍了多种平滑技术，重点讲解了在时序分析和预测中占有重要地位的指数平滑技术。在最后一篇中，我们需要进入实践部分。我们将比较多种平滑技术对四指数数据的平滑效果。我们只对国内股票和香港股票的净值数据进行平滑SMA窗口为30天data = indexs_sub[['国内股票',...
复制链接

扫一扫

专栏目录