Financial Time Series Data Processing for Machine Learning 笔记

最新推荐文章于 2023-12-08 13:51:41 发布

|加勒比海带

最新推荐文章于 2023-12-08 13:51:41 发布

阅读量175

点赞数 1

分类专栏： Time Series 文章标签： Financial Time Series Label

本文链接：https://blog.csdn.net/weixin_41295283/article/details/97013258

版权

Time Series 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

论文地址：https://arxiv.org/abs/1907.03010

1.Scaling

1.MinMax

$\frac{x-x_{min}}{x_{max}-x_{min}}(max-min)+min$
max和min 是feature range。

2.Standardization

$\frac{x-\mu}{\sigma} \\ \mu:mean\\ \sigma: standard deviation$

2.Split

$train=\{S_0,\cdots,S_{ts-1}\}\qquad(1)\\ val=\{S_{ts},\cdots,S_{ts+vs-1}\}\qquad(2)\\ test=\{S_{ts+vs},\cdots,S_{K-1}\}\qquad(3)$

3.Labeling

Label	Description
N bars Up/Down	Classifier on $C_{t+n}>C_t$
N bars price change	Regressor on $C_{t+n}-C_t$
N bars log returns	Regressor on $log(\frac{C_{t+n}}{C_t})$
N bars Moving Average	Classifier on $MA_{t+n}>MA_t$
N bars trend Strength	Regressor on Trend
N bars trend Direction	Classifier on Trend
%Q after N bars	Regressor on %Q
QClass after N bars	Classifier on Qclass

If $C_t$ is the closing price at time $t$ , assume the corresponding time series slice of size $m$ ending by $C_t$ is defined by $S_t = C_{t−m+1},\cdots, C_t$
Then
$\%Q^{t+n}_{t+1} =\frac{HH^{t+n}_{t+1}-C_t}{HH^{t+n}_{t+1}-LL^{t+n}_{t+1}}$
Where

$n$ : time horizon in numbers of bars
$\%Q^{t+n}_{t+1}$ : %Q between $t + 1$ and $t + n$
$HH^{t+n}_{t+1}$ : Highest High price between $t + 1$ and $t + n$
$LL^{t+n}_{t+1}$ : Lowest Low price between $t + 1$ and $t + n$

So %Q is interpreted as following:

%Q = 1 when we have a perfect up move without any drawdown during the next $n$ bars
%Q = 0 when we have a perfect down move without any drawup during the next $n$ bars
%Q = 0:5 when we have an equally Up and Down move during the next $n$ bars

Class	Condition	Meaning
0	$\%Q>=0.6$	Up
1	$0.4<\%Q<0.6$	Neutral
2	$\%Q<=0.4$	Down

|加勒比海带

关注

1
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
Financial Time Series Data Processing for Machine Learning 笔记

1.Scaling1.MinMaxz=x−xminxmax−xmin(max−min)+minz = \frac{x-x_{min}}{x_{max}-x_{min}}(max-min)+minz=xmax−xminx−xmin(max−min)+minmax和min 是feature range。2.Standardizationz=x−μσμ:meanσ:standard...
复制链接

扫一扫