Recurrent neural networks for multivariate time series with missing values文献阅读

这篇文章有附加材料,有比较详尽的数据处理等过程的介绍

0. Title

Multivariate Time Series with Missing Values

1. Research Objective

In this paper, we develop a novel deep learning model based on GRU, namely GRU-D, to effectively exploit two representations of informative missingness patterns, i.e., masking and time interval.

  • Masking informs the model which inputs are observed (or missing).
  • time interval encapsulates the input observation patterns.
    Thus not only capturing the long-term temporal dependencies of time series observations but also utilizing the missing patterns to improve the prediction results

2. Background and Problems

  • Multivariate time series data often inevitably carry missing observations due to various reasons which are usually informative missingness.
  • Traditional methods dealing with missing values in time series do not capture variable correlations and may not capture complex pattern to perform imputation.
  • In other methods, the missing patterns are not effectively explored in the prediction model, thus leading to suboptimal analyses results.

3. Method

Notations

  • a msking vector m t ∈ { 0 , 1 } D m_t \in \{0,1\}^D mt{0,1}D is introduced to denote which variables are missing at time step t t t.
    在这里插入图片描述
  • a time interval δ t d ∈ R \delta_t^d\in R δtdR for each variable d d d since its last observation

在这里插入图片描述

Model structure

在这里插入图片描述

Handling missing values

There are three straightforward ways to handle missing values without applying any imputation approaches or making any modifications to GRU network architecture.

  • GRU-Mean:simply to replace each missing observation with the mean of the variable across the training examples.
    在这里插入图片描述
  • GRU-Forward:assume any missing value is the same as its last measurement and use forward imputation.
    在这里插入图片描述
  • GRU-Simple:simply indicates which variables are missing and how long they have been missing as a part of input by concatenating the measurement
    在这里插入图片描述
  • GRU-Simple w/o m m m(without masking)
  • GRU-Simple w/o δ δ δ(without time interval)

GRU-D: model with trainable decays.

Two properties of the missing values in time series:

  • First, the value of the missing variable tend to be close to some default value if its last observation happens a long time ago.
  • Second, the influence of the input variables will fade away over time if the variable has been missing for a while.

Therefore, we propose a GRU-based model called GRU-D, in which a decay mechanism is designed for the input variables and the hidden states to capture the aforementioned properties.
A vector of decay rates was introduced for two purpose:

  • First, each input variable in health care time series has its own meaning and importance in medical applications. The decay rates should differ from variable to variable based on the underlying properties associated with the variables.
  • Second, as we see lots of missing patterns are informative and potentially useful in prediction tasks but unknown and possibly complex, we aim at learning decay rates from the training data rather than fixed a priori.
    在这里插入图片描述
    There are two decay mechanisms to utilize the missingness directly with the input feature values and implicitly in the RNN states.
  • First, for a missing variable, we use an input decay γ x \gamma_x γx to decay it over time toward the empirical mean
    在这里插入图片描述
  • hidden state decay γ h \gamma_h γh:this has an effect of decaying the extracted features (GRU hidden states) rather than raw input variables directly.
    在这里插入图片描述
    And new model becomes:
    在这里插入图片描述

4. Evaluation

5. Conclusion

  • These empirical findings validate our assumption that GRU-D utilizes the missing patterns only when the correlations are high and relies on the observed values when the correlations between labels and missing rates are low.

6. Notes

  • The author found that the value of missing rate is correlated with the labels, and the missing rate of variables with low missing rate are usually highly (either positive or negative) correlated with the labels which are mortality and ICD-9 diagnosis categories.

References

  • 0
    点赞
  • 2
    收藏
    觉得还不错? 一键收藏
  • 2
    评论
评论 2
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值