RNN's problem
  Vanishing gradient
    Solutions: LSTM, GRU, LSTM vs. GRU, residual connections, DenseNet, HighwayNet
Bidirectional RNNs
Multi-layer RNNs (stacked RNNs)
Exploding gradient
  Gradient clipping
In summary