References
[1] Ch 12.1: Model Free Reinforcement learning algorithms (Monte Carlo, SARSA, Q-learning)
[2] A new Q(λ)
[3] Fast online Q(λ)
[4] SARSA vs Q-learning
[5] Summary of Tabular Methods in Reinforcement Learning
[6] The Multi-Armed Bandit Problem and Its Solutions
[7] Exploration Strategies in Deep Reinforcement Learning
[8] 10 Real-Life Applications of Reinforcement Learning
[9] RL — Reinforcement Learning Algorithms Comparison
[10] Diego Unzueta