Reinforcement Learning
Heuristic Algorithm; Decision making
No Knownledge
One more thing
展开
-
Imitation Learning
Imitation Learning原创 2023-09-12 16:34:31 · 45 阅读 · 0 评论 -
Value-based vs Policy-based Reinforcement Learning
强化学习; Value-based Reinforcement Learning; Policy-based Reinforcement Learning原创 2023-08-14 16:38:57 · 95 阅读 · 0 评论 -
价值学习(Value-Based Reinforcement Learning)
强化学习;价值学习原创 2023-08-13 23:21:05 · 40 阅读 · 0 评论 -
策略学习(Policy-Based Reinforcement Learning)
强化学习;策略学习原创 2023-08-13 22:23:12 · 257 阅读 · 0 评论 -
置信域策略优化Trust Region Policy Optimization (TRPO)
置信域策略优化;Trust Region Policy Optimization; TRPO原创 2023-08-13 21:03:42 · 149 阅读 · 0 评论 -
Softmax Strategy
强化学习;智能决策原创 2023-08-13 15:53:44 · 787 阅读 · 0 评论 -
The Epsilon-Greedy Algorithm
machine learning; decision-making原创 2023-08-13 15:38:34 · 95 阅读 · 0 评论 -
Exploration vs Exploitation (Multi-arm Bandit Problem)
策略制定, UCB算法转载 2022-10-09 21:24:11 · 133 阅读 · 0 评论