@主流强化学习算法分类:On policy,Off policy;离散,连续;基于策略,基于值
引自论文:A. Feriani and E. Hossain, “Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial,” IEEE Communications Surveys & Tutorials, vol. 23, no. 2, pp. 1226-1252, 2021.
主流强化学习算法分类
于 2024-07-18 10:38:04 首次发布