![](https://img-blog.csdnimg.cn/20201014180756927.png?x-oss-process=image/resize,m_fixed,h_64,w_64)
强化学习
文章平均质量分 76
强化学习相关内容,包含课程笔记,论文笔记等
KpLn_HJL
新的一天从认识到自己有多菜开始
展开
-
cs285学习笔记
ucb cs285学习笔记原创 2023-02-13 11:52:18 · 656 阅读 · 0 评论 -
Virtual-Taobao: Virtualizing Real-World Online Retail Environment for Reinforcement Learning
19-aaai-Virtual-Taobao Virtualizing Real World Online Retail Environment for Reinforcement Learning淘宝数据做的离线环境原创 2022-09-20 14:04:56 · 194 阅读 · 0 评论 -
RL4RS : A Real-World Benchmark for Reinforcement Learning based Recommender System
21-arxiv-RL4RS : A Real-World Benchmark for Reinforcement Learning based Recommender System网易伏羲出的一个,可以用在推荐系统上的rl离线仿真环境原创 2022-09-20 11:09:25 · 151 阅读 · 0 评论 -
Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Sea
21-kdd-Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search原创 2022-07-08 19:45:39 · 238 阅读 · 1 评论 -
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability
17-icml-Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observabilityagent用的HDRQN,multi-agent实现通过同时存储agent的trajectory,multi-task实现通过学习一个distilled agent原创 2022-07-08 17:31:53 · 388 阅读 · 0 评论 -
We Know What YouWant: An Advertising Strategy Recommender System for Online Advertising
21-kdd-We Know What YouWant: An Advertising Strategy Recommender System for Online Advertising,粗糙的广告主采纳模型原创 2022-07-07 15:13:50 · 157 阅读 · 0 评论 -
RECSIM: A Configurable Simulation Platform for Recommender Systems
google research发的关于RL用在推荐系统上的simulator原创 2022-05-05 16:58:19 · 342 阅读 · 0 评论 -
DRN: A Deep Reinforcement Learning Framework for News Recommendation
18-www-DRN: A Deep Reinforcement Learning Framework for News Recommendation用了dqn,reward里额外考虑了用户return原创 2022-04-29 10:17:11 · 205 阅读 · 0 评论 -
dqn/deep q network
强化学习dqn算法原创 2022-03-05 17:01:00 · 437 阅读 · 0 评论 -
DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems
21-aaai-DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems原创 2022-01-04 15:19:45 · 688 阅读 · 0 评论 -
maddpg/Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
17-nips-maddpg/Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments原创 2021-12-30 19:06:00 · 1362 阅读 · 0 评论 -
a3c/Asynchronous Methods for Deep Reinforcement Learning
16-icml-a3c原创 2021-12-30 15:32:28 · 374 阅读 · 0 评论 -
ddpg/Continuous control with deep reinforcement learning
16-iclr-DDPG/Continuous control with deep reinforcement learning原创 2021-12-29 18:11:27 · 346 阅读 · 0 评论 -
RL简单梳理
rl基础梳理,q-learning,sarsa-lambda,actor-critic原创 2021-12-28 19:18:48 · 350 阅读 · 0 评论 -
Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning
18-www-Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning原创 2021-12-09 16:19:17 · 168 阅读 · 0 评论 -
cs285-lec6-actor-critic
cs285, lec6, actor-critic原创 2021-10-20 22:10:17 · 149 阅读 · 0 评论 -
sarsa
强化学习 sarsa算法学习笔记原创 2021-09-22 13:52:37 · 114 阅读 · 0 评论 -
cs285-lec5-policy gradient
cs285 - lec5 - policy gradient原创 2021-09-21 14:28:39 · 279 阅读 · 0 评论 -
q-learning
强化学习q-learning原创 2021-09-12 16:29:32 · 106 阅读 · 0 评论 -
Introduction to Reinforcement Learning notes
文章目录PART 01 Introduction1.1 Reinforcement Learning1.2 Examples1.3 Elements of Reinforcement Learning1.4 Limitations and Scope1.5 An Extended Example: Tic-Tac-Toe1.6 Summary1.7 Early History of Reinfor...原创 2020-03-26 22:10:31 · 215 阅读 · 0 评论