Paper Notes: Unsupervised (Meta) RL
A summary of unsupervised (meta) reinforcement learning. Contents: DIAYN (Key Idea, Formulation). DIAYN is short for "Diversity Is All You Need: Learning Skills without a Reward Function". Keywords: learning skills without a reward function; pretrained skills for downstream tasks...
Notes
MaxEnt RL
Inverse RL
Off-Policy Evaluation
policy gradient
meta learning
imitation learning
continual learning 
