from paper:CONTINUOUS CONTROL WITH DEEP EINFORCEMENT LEARNING
维度
task name | dim(s) | dim(a) | dim(o) |
---|---|---|---|
blockworld1 | 18 | 5 | 43 |
blockworld3da | 31 | 9 | 102 |
canada | 22 | 7 | 62 |
canada2d | 14 | 3 | 29 |
cart | 2 | 1 | 3 |
cartpole | 4 | 1 | 14 |
cartpoleBalance | 4 | 1 | 14 |
cartpoleParallelDouble | 6 | 1 | 16 |
cartpoleParallelTriple | 8 | 1 | 23 |
cartpoleSerialDouble | 6 | 1 | 14 |
cartpoleSerialTriple | 8 | 1 | 23 |
cheetah | 18 | 6 | 17 |
fixedReacher | 10 | 3 | 23 |
fixedReacherDouble | 8 | 2 | 18 |
fixedReacherSingle | 6 | 1 | 13 |
gripper | 18 | 5 | 43 |
gripperRandom | 18 | 5 | 43 |
hardCheetah | 18 | 6 | 17 |
hardCheetahNice | 18 | 6 | 17 |
hopper | 14 | 4 | 14 |
hyq | 37 | 12 | 37 |
hyqKick | 37 | 12 | 37 |
movingGripper | 22 | 7 | 49 |
movingGripperRandom | 22 | 7 | 49 |
pendulum | 2 | 1 | 3 |
reacher | 10 | 3 | 23 |
reacher3daFixedTarget | 20 | 7 | 61 |
reacher3daRandomTarget | 20 | 7 | 61 |
reacherDouble | 6 | 1 | 13 |
reacherObstacle | 18 | 5 | 38 |
reacherSingle | 6 | 1 | 13 |
walker2d | 18 | 6 | 41 |