See, feel, act: Hierarchical learning for complex manipulation skills with multisensory fusion

Research Topic

Inspired by:

  • Humans are able to seamlessly integrate tactile and visual stimuli with their intuitions to explore and execute complex manipulation skills.

While learning contact-rich manipulation skills, we face two important challenges: active perception and hybrid behavior.

怎样理解active preception 和 hybrid behavior是本文关键。

Methods

Evaluation metric

In this study, they evaluated the robot’s ability to play the game by counting the number of successful consecutive block extractions in randomly generated towers.

需要注意的一点是:
This paper emphasizes physics modeling and does not explicitly evaluate the adversarial nature of the game.

Task specifications

  • sensing
    The robot have access to its own pose, the pose of the blocks, and the forces applied to it at every time step.
  • action primitives
    The robot uses two ‘primitive’ actions, push and extract/place.
  • base exploration policy
    The robot has access to a base exploration policy for data collection.
  • termination criteria
    1. all blocks have been explored
    2. a block is dropped outside the tower
    3. the tower has toppled
  • tower and robot specifications

Simulation

使用MuJoCo作为仿真环境。在仿真环境中比较以下几种approach(前三种是model,后一种是policy)的performance:

  1. HMAs
    这篇文章提出的hierarchical model abstractions
  2. NN
    a feed-forward neural network as a representative nonhierarchical model-based approach
  3. MOR
    a mixture of regressions model as a generic hierarchical model-based approach
  4. PPO
    implementation of RL as a model-free approach

All methods have access to the same set of states, actions, and MPC.

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值