障碍塔挑战赛第二轮今天开始

In February, we launched the first round of the Obstacle Tower Challenge. Since the closing of the round, we have received 2000+ entries from 350+ teams. We want to thank all the participants of the first round, congratulate the top eligible teams moving on to round 2, and share the new round 2 version of Obstacle Tower with the public.

2月,我们启动了障碍塔挑战赛的第一轮比赛 。 自该轮比赛结束以来,我们已经收到来自350多个团队的2000多个参赛作品。 我们要感谢第一轮比赛的所有参与者,祝贺进入第二轮比赛的最有资格的团队,并与公众分享新的第二轮障碍塔。

Today, we are starting the second round of the Obstacle Tower Challenge for eligible teams. For a team to make it into this round, they needed to train and submit an agent that could achieve an average score of five on unseen versions of the tower. As we described here, this was no trivial feat. We want to congratulate the top teams that made it this far and also thank our partners at Google Cloud Platform for providing GCP credits to the eligible teams and AICrowd for hosting the challenge.

今天,我们为符合条件的团队开始第二轮障碍塔挑战赛。 为了使一支团队进入这一轮比赛,他们需要培训并提交一名探员,该探员在看不见的塔架上平均得分为5分。 就像我们在这里描述的那样,这绝非易事。 我们要祝贺到目前为止取得优异成绩的顶级团队,也感谢Google Cloud Platform上的合作伙伴向符合条件的团队提供GCP积分,以及AICrowd主持了此次挑战。

To go along with this new round of the challenge, we are also releasing version 2.0 of the Obstacle Tower Environment. We’ve expanded the number of floors in the tower from 25 to 100, with these higher floors including many new visual styles, obstacles, puzzles to solve and enemies to avoid. We have also significantly expanded the customizability of the environment for researchers looking to study generalization in AI. Although only the top eligible teams will move on to round 2 of the challenge, we encourage everyone to download and try out the latest version.  After round 2 of the challenge, we plan to open source the Obstacle Tower Environment.

为了应对新一轮的挑战,我们还将发布《 障碍塔环境》的 2.0版。 我们已将塔楼的楼层数从25个增加到100个,这些较高的楼层包括许多新的视觉样式,障碍,需要解决的难题和应避免的敌人。 我们还为希望研究AI泛化的研究人员大大扩展了环境的可定制性。 尽管只有符合条件的顶级团队才能进入挑战的第二轮,但我们还是鼓励大家下载并试用最新版本。 在挑战的第二轮之后,我们计划开放障碍塔环境的源代码。

障碍塔环境v2.0中的新增功能 (What’s new in the Obstacle Tower Environment v2.0)

扩展可用楼层 (Expanded available floors)

We’ve expanded the Obstacle Tower Environment to further push the agent’s need to generalize across new and unseen combinations and floors. Whereas the original version of the environment had only 25 floors in the tower, this new version has 100. These higher floors contain three new visual styles – Industrial, Modern, and Future.

我们扩展了障碍塔环境,以进一步推动代理商对新的和看不见的组合和楼层进行概括的需求。 环境的原始版本在塔中只有25层,而新版本则有100层。这些较高的楼层包含三种新的视觉样式:工业,现代和未来。

Not only do the higher floors contain different visual appearances, but they also provide additional challenges. All of the mechanics present in the early floors are still present and expanded upon in difficulty in the higher floors. In addition to those mechanics, the higher floors also contain: enemies to dodge, distracting TVs to avoid, more complex floor layouts with circling paths, and larger rooms on each floor with additional platforming challenges.

较高的楼层不仅包含不同的视觉外观,而且还带来其他挑战。 早期楼层中存在的所有机制仍然存在并且在较高楼层中因困难而扩展。 除了这些机制之外,较高的楼层还包含:躲闪的敌人,要避开的电视分心 ,带有盘旋路径的更复杂的地板布局以及每层较大的房间以及其他平台问题。

New Visual Styles – Industrial, Modern, Future

新的视觉风格–工业,现代,未来

New Obstacles and Distractions

新的障碍和干扰

通过重置参数自定义环境 (
Customizing the environments through reset parameters)

In the original release of Obstacle Tower, it was only possible to change the starting floor and initialization seed from the python api. In the 2.0 release, we have significantly expanded on the number of available parameters which can be customized when resetting the environment. These include the ability to change things like the lighting, visual theme, floor layouts, and room contents on the floors in the tower.

在原始版本的Obstacle Tower中,只能从python api更改起始层和初始化种子。 在2.0版本中,我们极大地扩展了重置环境时可以自定义的可用参数的数量。 这些功能包括更改诸如照明,视觉主题,地板布局以及塔楼地板上的房间内容之类的功能。

根据第一轮的反馈进行更改和改进 (Changes and improvements based on feedback from Round 1)

Perhaps most importantly, we were able to make changes based directly on the feedback of the users during Round 1 of the challenge. Many of these consisted of bugs or feature requests which were made by the participants themselves, but some of the changes were based on bugs that were only found by agents learning to maximize their scores in the game. One feedback we received was that the placement of the reset button in puzzle rooms was unintuitive. As such, we have now separated out the block, goal, and reset button positions in these rooms, to make it less likely that the agent will press the reset button by accident.

也许最重要的是,我们能够直接在挑战的第1轮中根据用户的反馈进行更改。 其中许多是由参与者自己提出的错误或功能请求组成的,但某些更改是基于仅由代理商学习以在游戏中获得最大得分的错误所发现的。 我们收到的反馈是,重置按钮在拼图室中的放置是不直观的。 因此,我们现在已经将这些房间中的格挡,目标和重置按钮位置分开了,以减少座席意外按下重置按钮的可能性。

现在有空 (Available now)

The Obstacle Tower Environment natively supports the Unity ML-Agents Toolkit and is available to download here. For general issues or discussion of the environment itself, see our GitHub repo. To learn more about the environment, read our research paper. For those interested in an open source release, we are happy to share that we will be releasing the full source code for the Unity project at the end of Round 2. Our aim is to provide a foundation for researchers and the community to extend the Obstacle Tower environment in order to further advance RL research. In the meantime, we hope that the expanded reset parameters will give researchers a lot to flexibility. We can’t wait to see how you use the environment in new and unexpected ways.

障碍塔环境本身支持Unity ML-Agents工具包 ,可从此处下载。 有关环境本身的一般问题或讨论,请参见我们的GitHub repo 。 要了解有关环境的更多信息,请阅读我们的研究论文 。 对于那些对开源发行版感兴趣的人,我们很高兴与大家分享我们将在第二轮结束时发布Unity项目的完整源代码。我们的目的是为研究人员和社区扩展障碍提供基础。塔楼环境是为了进一步推进RL研究。 同时,我们希望扩展的重置参数将为研究人员带来很大的灵活性。 我们迫不及待地想看看您如何以新的出乎意料的方式使用环境。

And lastly, we would like to thank all the contributions and testers for helping us improve the Obstacle Tower Environment v2.0.

最后,我们要感谢所有的贡献者和测试人员,他们帮助我们改善了障碍塔环境v2.0。

第二轮决赛入围者 (Round 2 finalists)

Congratulations to the teams who are moving on to Round 2! When we first launched the challenge a few months ago, we didn’t know how far to expect participants to be able to make it into the tower. We have been pleasantly surprised to find that the top submissions have been of agents not only able to master the mechanics of finding and utilizing keys, but also the mechanics of pushing blocks to solve puzzles.

祝贺正在进入第二轮的团队! 几个月前,当我们首次发起挑战时,我们不知道期望参与者能够进入挑战塔的距离。 我们惊讶地发现,最受关注的是代理不仅能够掌握查找和使用键的机制,而且还能够推动推块解决难题的机制。

Below is the final list. Please note, this may differ from the AICrowd leaderboard due to disqualifications and eligibility for the contest.

下面是最终列表。 请注意,由于取消比赛资格和参加比赛的资格,这可能与AICrowd排行榜有所不同。

ParticipantRound 1 Average FloorsRound 1 Average Reward
unixpickle16.40129.881
joe_booth10.00116.461
dougm9.60115.921
karolisram8.40113.321
sova8768.20113.121
giadefa8.00112.821
wywarren8.00112.581
PerInDisguise7.00110.681
tatsuyaogawa6.6019.721
STAR.Lab6.0018.721
tky6.0018.661
sungbinchoi5.6018.101
ipv65.6018.061
kyunghyunlee5.6017.881
denamganai_kevin5.6017.861
adamloch5.4017.621
rudy_gilman5.4017.601
wenyuyangpku5.4017.561
oleksandra_fedorova5.4017.521
hanschoi865.4017.501
TruthMaker5.4017.501
petr5.4017.501
BIgG5.4017.501
ub5.4017.481
Petero5.4017.481
duc_nguyen5.4017.441
gardenermike5.2017.461
kenshi_abe5.2017.241
hyochini5.2017.221
gr1d5.2017.201
Leckofunny5.2017.161
steven5.2017.141
cit5.2017.101
felixlaumon5.0016.881
xihe5.0016.861
kyushik_min5.0016.861
Miffyli5.0016.781
HappySlice5.0016.781
thesoenke5.0016.781
paullewislobo5.0016.761
andwetry5.0016.761
alex_gomez5.0016.761
Parilo5.0016.721
banjtheman5.0016.721
参加者 第一轮平均楼层数 第一轮平均奖励
善变 16.401 29.881
joe_booth 10.001 16.461
道格 9.601 15.921
karolisram 8.401 13.321
sova876 8.201 13.121
贾迪法 8.001 12.821
怀华伦 8.001 12.581
伪装 7.001 10.681
辰尾川 6.601 9.721
STAR.Lab 6.001 8.721
ky 6.001 8.661
松宾町 5.601 8.101
ipv6 5.601 8.061
庆贤里 5.601 7.881
denamganai_kevin 5.601 7.861
亚当洛奇 5.401 7.621
rudy_gilman 5.401 7.601
文宇阳库 5.401 7.561
oleksandra_fedorova 5.401 7.521
hanschoi86 5.401 7.501
真理制造者 5.401 7.501
彼得 5.401 7.501
抗体 5.401 7.501
ub 5.401 7.481
彼多 5.401 7.481
duc_nguyen 5.401 7.441
园丁 5.201 7.461
kenshi_abe 5.201 7.241
霍奇尼 5.201 7.221
gr1d 5.201 7.201
莱科芬尼 5.201 7.161
史蒂文 5.201 7.141
cit 5.201 7.101
费利克斯劳蒙 5.001 6.881
西河 5.001 6.861
kyushik_min 5.001 6.861
米菲利 5.001 6.781
快乐片 5.001 6.781
索恩克 5.001 6.781
帕卢韦斯洛博 5.001 6.761
安德 5.001 6.761
亚历克斯·戈麦斯 5.001 6.761
帕里洛 5.001 6.721
班杰曼 5.001 6.721

If you have any questions about the challenge please email us at OTC@unity3d.com. If you’d like to work on this exciting intersection of Machine Learning and Games, we are hiring for several positions, please apply!

如果您对挑战有任何疑问,请发送电子邮件至OTC@unity3d.com 。 如果您想在机器学习和游戏这个令人兴奋的交叉领域工作,我们正在招聘几个职位,请申请

翻译自: https://blogs.unity3d.com/2019/05/15/obstacle-tower-challenge-round-2-begins-today/

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值