python alphago_(Keras/TensorFlow)用AlphaGo Zero方法实现增强学习下棋

最新推荐文章于 2021-02-09 01:04:16 发布

weixin_39610678

最新推荐文章于 2021-02-09 01:04:16 发布

阅读量299

点赞数

文章标签： python alphago

本文链接：https://blog.csdn.net/weixin_39610678/article/details/111419026

版权

About

Chess reinforcement learning by AlphaGo Zero methods.

This project is based on these main resources:

The great Reversi development of the DeepMind ideas that @mokemokechicken did in his repo: https://github.com/mokemokechicken/reversi-alpha-zero

DeepMind just released a new version of AlphaGo Zero (named now AlphaZero) where they master chess from scratch: https://arxiv.org/pdf/1712.01815.pdf. In fact, in chess AlphaZero outperformed Stockfish after just 4 hours (300k steps) Wow!

See the wiki for more details.

Note

I'm the creator of this repo. I (and some others collaborators did our best: https://github.com/Zeta36/chess-alpha-zero/graphs/contributors) but we found the self-play is too much costed for an only machine. Supervised learning worked fine but we never try the self-play by itself.

Anyway I want to mention we have moved to a new repo where lot of people is working in a distributed version of AZ for chess (MCTS in C++): https://github.com/glinscott/leela-chess

Project is almost done and everybody will be able to participate just by executing a pre-compiled windows (or Linux) application. A really great job and effort has been done is this project and I'm pretty sure we'll be able to simulate the DeepMind results in not too long time of distributed cooperation.

So, I ask everybody that wish to see a UCI engine running a neural network to beat Stockfish go into that repo a

最低0.47元/天解锁文章

weixin_39610678

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python alphago_(Keras/TensorFlow)用AlphaGo Zero方法实现增强学习下棋

AboutChess reinforcement learning by AlphaGo Zero methods.This project is based on these main resources:The great Reversi development of the DeepMind ideas that @mokemokechicken did in his repo: htt...
复制链接

扫一扫