Report Notes: From General Game Playing to General Intelligence 全局游戏策略

Prof. Dongmo Zhang

School of Computing, Engineering and Mathematics, Western Sydney University Australia

General Game Playing (GGP)

History:

  • IBM-DeepBlue-Watson

  • Trading Agent Competition (TAC)

  • General Game Playing (GGP)

Wiki

General game playing (GGP) is the design of artificial intelligence programs to be able to play more than one game successfully.[1]

General Game Playing is a project of the Stanford Logic Group of Stanford University, California, which aims to create a platform for general game playing. It is the most well-known effort at standardizing GGP AI, and generally seen as the standard for GGP systems.

Special Intelligence vs. General Intelligence

Human intelligence

  • Special Intelligence: expert at specific tasks

    目标:outperform the best humans on specific task

  • General Intelligence: capable of taking any task

    不依赖人类专家知识

    目标:outperform average humans on any describable tasks

    模拟人类的长期记忆、短期记忆

General Game Player

  • general game player
    • 可以理解Game规则: 用一种特殊语言写出来
    • playing these games effectively without knowing the game rules in advance
    • special:人类想怎么玩,general:计算机自己想. Goal: 0-100 & utility 也要考虑

Game description language (GDL)

  • Game description language (GDL) 语形
    • logical language
    • role(p): p is a player
    • init(F)
    • true(F)
    • legal(P,A)
    • does(P,A)
    • next(F)
    • terminal
    • goal(P,N)
  • 目的:描述出来让计算机/人理解这个游戏

Game Model: Finite State Machine

语义

  • A finite game
  • Game transition
    • 看走哪条路能赢
    • Synchronous update: take a joint action as a move at each state
    • Environment chhanges only in response to moves
  • GGP Base: Standford http://www.ggp.org

Design a General Game Player

  • simple-player games

  • multi-player games

    • searching: Game Tree Search- minimax hueristic search(不容易找), α,β 剪枝

    • machine learning: monte carlo tree search

      alpha go: 蒙特卡洛树搜索 【reinforcement learning】&dl

    • game theory ?:假设走均衡的那一步

More

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值