每日一佳——Computational Rationalization: The Inverse Equilibrium Problem（Kevin Waugh et al. ，ICML ，2011）

最新推荐文章于 2021-05-15 01:56:01 发布

手撕机

最新推荐文章于 2021-05-15 01:56:01 发布

阅读量859

点赞数

文章标签：最佳论文计算合理化反均衡问题

原创文章，未经授权请勿转载。

本文链接：https://blog.csdn.net/guolindonggld/article/details/44619475

版权

PDF
这篇是2011年ICML的最佳论文。
题目意思：计算合理化：反均衡问题
摘要：
Modeling the purposeful behavior of imperfect agents from a small number of observations is a challenging task. When restricted to the single-agent decision-theoretic setting, inverse optimal control techniques assume that observed behavior is an approximately optimal solution to an unknown decision problem. These techniques learn a utility function that explains the example behavior and can then be used to accurately predict or imitate future behavior in similar observed or unobserved situations.
In this work, we consider similar tasks in competitive and cooperative multi-agent domains. Here, unlike single-agent settings, a player cannot myopically maximize its reward; it must speculate on how the other agents may act to influence the game’s outcome. Employing the game-theoretic notion of regret and the principle of maximum entropy, we introduce a technique for predicting and generalizing behavior.

通过运用博弈论中后悔的概念和最大熵原则，论文提出了一种可预测和概括行为的技术。
看来想看懂这篇论文，需要学一下博弈论。

因为论文还看不懂，下面就自娱自乐好了。
慢慢接触人工智能之后，感觉真正能够实现人工智能的时候，都不知需要多少百年，甚至千年。或者说永远也不可能实现真正的人工智能，就像永动机一样，是违背自然规律的。不过转而又想，就算不能实现像人类一样的人工智能，但也可以另外的形式实现人工智能。比如，虽然我们人类一直希望够能够飞翔，虽然不能真正像小鸟一样，但我们因此造出了飞机，进而又造出太空飞船，这已经不单单是飞翔的问题了。或许另辟蹊径，就能打破瓶颈。
让我们看一下人工智能所要完成的主要目标（也成为AI问题）：
1. Reasoning（推理）
2. Knowledge Representation（知识表示）
3. Automated Planning and Scheduling（自动规划）
4. Machine Learning（机器学习）
5. Natural Language Processing（自然语言处理）
6. Computer Vision（计算机视觉）
7. Robotics（机器人学）
8. General Intelligence/Strong AI（通用智能/强人工智能）

单词：
myopically [maɪ’ɒpɪkəlɪ] 目光短浅地