自主式战术决策建模框架系统
There is an increasing need for autonomous systems that exhibit effective decision-making in unpredictable
environments. However, the design of autonomous decision-making systems presents considerable challenges, particularly when they have to achieve their goals within a dynamic context. Tactics d
三维世界中达到人类水平性能:基于群体强化学习的多人游戏
Reinforcement learning (RL) has shown great success in increasingly complex single-agent
environments and two-player turn-based games. However, the real world contains multiple
agents, each learning and acting independently to cooperate and compete with other
agents. We used a tournament-style evalu
不完全信息下的多Agent评价.pdf
This paper investigates the evaluation of learned multiagent strategies in the incomplete information setting, which plays a critical role in ranking and training of
agents. Traditionally, researchers have relied on Elo ratings for this purpose, with
recent works also using methods based on Nash equilibria. Unfortunately, Elo is
unable to handle intransitive agent interactions, and other techniques are restricted
to zero-sum, two-player settings or are limited by the fact that the Nash equilibrium
is intractable to compute. Recently, a ranking method called α-Rank, relying on a
new graph-based game-theoretic solution concept, was shown to tractably apply
to general games. However, evaluations based on Elo or α-Rank typically assume
noise-free game outcomes, despite the data often being collected from noisy simulations, making this assumption unrealistic in practice. This paper investigates
multiagent evaluation in the incomplete information regime, involving general-sum
many-player games with noisy outcomes. We derive sample complexity guarantees
required to confidently rank agents in this setting. We propose adaptive algorithms
for accurate ranking, provide correctness and sample complexity guarantees, then
introduce a means of connecting uncertainties in noisy match outcomes to uncertainties in rankings. We evaluate the performance of these approaches in several
domains, including Bernoulli games, a soccer meta-game, and Kuhn poker.
深度学习技术在军事领域应用.pdf
一篇关于深度学习技术在军事领域中的应用的论文,PDF格式。主要介绍当前机器学习、深度学习技术、AI技术的军事应用。