有趣的的简单java算法_关于java：MiniMax算法非常有趣。什么会导致这种行为？...

袁崇赉

于 2021-02-25 07:00:05 发布

阅读量276

点赞数

文章标签：有趣的的简单java算法

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_42376118/article/details/114616783

版权

我已经实现了MiniMax算法(使用alpha-beta修剪)，但是它的表现方式很有趣。我的玩家会创造巨大的领先优势，但是当需要进入决赛并赢得胜利时，并不会采取这种行动，而只会不断拖累游戏。

这是我的minimax函数：

// Game states are represented by Node objects (holds the move and the board in that state)

//ValueStep is just a pair holding the minimax value and a game move (step)

private ValueStep minimax(Node gameState,int depth,int alpha,int beta) {

//Node.MAXDEPTH is a constant

if(depth == Node.MAXDEPTH || gameOver(gameState.board)) {

return new ValueStep(gameState.heuristicValue(),gameState.step);

}

//this method definately works. child nodes are created with a move and an

//updated board and MAX value

//which determines if they are the maximizing or minimizing players game states.

gameState.children = gameState.findPossibleStates();

if(state.MAX) { //maximizing player

ValueStep best = null;

for(Node child: gameState.children) {

ValueStep vs = new ValueStep(minimax(child,depth+1,alpha,beta).value,child.move);

//values updated here if needed

if(best==null || vs.value > best.value) best = vs;

if(vs.value > alpha) alpha = vs.value;

if(alpha >= beta) break;

}

return best;

} else { //minimizing player

ValueStep best = null;

for(Node child: gameState.children) {

ValueStep vs = new ValueStep(minimax(child,depth+1,alfa,beta).value,child.move);

if(best==null || vs.value < best.value) best = vs;

if(vs.value < beta) beta = vs.value;

if(alpha >= beta) break;

}

return best;

}

}

首先，我认为问题出在我的评估功能上，但是如果是这样，我找不到它。在这个游戏中，两个玩家都有得分，而我的功能只是根据得分差异计算启发式值。

这里是：

public int heuristicValue() {

//I calculate the score difference here in this state and save it in

//the variable scoreDiff. scoreDiff will be positive if I am winning

//here, negative if im loosing.

//"this" is a Node object here. If the game is over here, special

//heuristic values are returned, depending on who wins (or if its a

//draw)

if(gameOver(this.board)) {

if(scoreDiff>0) {

return Integer.MAX_VALUE;

} else if(scoreDiff==0) {

return 0;

} else {

return Integer.MIN_VALUE;

}

}

int value = 0;

value += 100*scoreDiff; //caluclate the heuristic value using the score differerence. If its high, the value will be high as well

return value;

}

我已将我的代码"翻译"为英语，因此可能会有错别字。我非常确定问题出在这里，但是如果您需要其他代码，那么我将更新问题。同样，我的玩家可以创造优势，但是由于某种原因，它不会使最终获胜的举动。

我感谢您的帮助！

假设您的Minimax玩家处于可以证明自己可以保证获胜的位置。通常仍然可以通过许多不同的方式来保证最终的胜利。有些举动可能是即时获胜，有些举动可能会不必要地拖累游戏……只要这不是一个愚蠢的举动突然让对手获胜(或平局)，它们都是胜利，而且都具有相同的优势游戏理论值(代码中的Integer.MAX_VALUE)。

您的Minimax算法不会区分这些移动，而只是播放恰好是gameState.children列表中第一行的移动。那可能是一个快速的，微弱的胜利，或者是一个缓慢的，非常深的胜利。

有两种简单的方法可以使您的Minimax算法优先于快赢优先于慢赢：

最好的选择(因为它还具有许多其他优点)是使用迭代加深。您可以查找详细信息，但是基本思想是，首先进行深度限制为1的Minimax搜索，然后再进行深度限制为2的另一个最大深度搜索，依此类推。以此类推。您的一项搜索证明是胜利，则可以终止搜索并进行获胜的举动。这将使您的算法始终喜欢最短的获胜(因为先找到获胜者)。

另外，您可以修改heuristicValue()函数以合并搜索深度。例如，您可以在获胜位置返回Integer.MAX_VALUE - depth。这将使更快的胜利实际上具有稍大的评估。

谢谢您的回答，2.点听起来真的很有希望！

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。