十五数码问题A*算法求解

1 问题描述

15数码问题是人工智能中的一个经典问题。15数码问题就是在一个4*4的16宫格棋盘上,摆放有15个张牌,每一个都刻有1-15中的某一个数码。棋盘中留有一个空格,允许其周围的某一个将牌向空格移动,这样通过移动将牌就可以不断改变将牌的布局。所要求解的问题:是给定一种初始棋局(初始状态)和一个目标棋局(目标状态),问如何移动数码实现从初始状态到目标状态的转变。

2 算法流程

在这里插入图片描述

3 程序源码

本程序基于python 3.7实现,未调用第三方扩展库,只使用了copy与timeit两个内置的函数库。
对于程序中的例子,在i7-8代处理器,4核八线程的CPU运行条件下,单线程运行,整个程序的时间在1小时20分钟左右。
程序中有一些小小的性能优化,主要是在于open表的处理上面。


import copy
# 定位元素element在state状态中的位置
def locate_element(state, element):
    assert 0 <= element <= 15
    x, y = "", ""
    for i in range(len(state)):
        try:
            x = state[i].index(element)
            y = i
        except ValueError:
            pass
    assert isinstance(x, int) and isinstance(y, int)
    return y, x


# 将0与上下左右的元素交换,不满足交换条件则返回原状态
# 输入交换前的状态,输出交换后的状态  [0, 3]
# direction:"up","down","left","right"
def swap(state, direction):
    new_state = copy.deepcopy(state)  # 深拷贝一份
    zero_y, zero_x = locate_element(new_state, 0)
    if direction == "up":
        if zero_y >= 1:
            new_state[zero_y][zero_x] = new_state[zero_y-1][zero_x]
            new_state[zero_y-1][zero_x] = 0
        return new_state
    elif direction == "down":
        if zero_y <= 2:
            new_state[zero_y][zero_x] = new_state[zero_y+1][zero_x]
            new_state[zero_y+1][zero_x] = 0
        return new_state
    elif direction == "left":
        if zero_x >= 1:
            new_state[zero_y][zero_x] = new_state[zero_y][zero_x-1]
            new_state[zero_y][zero_x-1] = 0
        return new_state
    elif direction == "right":
        if zero_x <= 2:
            new_state[zero_y][zero_x] = new_state[zero_y][zero_x+1]
            new_state[zero_y][zero_x+1] = 0
        return new_state
    else:
        raise("input error in function swap !")


# 计算估价函数f(n)=g(n)+h(n), g(n) = tree_deep
def evaluation_function(state, tree_deep, final_state):
    h_n = 0
    for i in range(0,16,1):
        y1, x1 = locate_element(state, i)
        y2, x2 = locate_element(final_state, i)
        h_n += abs(y1-y2) + abs(x1-x2)
    return h_n + tree_deep


# 在closed表中寻找最优解路径,返回想查找[state,num,father_num]的所有父节点列表,直到father_num=0
def search_optimal_path(closed_list, state_pack):
    if closed_list == []:
        return []
    res_path = [state_pack]
    fa = state_pack[2]
    while fa != 0:
        res_path.append(closed_list[fa])
        fa = closed_list[fa][2]
    return res_path


# 使用A*算法进行启发式搜索的主函数
# 输入的state_initial size 4x4, state_target size 4x4
def solve(state_initial, state_target):
    open_list = []
    closed_list = []
    history_list = []
    evaluation_open_list = []
    open_list.append([state_initial, 0, 0])  # 后两个参数为第几层,父节点是哪号
    evaluation_open_list.append(evaluation_function(state_initial, 0, state_target))
    history_list.append(state_initial)
    while open_list != []:  # open_list 不为空时
        min_evaluation_index = evaluation_open_list.index(min(evaluation_open_list))
        # 扩展这个节点
        state = open_list.pop(min_evaluation_index)
        evaluation_open_list.pop(min_evaluation_index)
        print(state)
        # 判断open_list第一个元素是不是target
        if state[0] == state_target:
            print(search_optimal_path(closed_list, state))
            break
        closed_list.append(state)
        for direction in ["up", "down", "left", "right"]:
            new_state = swap(state[0], direction)
            if new_state not in history_list:
                tree_deep = closed_list[-1][1]+1
                open_list.append([new_state, tree_deep, len(closed_list)-1])
                evaluation_open_list.append(evaluation_function(new_state, tree_deep, state_target))
                history_list.append(new_state)


if __name__ == "__main__":
    state_initial = [[11,9,4,15],[1,3,0,12],[7,5,8,6],[13,2,10,14]]
    state_target = [[1,2,3,4],[5,6,7,8],[9,10,11,12],[13,14,15,0]]
    import timeit
    t1 = timeit.default_timer()
    solve(state_initial, state_target)  # 目标和结果一致时需考虑
    t2 = timeit.default_timer()
    print("run time:", t2-t1)
  • 0
    点赞
  • 27
    收藏
    觉得还不错? 一键收藏
  • 1
    评论
评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值