q learning matlab: Implementing a simple Q-learning algorithm in Matlab (learning to walk out of the room)

Latest recommended article published on 2024-06-12 09:49:03

月明朗

Tags: q learning matlab

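This post uses Matlab code to show how to implement a simple Q-learning algorithm that learns to find the exit from a set of rooms. Over 100 episodes, the agent starts in a random room, moves between rooms, and updates the Q-table step by step until it reaches the goal state.

(Summary generated automatically by CSDN.)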
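The listing below is cut off before the Q-table update, so the rule is stated here as an assumption: the simplified, deterministic Q-learning update (learning rate 1) that is commonly paired with this room example. The symbols $s$, $a$, and $s'$ are introduced for illustration; $R$ and $\gamma$ correspond to `Reward_table` and `gamma` in the code.

```latex
Q(s, a) \leftarrow R(s, a) + \gamma \max_{a'} Q(s', a')
% s  : current room
% a  : chosen action; in this example an action is named by its destination room, so s' = a
% s' : room reached after taking action a

% Worked example with \gamma = 0.8 and an all-zero Q-table:
% moving from room 2 into the hall (room 6):
Q(2, 6) = 100 + 0.8 \cdot 0 = 100
% a later move from room 4 to room 2 then gives:
Q(4, 2) = 0 + 0.8 \cdot 100 = 80
```

The reward reaches rooms further from the hall only through the discounted $\max$ term, which is why repeated episodes are needed.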

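I came across a simple and interesting Q-learning example and wrote some Matlab code to implement it. If you are interested, please read the linked original article first.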

dbstop if error % pause in the debugger if an error occurs

% Initialization
episode_num = 100; % number of training episodes
state_num = 6; % number of rooms (including the hall)
gamma = 0.8; % discount factor

% Reward_table(s, a): reward for moving from room s to room a
%  -1: no door between the two rooms
%   0: a door that does not lead into the hall
% 100: a door into the hall (room 6)
Reward_table = [
    -1 -1 -1 -1  0  -1; % 1
    -1 -1 -1  0 -1 100; % 2
    -1 -1 -1  0 -1  -1; % 3
    -1  0  0 -1  0  -1; % 4
     0 -1 -1  0 -1 100; % 5
    -1  0 -1 -1  0 100  % 6
];

Q_table = zeros(state_num, state_num);
final_state = 6;

for i = 1:episode_num
    % Start each episode in a randomly chosen room
    current_state = randperm(state_num,1);

    while current_state ~= final_state
        % List the actions (destination rooms) reachable from the current state
        Action_option_list = find(Reward_table(current_state,:)>-1);

%Rando
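The source listing breaks off at this point. As a minimal sketch of how the loop might be completed, the block below picks a random available action and applies the simplified update Q(s,a) = R(s,a) + gamma * max Q(s',:). This is an assumption, not the author's code: the variable `next_state`, the `route` read-out at the end, the fixed seed `rng(0)`, and the use of `randi` are all introduced here for illustration.

```matlab
% Sketch only: one plausible completion of the truncated listing.
rng(0);                         % fixed seed for repeatable runs (assumption)

episode_num = 100;              % number of training episodes
state_num   = 6;                % number of rooms (including the hall)
gamma       = 0.8;              % discount factor
final_state = 6;                % the hall

Reward_table = [
    -1 -1 -1 -1  0  -1;
    -1 -1 -1  0 -1 100;
    -1 -1 -1  0 -1  -1;
    -1  0  0 -1  0  -1;
     0 -1 -1  0 -1 100;
    -1  0 -1 -1  0 100];

Q_table = zeros(state_num, state_num);

for i = 1:episode_num
    current_state = randi(state_num);   % random starting room
    while current_state ~= final_state
        % Rooms reachable from the current room
        Action_option_list = find(Reward_table(current_state, :) > -1);
        % Pick one of them uniformly at random (pure exploration)
        next_state = Action_option_list(randi(numel(Action_option_list)));
        % Assumed update rule: immediate reward plus discounted best future value
        Q_table(current_state, next_state) = ...
            Reward_table(current_state, next_state) + ...
            gamma * max(Q_table(next_state, :));
        current_state = next_state;
    end
end

% Read off a greedy route from room 3 (start room chosen for illustration).
% The length guard stops the walk if the Q-table is not yet informative.
state = 3;
route = state;
while state ~= final_state && numel(route) <= state_num
    [~, state] = max(Q_table(state, :));
    route(end + 1) = state;
end
disp(Q_table);
disp(route);
```

Because the inner loop stops as soon as the hall is reached, row 6 of `Q_table` is never updated in this sketch, so the learned values stay bounded by 100 rather than growing as in variants that keep stepping from the goal state. With enough episodes the greedy route from room 3 typically passes through room 4 and then room 2 or 5 before reaching the hall.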
