文章目录
前言
这是第三篇,如果你已经完成这篇内容可以看下一篇(赛道分析,函数思路,最优函数解析分享等内容)
同学们大家好,最近AI爆火,许多在校同学都与我一样想深入了解AI,于是来参加了deep racer student的比赛或尝试了解deepracer。
我能以总榜第三晋级也归功与以前积累的经验和向大佬的请教,现在关于deepracer的教程和博文很少,因此给我在前进路上带来了许多麻烦,所以我写下这篇文章希望能把经验分享给你,祝你成功。
一、follow the center line
1.代码
def reward_function(params):
# Example of rewarding the agent to follow center line
# Read input parameters
track_width = params['track_width']
distance_from_center = params['distance_from_center']
# Calculate 3 markers that are at varying distances away from the center line
marker_1 = 0.1 * track_width
marker_2 = 0.25 * track_width
marker_3 = 0.5 * track_width
# Give higher reward if the car is closer to center line and vice versa
if distance_from_center <= marker_1:
reward = 1.0
elif distance_from_center <= marker_2:
reward = 0.5
elif distance_from_center <= marker_3:
reward = 0.1
else:
reward = 1e-3 # likely crashed/ close to off track
return float(reward)
2.解析
def reward_function(params):
创建一个函数
track_width = params