Dehazing via Reinforcement Learning

Idea: We model the dehazing problem as a Markov Decision Process (MDP) whose actions are several simple, existing traditional image-processing operations and prior-knowledge-based dehazing methods.


We use Deep Q-Network (DQN), a deep reinforcement learning framework.
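As a rough illustration of the MDP view, the sketch below rolls out one episode: at each step the agent looks at the current image, picks the action with the highest Q-value (or a random one with probability `epsilon`), and applies it until a stop action is chosen. Everything here is a toy assumption made for illustration: `run_episode`, `brighten`, `darken`, `stop`, and `toy_q` are hypothetical names, the "image" is a flat list of intensities, and `toy_q` is a hand-written stand-in for a trained DQN.

```python
import random

def run_episode(image, actions, q_fn, stop_action, max_steps=10, epsilon=0.0, rng=None):
    """Roll out one episode: pick an action per step, apply it, end on stop_action."""
    rng = rng or random.Random(0)
    state, history = image, []
    for _ in range(max_steps):
        q = q_fn(state)                                   # one Q-value per action
        if rng.random() < epsilon:
            action = rng.randrange(len(actions))          # explore
        else:
            action = max(range(len(actions)), key=lambda i: q[i])  # exploit
        history.append(action)
        if action == stop_action:
            break
        state = actions[action](state)                    # state transition
    return state, history

# Hypothetical toy actions on a flat list of pixel intensities in [0, 1].
def brighten(img):
    return [min(1.0, p + 0.1) for p in img]

def darken(img):
    return [max(0.0, p - 0.1) for p in img]

def stop(img):
    return img

def toy_q(state):
    # Stand-in for a trained network: brighten until the mean reaches 0.5, then stop.
    mean = sum(state) / len(state)
    return [1.0, 0.0, 0.5] if mean < 0.5 else [0.0, 0.0, 1.0]

final_state, history = run_episode([0.2, 0.3], [brighten, darken, stop], toy_q, stop_action=2)
```

In the notes, the actions would be the image-processing operations and dehazing methods, and the Q-values would come from the agent network rather than a fixed rule.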


Agent Network

  1. The role of the Agent Network: extract features.

  2. The structure of the Agent Network: it consists of four fully connected layers, and each of the first three layers is followed by a ReLU layer.

  3. The effect of the Agent Network: it effectively captures the internal cues and promotes the selection of the correct action during the decision sequence.

  4. The update of the Agent Network: via the Q-learning process given by Eq. (2)

    1. $Q^*(s,a) = r + \gamma \max_{a'} Q^*(s',a')$: the immediate reward $r$ plus the value, discounted by $\gamma$, of the optimal action $a'$ selected at the next state $s'$.
  5. The loss function of the Agent Network

    1. The loss function of the agent network is defined as

      [screenshot: equation not available]
    2. The gradient of the loss function is defined as

      [screenshot: equation not available]
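The structure and update described in this list can be sketched in plain Python: four fully connected layers with ReLU after the first three, a Q-learning target, and a squared-error loss. This is a minimal sketch under stated assumptions, not the article's implementation: the feature size (8), hidden width (16), random initialization, and discount factor `gamma` are illustrative guesses, the output size of 11 assumes one Q-value per action, and the squared TD error is the common DQN loss, used here because the article's own loss equation is missing from these notes.

```python
import random

def relu(v):
    return [max(0.0, x) for x in v]

def linear(v, weights, bias):
    # weights holds one row per output unit
    return [sum(w * x for w, x in zip(row, v)) + b for row, b in zip(weights, bias)]

def init_layer(n_in, n_out, rng):
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out

def init_agent(sizes, seed=0):
    # Five sizes give four fully connected layers.
    rng = random.Random(seed)
    return [init_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]

def q_values(params, features):
    h = features
    for i, (w, b) in enumerate(params):
        h = linear(h, w, b)
        if i < len(params) - 1:      # ReLU after the first three layers only
            h = relu(h)
    return h                          # one Q-value per action

def td_target(reward, next_q, gamma=0.99, terminal=False):
    # Q-learning target: r + gamma * max_a' Q(s', a'); just r at the end of an episode.
    return reward if terminal else reward + gamma * max(next_q)

def td_loss(q_sa, target):
    # Squared temporal-difference error (assumed form of the loss).
    return (target - q_sa) ** 2

params = init_agent([8, 16, 16, 16, 11])   # sizes are assumptions, see above
q = q_values(params, [0.5] * 8)
loss = td_loss(q[0], td_target(1.0, q, gamma=0.9))
```

With this loss, the gradient with respect to the network parameters is $-2\,(\text{target} - Q(s,a))\,\partial Q(s,a)/\partial\theta$ when the target is held fixed, which is the quantity a gradient-descent update would follow.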

Action space A


  1. Add $DCP^* = \min(DCP(x), x)$ to improve performance in the white areas of the image.
  2. Define actions 5 to 10, which transform the color channels' values; action 11 signals that the image dehazing process is done.
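The $DCP^*$ action amounts to a pixel-wise minimum between the DCP output and the hazy input, so the result is never brighter than the input at any pixel. The sketch below shows only that combination step; `dcp_star` is a hypothetical name, the images are single-channel nested lists, and the DCP result is passed in precomputed because the dark channel prior itself is outside the scope of these notes.

```python
def dcp_star(dcp_output, hazy):
    """Pixel-wise min(DCP(x), x) for two images of the same shape."""
    return [
        [min(d, x) for d, x in zip(d_row, x_row)]
        for d_row, x_row in zip(dcp_output, hazy)
    ]

hazy = [[0.9, 0.4], [0.8, 0.95]]          # input image x (toy values)
dcp_output = [[1.0, 0.2], [0.5, 1.0]]     # stand-in for DCP(x)
result = dcp_star(dcp_output, hazy)
```

For a color image, the same function could be applied to each channel in turn.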

Reward function $R_t$

  1. Adopt both PSNR and SSIM metrics as the measurement

    [screenshot: equation not available]
  2. Based on L2, PSNR, and SSIM, the article designs the reward function as:

[screenshot: equation not available]
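Since the reward equation itself is missing here, the sketch below only illustrates two of its ingredients: the L2 (mean squared) error and the PSNR derived from it, using the standard definition $PSNR = 10 \log_{10}(MAX^2 / MSE)$. The `step_reward` function is purely an assumption made for illustration (reward as the PSNR gain of one step) and is not the article's formula; SSIM is left out to keep the example short.

```python
import math

def mse(a, b):
    """Mean squared (L2) error between two equal-length pixel lists."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

def psnr(a, b, max_value=1.0):
    """Peak signal-to-noise ratio in dB; infinite when the images match exactly."""
    err = mse(a, b)
    return float("inf") if err == 0 else 10.0 * math.log10(max_value ** 2 / err)

def step_reward(prev_img, curr_img, target):
    # ASSUMPTION: reward is the PSNR improvement produced by the latest action.
    return psnr(curr_img, target) - psnr(prev_img, target)

target = [0.1, 0.1]                       # toy haze-free reference
reward = step_reward([0.0, 0.0], [0.05, 0.05], target)
```

A positive value means the latest action moved the image closer to the reference, a negative value means it moved away.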

Reinforcement learning framework

Classification of reinforcement learning:

  1. value-based -> DQN
  2. policy-based
  3. actor-critic