Method
Progressive learning scheme
The proposed method learns the architecture distribution by sampling from a Dirichlet distribution, which already injects a certain amount of stochasticity. If we directly apply the method together with the partial channel connection, the accuracy of the final architecture decreases dramatically.
To address this, we propose to gradually increase the fraction of channels forwarded to the mixed operations while pruning the operation space based on the learnt distribution, as in the sketch below.
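As a rough illustration of the staged schedule (the fractions, the beta values, and the `train_stage` stub below are our own illustrative assumptions, not the source's implementation):

```python
# Hypothetical sketch of the progressive scheme; schedule values, the beta
# numbers, and the train_stage stub are illustrative assumptions only.
channel_fractions = [0.25, 0.5, 0.75, 1.0]   # fraction of channels fed to the mixed ops
ops_to_keep       = [8, 6, 5, 4]             # size of the operation space per stage

# Learnt Dirichlet concentrations per operation (dummy values; in the method
# these come from the optimized architecture distribution).
beta = {"sep_conv_3x3": 2.1, "sep_conv_5x5": 1.7, "dil_conv_3x3": 1.4,
        "dil_conv_5x5": 1.2, "max_pool_3x3": 1.1, "avg_pool_3x3": 0.9,
        "skip_connect": 1.6, "none": 0.5}
ops = list(beta)

def train_stage(fraction, ops):              # stub standing in for supernet training
    print(f"fraction={fraction:.2f}, ops={ops}")

for fraction, keep in zip(channel_fractions, ops_to_keep):
    # Prune: keep only the operations with the largest learnt concentration.
    ops = sorted(ops, key=lambda op: -beta[op])[:keep]
    train_stage(fraction, ops)
```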
In practice, since the fraction of channels fed into the mixed operations keeps growing, we must widen the convolution kernels and the channel dimension of the BatchNorm layers accordingly. To this end, a random mapping function similar to Net2Net is used to enlarge every convolution weight.
For example:
$$W_{old} \in \mathbb{R}^{out_o \times in_o \times h \times w}, \qquad W_{new} \in \mathbb{R}^{out_n \times in_n \times h \times w}$$
If we widen the input channels:
$$\begin{aligned} r &= in_n - in_o \\ index &= rand(0,\, in_o,\, size=(r,)) \\ W_{new} &= Concat(W_{old},\ W_{old}[:, index, :, :],\ dim=1) \end{aligned}$$
Similarly, we can obtain the corresponding $W_{new}$ for widening the output channels.
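A minimal PyTorch sketch of this widening, covering both directions (the function name and the example shapes are our own, not from the source):

```python
import torch

def widen_conv_weight(w_old: torch.Tensor, in_n: int, out_n: int) -> torch.Tensor:
    """Enlarge a conv weight of shape (out, in, h, w) by replicating
    randomly chosen channels, as in the random mapping above (Net2Net-style)."""
    out_o, in_o = w_old.shape[0], w_old.shape[1]

    # Widen the input channels: append r randomly chosen input slices (dim=1).
    r = in_n - in_o
    if r > 0:
        index = torch.randint(0, in_o, (r,))
        w_old = torch.cat([w_old, w_old[:, index, :, :]], dim=1)

    # Widen the output channels the same way along dim=0.
    r = out_n - out_o
    if r > 0:
        index = torch.randint(0, out_o, (r,))
        w_old = torch.cat([w_old, w_old[index, :, :, :]], dim=0)

    return w_old

# Example: grow a 16-in/16-out 3x3 kernel to 24 input and 24 output channels.
w_new = widen_conv_weight(torch.randn(16, 16, 3, 3), in_n=24, out_n=24)
assert w_new.shape == (24, 24, 3, 3)
```

The BatchNorm weights and running statistics can be enlarged with the same index trick along their single channel dimension.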
Experiment
Since the Dirichlet concentration $\beta$ must be positive, we apply the shifted exponential linear mapping $\beta = ELU(n) + 1$ and optimize over $n$, as in the sketch below.
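A minimal sketch of this mapping (the tensor size and the Dirichlet sampling usage are our assumptions; $ELU(n) + 1 > 0$ for all $n$, so the constraint is satisfied by construction):

```python
import torch
import torch.nn.functional as F

# n is the unconstrained parameter the optimizer actually updates.
n = torch.zeros(8, requires_grad=True)   # e.g. one entry per candidate operation

# Shifted ELU keeps the concentration strictly positive: ELU(n) + 1 > 0.
beta = F.elu(n) + 1.0

# beta is a valid Dirichlet concentration and stays differentiable w.r.t. n.
arch_weights = torch.distributions.Dirichlet(beta).rsample()
```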