MetaD2A

夜信_

Last modified 2023-01-25 17:35:37


Category: AutoML. Tags: deep learning

First published 2023-01-18 14:53:36

Article link: https://blog.csdn.net/qq_42454156/article/details/128714962


Motivation

Conventional NAS methods are task-specific: for every unseen dataset they search for a new architecture from scratch, which incurs a large computational cost.

This paper proposes an efficient NAS framework that is trained once on a database of datasets and pretrained networks, and can then rapidly search for a neural architecture for a novel dataset.

A meta-performance predictor is also introduced to estimate the accuracy of the generated architectures and select the best one.

Dataset

Dataset for training
To meta-learn our model, we practically collect multiple tasks where each task consists of (dataset, architecture, accuracy).

ImageNet-1K is split into multiple subsets by randomly sampling 20 classes for each one. For each sampled dataset, a set-specific architecture is found by random search among high-quality architectures.

For the predictor, we additionally collect 2,920 tasks through random sampling.
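The class-subset sampling described above can be sketched as follows. This is only an illustration: the function name `sample_class_subsets`, the `seed` argument, and the use of integer class ids are assumptions, and the architecture search and accuracy measurement that complete each (dataset, architecture, accuracy) task are not shown.

```python
import random


def sample_class_subsets(class_ids, num_subsets, classes_per_subset=20, seed=0):
    """Draw `num_subsets` random class subsets, each defining one small dataset."""
    rng = random.Random(seed)  # local generator so the sampling is reproducible
    return [
        sorted(rng.sample(class_ids, classes_per_subset))  # no repeats within a subset
        for _ in range(num_subsets)
    ]


imagenet_classes = list(range(1000))  # ImageNet-1K has 1,000 classes
subsets = sample_class_subsets(imagenet_classes, num_subsets=5)
```

Each element of `subsets` is a list of 20 distinct class ids; the images of those classes would form one meta-training dataset.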

Method

Generator training

We meta-learn the model using the following objective:
[image missing in the source]
The first term of the objective can be rewritten in terms of the nodes $O_i$ and the edges $e_{ji}$.
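The equation images for the objective did not survive in this post, so the following is only a sketch under a loud assumption: that the objective has the usual variational form, a graph reconstruction log-likelihood (summed over node operations and edges) minus a weighted KL term to a standard normal prior. All function names and the `kl_weight` parameter are hypothetical.

```python
import math


def reconstruction_log_likelihood(op_probs, true_ops, edge_probs, true_edges):
    """Sum of log-probabilities of the true node operations and edge decisions."""
    ll = 0.0
    for probs, op in zip(op_probs, true_ops):
        ll += math.log(probs[op])  # categorical term for node i's operation
    for edge, p in edge_probs.items():
        # Bernoulli term: p if the edge (j, i) exists in the true graph, else 1 - p
        ll += math.log(p if edge in true_edges else 1.0 - p)
    return ll


def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) for a diagonal Gaussian."""
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0 for m, lv in zip(mu, log_var))


def negative_elbo(op_probs, true_ops, edge_probs, true_edges, mu, log_var, kl_weight=1.0):
    """Loss to minimize: -(reconstruction term) + kl_weight * KL (assumed form)."""
    recon = reconstruction_log_likelihood(op_probs, true_ops, edge_probs, true_edges)
    return -recon + kl_weight * kl_to_standard_normal(mu, log_var)
```

With this decomposition, the node terms and edge terms of the reconstruction part can be inspected separately, which matches the post's remark that the first term is expressed through nodes and edges.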

Meta-test

Graph Decoding

For a given dataset, sample num_sample instances per class, concatenate them, and feed them into the set encoder. Once the given dataset is encoded, a batch of set-dependent latent codes z is drawn from a dataset-conditioned Gaussian distribution that takes the encoded unseen dataset as input.
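Drawing a batch of latent codes from a dataset-conditioned Gaussian can be sketched as below. The assumption here is that the encoder outputs a mean vector and a log-variance vector for a diagonal Gaussian; `sample_latent_codes`, `mu`, `log_var`, and `seed` are illustrative names, not taken from the original code.

```python
import math
import random


def sample_latent_codes(mu, log_var, num_codes, seed=0):
    """Draw `num_codes` samples z = mu + sigma * eps with eps ~ N(0, I)."""
    rng = random.Random(seed)
    sigma = [math.exp(0.5 * lv) for lv in log_var]  # standard deviation per dimension
    return [
        [m + s * rng.gauss(0.0, 1.0) for m, s in zip(mu, sigma)]
        for _ in range(num_codes)
    ]


# Hypothetical encoder outputs for one unseen dataset (2-dimensional latent space)
codes = sample_latent_codes(mu=[0.0, 1.0], log_var=[0.0, 0.0], num_codes=4)
```

Each code in the batch would then be decoded into one candidate architecture.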

[image missing in the source]

When generating the i-th node $v_i$, we compute the probability of its operation type $O_{v_i}$ over the N candidate operations, based on the current graph state $H_g = H_{v_{i-1}}$.

$O_{v_i}$ is defined as follows:
[image missing in the source]
The hidden state $H_{v_i}$ is then updated by the function UPDATE, which is a GRU; m is the incoming message to $v_i$.

We decide whether to add an edge from $v_j$ to $v_i$ based on the edge connection $e_{\{v_j, v_i\}} = NN_{edge}(h_j, h_i)$; incorporating the initial state $H_0$ is also feasible.

[image missing in the source]
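The node-by-node decoding described above can be sketched as a toy loop. Everything here is a stand-in chosen for illustration: `op_scorer`, `edge_scorer`, `embed_op`, and `update` are hypothetical callables, `update` is a simple tanh rule rather than a GRU, the message m is taken to be the predecessor's hidden state, and edges are chosen by thresholding at 0.5 instead of sampling.

```python
import math
import random


def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def decode_graph(h0, num_nodes, op_scorer, edge_scorer, embed_op, update, seed=0):
    rng = random.Random(seed)
    hidden, ops, edges = [], [], []
    h_graph = h0  # graph state before any node exists
    for i in range(num_nodes):
        probs = softmax(op_scorer(h_graph))  # distribution over the N candidate ops
        op = rng.choices(range(len(probs)), weights=probs)[0]
        h_i = update(embed_op(op), [0.0] * len(h0))  # no incoming message yet
        for j in range(i - 1, -1, -1):  # consider an edge from each earlier node
            if sigmoid(edge_scorer(hidden[j], h_i)) > 0.5:
                edges.append((j, i))
                h_i = update(h_i, hidden[j])  # message from v_j (simplified)
        ops.append(op)
        hidden.append(h_i)
        h_graph = h_i  # graph state is the latest node's hidden state
    return ops, edges


# Toy stand-ins for the learned networks (assumptions, not the real model)
dim, num_ops = 4, 3
ops, edges = decode_graph(
    h0=[0.1] * dim,
    num_nodes=5,
    op_scorer=lambda h: [sum(h) * k for k in range(num_ops)],
    edge_scorer=lambda h_j, h_i: sum(a * b for a, b in zip(h_j, h_i)),
    embed_op=lambda op: [1.0 if k == op else 0.0 for k in range(dim)],
    update=lambda h, m: [math.tanh(a + b) for a, b in zip(h, m)],
)
```

The result is a list of operation indices, one per node, and a list of directed edges `(j, i)` with `j < i`, so the decoded graph is acyclic by construction.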

Accuracy prediction
After the set-dependent architectures are generated, the predictor predicts the accuracy $S_i$ of each generated candidate architecture $G_i$ on the given unseen dataset $D_i$, and the architecture with the highest predicted accuracy is selected.
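The selection step amounts to an argmax over predicted accuracies. A minimal sketch, assuming a hypothetical `predictor(dataset_embedding, architecture)` callable that returns a scalar score; the toy predictor and the list encoding of architectures below are for illustration only.

```python
def select_best_architecture(dataset_embedding, candidates, predictor):
    """Score every candidate on the dataset and return the top one with its score."""
    scored = [(predictor(dataset_embedding, g), g) for g in candidates]
    best_score, best_arch = max(scored, key=lambda pair: pair[0])
    return best_arch, best_score


# Toy predictor: pretends that larger operation indices give higher accuracy
toy_predictor = lambda d, g: sum(g) / 10
best_arch, best_score = select_best_architecture(
    dataset_embedding=[0.0, 0.0],
    candidates=[[1, 2], [3, 4], [0, 0]],
    predictor=toy_predictor,
)
```

Because the predictor is only evaluated, not trained, at this stage, scoring many candidates is cheap compared with training each of them.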

Set encoder
The sampled instances are fed into IntraSetPool, the intra-class encoder, to encode a class prototype $V_c \in \mathbb{R}^{batch \times d_x}$ for each class $c = 1, \dots, C$. The class-specific set representations $V_c$ are then fed into InterSetPool, the inter-class encoder, to generate the dataset representation $H$ as follows:
[image missing in the source]
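The two-level structure of the set encoder can be sketched with plain mean pooling. This is a deliberate simplification: the post's IntraSetPool and InterSetPool are encoders, whereas `mean_pool` below has no parameters; it only illustrates the flow from instances to class prototypes to a single dataset representation. The function names and the dictionary input format are assumptions.

```python
def mean_pool(vectors):
    """Average a list of equal-length vectors element-wise."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def encode_dataset(instances_by_class):
    """Pool instances within each class, then pool the class prototypes."""
    prototypes = [mean_pool(v) for v in instances_by_class.values()]  # intra-class step
    return mean_pool(prototypes)  # inter-class step


H = encode_dataset({
    "cat": [[1.0, 2.0], [3.0, 4.0]],  # prototype [2.0, 3.0]
    "dog": [[5.0, 6.0]],              # prototype [5.0, 6.0]
})
```

Pooling per class first means each class contributes equally to the dataset representation, regardless of how many instances were sampled for it.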