Special applications: Face recognition & Neural style transfer
4.1 face recognition
emmmmmm.....
one shot learning
siamese network
Triplet Loss
face verification and binary classification
neural style transfer
This course helps us learn how convnets really works.
generate the transfer cost function
Content Cost Function
Style cost function
1D and 3D Generalizations
question:
1. Question 1
Face verification requires comparing a new picture against one person’s face, whereas face recognition requires comparing a new picture against K person’s faces.
True
False
Question 2
1
point
2. Question 2
Why do we learn a function for face verification? (Select all that apply.)
We need to solve a one-shot learning problem.
This allows us to learn to predict a person’s identity using a softmax output unit, where the number of classes equals the number of persons in the database plus 1 (for the final “not in database” class).
This allows us to learn to recognize a new person given just a single image of that person.
Given how few images we have per person, we need to apply transfer learning.
Question 3
1
point
3. Question 3
In order to train the parameters of a face recognition system, it would be reasonable to use a training set comprising 100,000 pictures of 100,000 different persons.
True
False
Question 4
1
point
4. Question 4
Which of the following is a correct definition of the triplet loss? Consider that . (We encourage you to figure out the answer from first principles, rather than just refer to the lecture.)
[Math Processing Error]
[Math Processing Error]
[Math Processing Error]
[Math Processing Error]
Question 5
1
point
5. Question 5
Consider the following Siamese network architecture:
The upper and lower neural networks have different input images, but have exactly the same parameters.
True
False
Question 6
1
point
6. Question 6
You train a ConvNet on a dataset with 100 different classes. You wonder if you can find a hidden unit which responds strongly to pictures of cats. (I.e., a neuron so that, of all the input/training images that strongly activate that neuron, the majority are cat pictures.) You are more likely to find this unit in layer 4 of the network than in layer 1.
True
False
Question 7
1
point
7. Question 7
Neural style transfer is trained as a supervised learning task in which the goal is to input two images (), and train a network to output a new, synthesized image ().
True
False
Question 8
1
point
8. Question 8
In the deeper layers of a ConvNet, each channel corresponds to a different feature detector. The style matrix measures the degree to which the activations of different feature detectors in layer vary (or correlate) together with each other.
True
False
Question 9
1
point
9. Question 9
In neural style transfer, what is updated in each iteration of the optimization algorithm?
The regularization parameters
The pixel values of the generated image
The pixel values of the content image
The neural network parameters
Question 10
1
point
10. Question 10
You are working with 3D data. You are building a network layer whose input volume has size 32x32x32x16 (this volume has 16 channels), and applies convolutions with 32 filters of dimension 3x3x3 (no padding, stride 1). What is the resulting output volume?
Undefined: This convolution step is impossible and cannot be performed because the dimensions specified don’t match up.
30x30x30x16
30x30x30x32