Coursera Deep Learning Course 1 (Neural Networks and Deep Learning): Week 2 Quiz

Neural Network Basics

1

What does a neuron compute?

A neuron computes a linear function (z = Wx + b) followed by an activation function

A neuron computes a function g that scales the input x linearly (Wx + b)

A neuron computes the mean of all features before applying the output to an activation function

A neuron computes an activation function followed by a linear function (z = Wx + b)
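
For intuition about the correct choice (a linear step z = Wx + b followed by an activation), here is a minimal NumPy sketch of a single neuron; the sigmoid activation and the shapes are assumptions made only for illustration.

import numpy as np

def sigmoid(z):
    # sigmoid activation g(z) = 1 / (1 + e^(-z))
    return 1 / (1 + np.exp(-z))

x = np.random.randn(3, 1)    # example input with 3 features (shape assumed for illustration)
W = np.random.randn(1, 3)    # weights of a single neuron
b = np.random.randn(1, 1)    # bias

z = np.dot(W, x) + b         # linear step: z = Wx + b
a = sigmoid(z)               # activation step applied to z
print(z.shape, a.shape)      # (1, 1) (1, 1)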

2

Which of these is the "Logistic Loss"?
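
For reference, the logistic loss used in this course is the cross-entropy loss L(ŷ, y) = -(y log(ŷ) + (1 - y) log(1 - ŷ)). Below is a minimal NumPy sketch of it; the example values are made up.

import numpy as np

def logistic_loss(y_hat, y):
    # cross-entropy ("logistic") loss for a single prediction y_hat against label y
    return -(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

print(logistic_loss(0.9, 1))   # small loss: confident and correct
print(logistic_loss(0.9, 0))   # large loss: confident and wrong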

3

Suppose img is a (32,32,3) array, representing a 32x32 image with 3 color channels red, green and blue. How do you reshape this into a column vector?

x = img.reshape((32*32*3,1))

x = img.reshape((3,32*32))

x = img.reshape((1,32*32,*3))

x = img.reshape((32*32,3))
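
The desired column vector has shape (32*32*3, 1) = (3072, 1). A quick check, using a random array as a stand-in for an actual image:

import numpy as np

img = np.random.randn(32, 32, 3)     # stand-in for a 32x32 RGB image
x = img.reshape((32 * 32 * 3, 1))    # flatten every pixel value into one column
print(x.shape)                       # (3072, 1)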

4

Consider the two following random arrays "a" and "b":

a = np.random.randn(2, 3) # a.shape = (2, 3)
b = np.random.randn(2, 1) # b.shape = (2, 1)
c = a + b

What will be the shape of "c"?

c.shape = (2, 3)

Yes! This is broadcasting. b (column vector) is copied 3 times so that it can be summed to each column of a.

c.shape = (2, 1)

c.shape = (3, 2)

The computation cannot happen because the sizes don't match. It's going to be "Error"!
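
A quick way to confirm this broadcasting behaviour yourself:

import numpy as np

a = np.random.randn(2, 3)
b = np.random.randn(2, 1)
c = a + b          # b is broadcast (copied) across the 3 columns of a
print(c.shape)     # (2, 3)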

5

Consider the two following random arrays "a" and "b":

a = np.random.randn(4, 3) # a.shape = (4, 3)
b = np.random.randn(3, 2) # b.shape = (3, 2)
c = a*b

What will be the shape of "c"?

The computation cannot happen because the sizes don't match. It's going to be "Error"!

Indeed! In numpy the "*" operator indicates element-wise multiplication. It is different from "np.dot()". If you tried "c = np.dot(a,b)" you would get c.shape = (4, 2).

c.shape = (4, 3)

c.shape = (3, 3)

c.shape = (4,2)
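
A short sketch that reproduces both outcomes described above (the element-wise error and the np.dot result):

import numpy as np

a = np.random.randn(4, 3)
b = np.random.randn(3, 2)

try:
    c = a * b                 # element-wise multiply: (4, 3) and (3, 2) are not broadcastable
except ValueError as err:
    print("Error:", err)

c = np.dot(a, b)              # matrix multiplication is fine: inner dimensions match
print(c.shape)                # (4, 2)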

6

Suppose you have n_x input features per example. Recall that X = [x^(1) x^(2) ... x^(m)]. What is the dimension of X?

(n_x, m)

(m, 1)

(1, m)

(m, n_x)
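
Stacking the m example column vectors x^(1), ..., x^(m) side by side gives X with shape (n_x, m); a small sketch with sizes n_x = 5 and m = 10 assumed for illustration:

import numpy as np

n_x, m = 5, 10                                            # sizes assumed for illustration
examples = [np.random.randn(n_x, 1) for _ in range(m)]    # the column vectors x^(1), ..., x^(m)
X = np.hstack(examples)                                   # stack them side by side as columns
print(X.shape)                                            # (5, 10), i.e. (n_x, m)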

7

Recall that "np.dot(a,b)" performs a matrix multiplication on a and b, whereas "a*b" performs an element-wise multiplication.

Consider the two following random arrays "a" and "b":

a = np.random.randn(12288, 150) # a.shape = (12288, 150)
b = np.random.randn(150, 45) # b.shape = (150, 45)
c = np.dot(a,b)

What is the shape of c?

c.shape = (12288, 45)

Correct, remember that np.dot(a, b) has shape (number of rows of a, number of columns of b). The sizes match because the number of columns of a = 150 = the number of rows of b.

The computation cannot happen because the sizes don't match. It's going to be "Error"!

c.shape = (150,150)

c.shape = (12288, 150)
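
Verifying the shape rule for np.dot directly:

import numpy as np

a = np.random.randn(12288, 150)
b = np.random.randn(150, 45)
c = np.dot(a, b)       # (rows of a, columns of b); inner dimensions 150 match
print(c.shape)         # (12288, 45)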

8

Consider the following code snippet:

# a.shape = (3,4)
# b.shape = (4,1)
for i in range(3):
  for j in range(4):
    c[i][j] = a[i][j] + b[j]

How do you vectorize this?

c = a.T + b.T

c = a.T + b

c = a + b.T

c = a + b
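
A quick check that the broadcast expression matches the double loop (c is preallocated here so that the loop from the question can run on its own):

import numpy as np

a = np.random.randn(3, 4)
b = np.random.randn(4, 1)

# loop version from the question
c_loop = np.zeros((3, 4))
for i in range(3):
    for j in range(4):
        c_loop[i][j] = a[i][j] + b[j]

# vectorized version: b.T has shape (1, 4) and broadcasts over the rows of a
c_vec = a + b.T
print(np.allclose(c_loop, c_vec))    # True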

9

Consider the following code:

a = np.random.randn(3, 3)
b = np.random.randn(3, 1)
c = a*b

What will be c? (If you’re not sure, feel free to run this in python to find out).

This will invoke broadcasting, so b is copied three times to become (3,3), and ∗ is an element-wise product so c.shape will be (3, 3)

This will invoke broadcasting, so b is copied three times to become (3, 3), and ∗ invokes a matrix multiplication operation of two 3x3 matrices so c.shape will be (3, 3)

This will multiply a 3x3 matrix a with a 3x1 vector, thus resulting in a 3x1 vector. That is, c.shape = (3,1).

It will lead to an error since you cannot use “*” to operate on these two matrices. You need to instead use np.dot(a,b)
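
Running this confirms the broadcasting behaviour of "*":

import numpy as np

a = np.random.randn(3, 3)
b = np.random.randn(3, 1)
c = a * b          # b is broadcast to (3, 3); "*" stays element-wise
print(c.shape)     # (3, 3)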

10

Consider the following computation graph.

(Computation graph: u = a*b, v = a*c, w = b + c, J = u + v - w.)
What is the output J?

J = (c - 1)*(b + a)

J = (a - 1) * (b + c)

Yes. J = u + v - w = a*b + a*c - (b + c) = a * (b + c) - (b + c) = (a - 1) * (b + c).

J = a*b + b*c + a*c

J = (b - 1) * (c + a)
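
A small numeric check of the algebra above, with arbitrary example values for a, b and c:

a, b, c = 3.0, 2.0, 5.0        # arbitrary example values

u = a * b                      # forward pass through the graph
v = a * c
w = b + c
J = u + v - w

print(J)                       # 14.0
print((a - 1) * (b + c))       # 14.0, the same value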
