tensorflow+ tutorial 吴恩达第二课第三周作业

最新推荐文章于 2024-07-28 22:23:07 发布

hdhuangzhihao

最新推荐文章于 2024-07-28 22:23:07 发布

阅读量2.5k

点赞数 1

本文链接：https://blog.csdn.net/hdhuangzhihao/article/details/79051893

版权

这篇博客介绍了如何使用TensorFlow进行深度学习，包括变量初始化、创建会话、训练算法和构建神经网络。通过实例展示了如何计算线性函数、sigmoid、成本、使用One Hot编码以及初始化为零和一的变量。此外，还涉及了建立第一个神经网络的过程，如创建占位符、参数初始化、前向传播、计算成本、反向传播和参数更新。最后，博主分享了在SIGNS数据集上训练模型的结果，实现了约71.7%的识别准确率。

摘要由CSDN通过智能技术生成

TensorFlow Tutorial

Welcome to this week's programming assignment. Until now, you've always used numpy to build neural networks. Now we will step you through a deep learning framework that will allow you to build neural networks more easily. Machine learning frameworks like TensorFlow, PaddlePaddle, Torch, Caffe, Keras, and many others can speed up your machine learning development significantly. All of these frameworks also have a lot of documentation, which you should feel free to read. In this assignment, you will learn to do the following in TensorFlow:

Initialize variables
Start your own session
Train algorithms
Implement a Neural Network

Programing frameworks can not only shorten your coding time, but sometimes also perform optimizations that speed up your code.

1 - Exploring the Tensorflow Library

To start, you will import the library:

    In [2]: 
  

 
       
     
 
           import math 
           import numpy as np 
           import h5py 
           import matplotlib.pyplot as plt 
           import tensorflow as tf 
           from tensorflow.python.framework import ops 
           from tf_utils import load_dataset, random_mini_batches, convert_to_one_hot, predict 
           ​ 
           %matplotlib inline 
           np.random.seed(1) 
          

/Library/Frameworks/Python.framework/Versions/3.6/lib/python3.6/importlib/_bootstrap.py:219: RuntimeWarning: compiletime version 3.5 of module 'tensorflow.python.framework.fast_tensor_util' does not match runtime version 3.6
  return f(*args, **kwds)

Now that you have imported the library, we will walk you through its different applications. You will start with an example, where we compute for you the loss of one training example.

l o s s =  (y ̂, y) = (y ̂ (i) - y (i)) 2 (1)

 
   In [3]: 
  

 
           y_hat = tf.constant(36, name='y_hat')            # Define y_hat constant. Set to 36. 
           y = tf.constant(39, name='y')                    # Define y. Set to 39 
           ​ 
           loss = tf.Variable((y - y_hat)**2, name='loss')  # Create a variable for the loss 
           ​ 
           init = tf.global_variables_initializer()         # When init is run later (session.run(init)), 
                                                            # the loss variable will be initialized and ready to be computed 
           with tf.Session() as session:                    # Create a session and print the output 
               session.run(init)                            # Initializes the variables 
               print(session.run(loss))                     # Prints the loss 
           ​ 
           ​

Writing and running programs in TensorFlow has the following steps:

Create Tensors (variables) that are not yet executed/evaluated.
Write operations between those Tensors.
Initialize your Tensors.
Create a Session.
Run the Session. This will run the operations you'd written above.

Therefore, when we created a variable for the loss, we simply defined the loss as a function of other quantities, but did not evaluate its value. To evaluate it, we had to run init=tf.global_variables_initializer(). That initialized the loss variable, and in the last line we were finally able to evaluate the value of loss and print its value.

Now let us look at an easy example. Run the cell below:

    In [4]: 
  

 
           a = tf.constant(2) 
           b = tf.constant(10) 
           c = tf.multiply(a,b) 
           print(c)

Tensor("Mul:0", shape=(), dtype=int32)

As expected, you will not see 20! You got a tensor saying that the result is a tensor that does not have the shape attribute, and is of type "int32". All you did was put in the 'computation graph', but you have not run this computation yet. In order to actually multiply the two numbers, you will have to create a session and run it.

 
   In [5]: 
  

 
           sess = tf.Session() 
           print(sess.run(c))

Great! To summarize, remember to initialize your variables, create a session and run the operations inside the session.

Next, you'll also have to know about placeholders. A placeholder is an object whose value you can specify only later. To specify values for a placeholder, you can pass in values by using a "feed dictionary" (feed_dict variable). Below, we created a placeholder for x. This allows us to pass in a number later when we run the session.

    In [6]: 
  

 
           # Change the value of x in the feed_dict 
           ​ 
           x = tf.placeholder(tf.int64, name = 'x') 
           print(sess.run(2 * x, feed_dict = {
             x: 3})) 
           sess.close()

When you first defined x you did not have to specify a value for it. A placeholder is simply a variable that you will assign data to only later, when running the session. We say that you feed data to these placeholders when running the session.

Here's what's happening: When you specify the operations needed for a computation, you are telling TensorFlow how to construct a computation graph. The computation graph can have some placeholders whose values you will specify only later. Finally, when you run the session, you are telling TensorFlow to execute the computation graph.

1.1 - Linear function

Lets start this programming exercise by computing the following equation: Y=WX+b , where W and X are random matrices and b is a random vector.

Exercise: Compute WX+b where W,X , and b are drawn from a random normal distribution. W is of shape (4, 3), X is (3,1) and b is (4,1). As an example, here is how you would define a constant X that has shape (3,1):

X = tf.constant(np.random.randn(3,1), name = "X")

You might find the following functions helpful:

tf.matmul(..., ...) to do a matrix multiplication
tf.add(..., ...) to do an addition
np.random.randn(...) to initialize randomly

 
   In [15]: 
  

 
       
     
 
          
 
          
 
           
 
           # GRADED FUNCTION: linear_function 
           ​ 
           def linear_function(): 
               """ 
               Implements a linear function:  
                       Initializes W to be a random tensor of shape (4,3) 
                       Initializes X to be a random tensor of shape (3,1) 
                       Initializes b to be a random tensor of shape (4,1) 
               Returns:  
               result -- runs the session for Y = WX + b  
               """ 
                
               np.random.seed(1) 
                
               ### START CODE HERE ### (4 lines of code) 
               X = tf.constant(np.random.randn(3,1), name = 'X') 
               W = tf.constant(np.random.randn(4,3), name = 'W') 
               b = tf.constant(np.random.randn(4,1), name = 'b') 
               Y = tf.add(tf.matmul(W, X), b) 
               ### END CODE HERE ###  
                
               # Create the session using tf.Session() and run it with sess.run(...) on the variable you want to calculate 
                
               ### START CODE HERE ### 
               sess = tf.Session() 
               result = sess.run(Y) 
               ### END CODE HERE ###  
                
               # close the session  
               sess.close() 
           ​ 
               return result 
          
 
      

    In [16]: 
  

最低0.47元/天解锁文章

hdhuangzhihao

关注

1
点赞
踩
1

收藏

觉得还不错? 一键收藏
1
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫