tensorflow示例代码注释2

最新推荐文章于 2022-07-12 13:36:54 发布

sunquan_ok

最新推荐文章于 2022-07-12 13:36:54 发布

阅读量2.1k

点赞数

本文链接：https://blog.csdn.net/sunquan_ok/article/details/51774222

版权

02_logistic_regression.py

#!/usr/bin/env python

import tensorflow as tf
import numpy as np
import input_data

// random_normal返回一个tensor其中的元素的值服从正态分布,stddev标准差

def init_weights(shape):

return tf.Variable(tf.random_normal(shape, stddev=0.01))

def model(X, w):
return tf.matmul(X, w) # notice we use the same model as linear regression, this is because there is a baked in cost function which performs softmax and cross entropy

mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)
trX, trY, teX, teY = mnist.train.images, mnist.train.labels, mnist.test.images, mnist.test.labels

X = tf.placeholder("float", [None, 784]) # create symbolic variables
Y = tf.placeholder("float", [None, 10])

w = init_weights([784, 10]) # like in linear regression, we need a shared variable weight matrix for logistic regression

py_x = model(X, w)

//reduce_mean取均值，第一个参数为输入矩阵，第二个为reduce所沿的维度，X，Y或者Z等

//softmax_cross_entropy_with_logits 取KL散度，交叉熵

//tf.argmax 是一个非常有用的函数，它能给出某个tensor对象在某一维上的其数据最大值所在的索引值。由于标签向量是由0,1组成，因此最大值1所在的索引位置就是类别标

//签，比如tf.argmax(y,1)返回的是模型对于任一输入x预测到的标签值，而 tf.argmax(y_,1) 代表正确的标签，我们可以用tf.equal 来检测我们的预测是否真实标签匹配(索引位置//一样表示匹配)。

cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(py_x, Y)) # compute mean cross entropy (softmax is applied internally)
train_op = tf.train.GradientDescentOptimizer(0.05).minimize(cost) # construct optimizer
predict_op = tf.argmax(py_x, 1) # at predict time, evaluate the argmax of the logistic regression

# Launch the graph in a session
with tf.Session() as sess:
    # you need to initialize all variables
    tf.initialize_all_variables().run()
//range(开始，结束，间隔），这里是每次取128个数进行训练，一个循环128次，100次循环
    for i in range(100):
        for start, end in zip(range(0, len(trX), 128), range(128, len(trX), 128)):
            sess.run(train_op, feed_dict={X: trX[start:end], Y: trY[start:end]})
        print(i, np.mean(np.argmax(teY, axis=1) ==

sess.run(predict_op, feed_dict={X: teX, Y: teY})))

//重点说一下，np.argmax(teY,axis=1),是对teY矩阵按行求最大值的索引值，也就是分类的编码，也就是数字0，1，2，3...9，如果打印，会看到是一个一维数组，都是结果

//sess.run(predict_op, feed_dict={X: teX, Y: teY}) 中，因为predict_op是求按py_x进行分类的结果，所以测试集teX,teY,实际上teY是不需要的。我验证了一下，不输入teY,结果也没有变化，但是没有teX，会运行错误。

最后一段的意义，就是输出测试集teY的分类和模型测试的结果，有什么区别，求它们的均值。实际上就是准确率了

sunquan_ok

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
tensorflow示例代码注释2

02_logistic_regression.py#!/usr/bin/env pythonimport tensorflow as tfimport numpy as npimport input_data//random_normal返回一个tensor其中的元素的值服从正态分布,stddev标准差def init_weights(shape):
复制链接

扫一扫