tensorflow保存、加载模型

最新推荐文章于 2023-01-07 14:27:58 发布

furuit

最新推荐文章于 2023-01-07 14:27:58 发布

阅读量299

点赞数

分类专栏： TensorFlow 文章标签： TensorFlow

本文链接：https://blog.csdn.net/fu6543210/article/details/79848307

版权

TensorFlow 专栏收录该内容

33 篇文章 3 订阅

订阅专栏

首先对tensorflow模型有个直观的了解：

我们逐个介绍：

四个文件，分别是checkpoint(记录模型文件列表)

.data和.index(存储训练好的参数包括weight，biase,etc)、

.meta(神经网络结构)。

模型的保存。

#!/usr/bin/env python
#-*- coding:utf-8 -*-

import tensorflow as tf

# prepare to feed input, i.e. feed_dict and placeholders
w1 = tf.Variable(tf.random_normal(shape = [2]), name = 'w1')  # name is very important in restoration
w2 = tf.Variable(tf.random_normal(shape = [2]), name = 'w2')
b1 = tf.Variable(2.0, name = 'bias1')
feed_dict = {w1:[10,3], w2:[5,5]}

# define a test operation that will be restored
w3 = tf.add(w1, w2)  # without name, w3 will not be stored
w4 = tf.multiply(w3, b1, name = "op_to_restore")

#saver = tf.train.Saver()
saver = tf.train.Saver(max_to_keep = 4, keep_checkpoint_every_n_hours = 1)
sess = tf.Session()
sess.run(tf.global_variables_initializer())
print sess.run(w4, feed_dict)
#saver.save(sess, 'my_test_model', global_step = 100)
saver.save(sess, 'my_test_model')
#saver.save(sess, 'my_test_model', global_step = 100, write_meta_graph = False)

需要说明的有以下几点：

#1。创建saver时，可以指明存储的tensor，如果不指明，会全部存下来。在这里也可以指明最大存储数量和checkpoint的记录时间。

#max_to_keep = 4表示只存最新的4个模型文件。keep_checkpoint_every_n_hours = 1表示训练1小时保存一次模型

#saver = tf.train.Saver()

saver = tf.train.Saver(max_to_keep = 4, keep_checkpoint_every_n_hours = 1)

sess = tf.Session()

sess.run(tf.global_variables_initializer())

saver.save(sess, './my_test_model/model.ckpt')

#saver.save(sess, 'my_test_model', global_step = 100)

#saver . save ( sess , 'my-model' , global_step = step , write_meta_graph = False )

2. saver.save()函数里面可以设定global_step和write_meta_graph。

'./my_test_model/model.ckpt'表示文件保存的路径，该文件没有会自动生成，生成以model.ckpt后缀的文件格式

global_step = 100表示每迭代训练100次保存一次模型。

通过设置write_meta_graph = False，因为meta存储的是网络结构，会使得只在开始运行程序的时候存储一次神经网络，

省的重复保存同一个神经网络多次。

3. 这个程序执行结束后，会在程序目录下生成四个文件，分别是.meta(存储网络结构)、.data和.index(存储训练好的参数)、checkpoint(记录模型列表)。

====还有一种情况：但是一般基本不用==========

.data和.index(存储训练好的参数包括weight，biase,etc)。比如做二分类网络，但是以后可能会做10分类的网络。

那他们间的差异在于最后softmax层不同而已，为了以后方便使用，我们可以选择只保存softmax层前面的

weight和biase，也就是只保存一部分参数。softmax层的参数以后再训练。

但是上面的代码模型会保存所有的weight和biase，通常我们也都保存所有的训练参数。

import tensorflow as tf

w1 = tf . Variable ( tf . random_normal ( shape = [ 2 ] ) , name = 'w1' )

w2 = tf . Variable ( tf . random_normal ( shape = [ 5 ] ) , name = 'w2' )

saver = tf . train . Saver([w1,w2])#保存部分指定的参数

sess = tf . Session ( )

sess . run ( tf . global_variables_initializer ( ) )

saver . save ( sess , 'my_test_model' , global_step = 1000 )

加载已经保存的网络模型。

这里有两种方法，第一种是saver.restore(sess, 'aaaa.ckpt')，这种方法的本质是读取全部参数，并加载到已经定义好的网络结构上，因此相当于给网络的weights和biases赋值并执行tf.global_variables_initializer()。这种方法的缺点是使用前必须重写网络结构，而且网络结构要和保存的参数完全对上。

#-*- coding:utf-8 -*-
import tensorflow as tf

# prepare to feed input, i.e. feed_dict and placeholders
w1 = tf.Variable(tf.random_normal(shape = [2]), name = 'w1') # name is very important in restoration
w2 = tf.Variable(tf.random_normal(shape = [2]), name = 'w2')
b1 = tf.Variable(2.0, name = 'bias1')
feed_dict = {w1:[10,3], w2:[5,5]}

# define a test operation that will be restored
w3 = tf.add(w1, w2) # without name, w3 will not be stored
w4 = tf.multiply(w3, b1, name = "op_to_restore")

saver = tf.train.Saver()
sess = tf.Session()
sess.run(tf.global_variables_initializer())

saver.restore(sess, './my_test_model/model.ckpt')

print (sess.run(w4, feed_dict))

#只要把saver.save()改成saver.restore()函数即可，路径保持一样

第二种就比较高端了，直接把网络结构加载进来(.meta)，

#!/usr/bin/env python
#-*- coding:utf-8 -*-

import tensorflow as tf

sess = tf.Session()
new_saver = tf.train.import_meta_graph('my_test_model.meta')
new_saver.restore(sess, tf.train.latest_checkpoint('./'))
print sess.run('w1:0')

使用加载的模型，输入新数据，计算输出，

#!/usr/bin/env python
#-*- coding:utf-8 -*-

import tensorflow as tf

sess = tf.Session()

# First, load meta graph and restore weights
saver = tf.train.import_meta_graph('my_test_model.meta')
saver.restore(sess, tf.train.latest_checkpoint('./'))

# Second, access and create placeholders variables and create feed_dict to feed new data
graph = tf.get_default_graph()
w1 = graph.get_tensor_by_name('w1:0')
w2 = graph.get_tensor_by_name('w2:0')
feed_dict = {w1:[-1,1], w2:[4,6]}

# Access the op that want to run
op_to_restore = graph.get_tensor_by_name('op_to_restore:0')

print sess.run(op_to_restore, feed_dict)     # ouotput: [6. 14.]

在已经加载的网络后继续加入新的网络层：

import tensorflow as tf

sess=tf.Session()    
#First let's load meta graph and restore weights
saver = tf.train.import_meta_graph('my_test_model-1000.meta')
saver.restore(sess,tf.train.latest_checkpoint('./'))


# Now, let's access and create placeholders variables and
# create feed-dict to feed new data

graph = tf.get_default_graph()
w1 = graph.get_tensor_by_name("w1:0")
w2 = graph.get_tensor_by_name("w2:0")
feed_dict ={w1:13.0,w2:17.0}

#Now, access the op that you want to run. 
op_to_restore = graph.get_tensor_by_name("op_to_restore:0")

#Add more to the current graph
add_on_op = tf.multiply(op_to_restore,2)

print sess.run(add_on_op,feed_dict)
#This will print 120.

对加载的网络进行局部修改和处理(这个最麻烦，我还没搞太明白，后续会继续补充)。

训练部分参数，浅层不训练，深层训练。

前面不训练，最后一层softmax训练

在测试或者离线版的时候，我们不想初始化参数，我们测试时候只要测试，但参数不参与反馈调节等：

......
......
saver = tf.train.import_meta_graph('vgg.meta')
# Access the graph
graph = tf.get_default_graph()
## Prepare the feed_dict for feeding data for fine-tuning 

#Access the appropriate output for fine-tuning
fc7= graph.get_tensor_by_name('fc7:0')

#use this if you only want to change gradients of the last layer
fc7 = tf.stop_gradient(fc7) # It's an identity function
fc7_shape= fc7.get_shape().as_list()

new_outputs=2
weights = tf.Variable(tf.truncated_normal([fc7_shape[3], num_outputs], stddev=0.05))
biases = tf.Variable(tf.constant(0.05, shape=[num_outputs]))
output = tf.matmul(fc7, weights) + biases
pred = tf.nn.softmax(output)

# Now, you run this with fine-tuning data in sess.run()

有了这样的方法，无论是自行训练、加载模型继续训练、使用经典模型还是finetune经典模型抑或是加载网络跑前项，效果都是杠杠的。

参考链接

https://blog.csdn.net/lordofrobots/article/details/77719020

http://cv-tricks.com/tensorflow-tutorial/save-restore-tensorflow-models-quick-complete-tutorial/