Tensorflow实现残差网络ResNet-50+官方源码

最新推荐文章于 2024-04-19 13:43:01 发布

qq_27882259

最新推荐文章于 2024-04-19 13:43:01 发布

阅读量3.2k

点赞数 2

本文链接：https://blog.csdn.net/qq_27882259/article/details/102723660

版权

使用Tensorflow实现残差网络ResNet-502018-01- https://github.com/KaimingHe/deep-residual-networks https://blog.csdn.net/liangyihuai/article/details/79140481 这篇文章讲解的是使用Tensorflow实现残差网络resnet-50. 侧重点不在于理论部分，而是在于代码实现部分。在github上面已经有其他的开源实现，如果希望直接使用代码运行自己的数据，不建议使用本人的代码。但是如果希望学习resnet的代码实现思路，那么阅读本文将是一个不错的选择，因为本文的代码的思路是很清晰的。如果你刚刚阅读完resnet的那篇论文，非常建议你进一步学习如何使用代码实现resnet。本文包含源码的数据集。resnet只是在CNN上面增加了shortcut，所以，resnet和CNN是很相似的。##1. model
下面将要实现的是resnet-50。下面是网络模型的整体模型图。其中的CONV表示卷积层，Batch Norm表示Batch 归一化层，ID BLOCK表示Identity块，由多个层构成，具体见第二个图。Conv BLOCK表示卷积块，由多个层构成。为了使得model个结构更加清晰，才提取出了conv block 和id block两个‘块’，分别把它们封装成函数。如果不了解batch norm，可以暂时滤过这部分的内容，可以把它看作是一个特殊的层，它不会改变数据的维度。这将不影响对resnet实现的理解。具体见第三个图。

上图表示Resnet-50的整体结构图
上图表示ID block
上图表示conv block##2. 数据
输入的是类似上图所示的手势图片数据，总共有6个类。所给的数据已经加工过，是‘.h5’格式的数据。有1080张图片，120张测试数据。每一张图片是一个64x64的RGB图片。具体的数据格式为：number of training examples = 1080
number of test examples = 120
X_train shape: (1080, 64, 64, 3)
Y_train shape: (1080, 6)
X_test shape: (120, 64, 64, 3)
Y_test shape: (120, 6)
x train max, 0.956; x train min, 0.015
x test max, 0.94; x test min, 0.011
123456783. 目标训练一个模型，使之能够判别图片中的手指所代表的数字。实质上这个是属于多分类问题。所以，模型的输入是一个64x64x3的图片；模型的输出层为6个节点，每一个节点表示一种分类。4. 模型实现identity block的实现，对于上图2。需要注意的是，X_shortcut一开始就保存了所传入的数据，然后在函数的末尾部分再加上X_shortcut。除了这一点，其他点跟CNN是一样的。 def identity_block(self, X_input, kernel_size, in_filter, out_filters, stage, block, training):
“”"
Implementation of the identity block as defined in Figure 3

    Arguments:
    X -- input tensor of shape (m, n_H_prev, n_W_prev, n_C_prev)
    kernel_size -- integer, specifying the shape of the middle CONV's window for the main path
    filters -- python list of integers, defining the number of filters in the CONV layers of the main path
    stage -- integer, used to name the layers, depending on their position in the network
    block -- string/character, used to name the layers, depending on their position in the network
    training -- train or test

    Returns:
    X -- output of the identity block, tensor of shape (n_H, n_W, n_C)
    """

    # defining name basis
    block_name = 'res' + str(stage) + block
    f1, f2, f3 = out_filters
    with tf.variable_scope(block_name):
        X_shortcut = X_input

        #first

最低0.47元/天解锁文章

qq_27882259

关注

2
点赞
踩
7

收藏

觉得还不错? 一键收藏
0
评论
Tensorflow实现残差网络ResNet-50+官方源码

使用Tensorflow实现残差网络ResNet-502018-01- https://github.com/KaimingHe/deep-residual-networks ...
复制链接

扫一扫