Converting the DCGAN code from Lesson 1 to the LSGAN loss function

First, here is the course link so everyone can study along:
https://aistudio.baidu.com/aistudio/course/introduce/16651
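For context, LSGAN keeps the DCGAN architecture and swaps the cross-entropy terms of the vanilla GAN objective for least-squares terms. With the label convention used below (fake label a = 0, real label b = 1, and the generator aiming at c = 1), the two objectives from the LSGAN paper are:

min_D  L(D) = 1/2 * E_x[(D(x) - b)^2] + 1/2 * E_z[(D(G(z)) - a)^2]
min_G  L(G) = 1/2 * E_z[(D(G(z)) - c)^2]

Because the targets are plain constants, both terms reduce to a mean-squared error against a label tensor, which is exactly the substitution made in this notebook.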
# Import the necessary packages
import os
import random
import paddle
import paddle.nn as nn
import paddle.optimizer as optim
import paddle.vision.datasets as dset
import paddle.vision.transforms as transforms
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.animation as animation
Build the dataset and data loader
dataset = paddle.vision.datasets.MNIST(mode='train',
                                       transform=transforms.Compose([
                                           # resize to (32, 32)
                                           transforms.Resize((32, 32)),
                                           # normalize to the range [-1, 1]
                                           transforms.Normalize([127.5], [127.5])
                                       ]))

dataloader = paddle.io.DataLoader(dataset, batch_size=32,
                                  shuffle=True, num_workers=4)
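As a quick sanity check (not part of the original notebook), you can pull one batch and confirm the shapes and value range:

# Illustrative check: inspect one batch from the loader
imgs, labels = next(iter(dataloader))
print(imgs.shape)                            # expected: [32, 1, 32, 32]
print(float(imgs.min()), float(imgs.max()))  # roughly within -1.0 ~ 1.0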
Cache file /home/aistudio/.cache/paddle/dataset/mnist/train-images-idx3-ubyte.gz not found, downloading https://dataset.bj.bcebos.com/mnist/train-images-idx3-ubyte.gz
Begin to download
Download finished
Cache file /home/aistudio/.cache/paddle/dataset/mnist/train-labels-idx1-ubyte.gz not found, downloading https://dataset.bj.bcebos.com/mnist/train-labels-idx1-ubyte.gz
Begin to download
........
Download finished
# Weight-initialization helpers
@paddle.no_grad()
def normal_(x, mean=0., std=1.):
    temp_value = paddle.normal(mean, std, shape=x.shape)
    x.set_value(temp_value)
    return x

@paddle.no_grad()
def uniform_(x, a=-1., b=1.):
    temp_value = paddle.uniform(min=a, max=b, shape=x.shape)
    x.set_value(temp_value)
    return x

@paddle.no_grad()
def constant_(x, value):
    temp_value = paddle.full(x.shape, value, x.dtype)
    x.set_value(temp_value)
    return x
def weights_init(m):
    classname = m.__class__.__name__
    if hasattr(m, 'weight') and classname.find('Conv') != -1:
        normal_(m.weight, 0.0, 0.02)
    elif classname.find('BatchNorm') != -1:
        normal_(m.weight, 1.0, 0.02)
        constant_(m.bias, 0)
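netG.apply below walks every sublayer recursively and calls weights_init on each one. A quick illustrative check that the helper does what it claims (the standalone conv here is hypothetical, not part of the model):

# Illustration only: re-initialize a throwaway conv and inspect the std
conv = nn.Conv2D(1, 8, 3)
normal_(conv.weight, 0.0, 0.02)
print(float(conv.weight.std()))  # should land near 0.02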
# Generator Code
class Generator(nn.Layer):
    def __init__(self):
        super(Generator, self).__init__()
        self.gen = nn.Sequential(
            # input is Z, [B, 100, 1, 1] -> [B, 64 * 4, 4, 4]
            nn.Conv2DTranspose(100, 64 * 4, 4, 1, 0, bias_attr=False),
            nn.BatchNorm2D(64 * 4),
            # note: Paddle's nn.ReLU has no inplace flag, so it takes no argument
            nn.ReLU(),
            # state size. [B, 64 * 4, 4, 4] -> [B, 64 * 2, 8, 8]
            nn.Conv2DTranspose(64 * 4, 64 * 2, 4, 2, 1, bias_attr=False),
            nn.BatchNorm2D(64 * 2),
            nn.ReLU(),
            # state size. [B, 64 * 2, 8, 8] -> [B, 64, 16, 16]
            nn.Conv2DTranspose(64 * 2, 64, 4, 2, 1, bias_attr=False),
            nn.BatchNorm2D(64),
            nn.ReLU(),
            # state size. [B, 64, 16, 16] -> [B, 1, 32, 32]
            nn.Conv2DTranspose(64, 1, 4, 2, 1, bias_attr=False),
            nn.Tanh()
        )

    def forward(self, x):
        return self.gen(x)
netG = Generator()
# Apply the weights_init function to randomly initialize all weights
# to mean=0, stdev=0.02.
netG.apply(weights_init)
# Print the model (the same structure also appears in the summary below)
print(netG)
Generator(
  (gen): Sequential(
    (0): Conv2DTranspose(100, 256, kernel_size=[4, 4], data_format=NCHW)
    (1): BatchNorm2D(num_features=256, momentum=0.9, epsilon=1e-05)
    (2): ReLU()
    (3): Conv2DTranspose(256, 128, kernel_size=[4, 4], stride=[2, 2], padding=1, data_format=NCHW)
    (4): BatchNorm2D(num_features=128, momentum=0.9, epsilon=1e-05)
    (5): ReLU()
    (6): Conv2DTranspose(128, 64, kernel_size=[4, 4], stride=[2, 2], padding=1, data_format=NCHW)
    (7): BatchNorm2D(num_features=64, momentum=0.9, epsilon=1e-05)
    (8): ReLU()
    (9): Conv2DTranspose(64, 1, kernel_size=[4, 4], stride=[2, 2], padding=1, data_format=NCHW)
    (10): Tanh()
  )
)
paddle.summary(netG, (1, 100, 1, 1))
-----------------------------------------------------------------------------
Layer (type) Input Shape Output Shape Param #
=============================================================================
Conv2DTranspose-1 [[1, 100, 1, 1]] [1, 256, 4, 4] 409,600
BatchNorm2D-1 [[1, 256, 4, 4]] [1, 256, 4, 4] 1,024
ReLU-1 [[1, 256, 4, 4]] [1, 256, 4, 4] 0
Conv2DTranspose-2 [[1, 256, 4, 4]] [1, 128, 8, 8] 524,288
BatchNorm2D-2 [[1, 128, 8, 8]] [1, 128, 8, 8] 512
ReLU-2 [[1, 128, 8, 8]] [1, 128, 8, 8] 0
Conv2DTranspose-3 [[1, 128, 8, 8]] [1, 64, 16, 16] 131,072
BatchNorm2D-3 [[1, 64, 16, 16]] [1, 64, 16, 16] 256
ReLU-3 [[1, 64, 16, 16]] [1, 64, 16, 16] 0
Conv2DTranspose-4 [[1, 64, 16, 16]] [1, 1, 32, 32] 1,024
Tanh-1 [[1, 1, 32, 32]] [1, 1, 32, 32] 0
=============================================================================
Total params: 1,067,776
Trainable params: 1,065,984
Non-trainable params: 1,792
-----------------------------------------------------------------------------
Input size (MB): 0.00
Forward/backward pass size (MB): 0.67
Params size (MB): 4.07
Estimated Total Size (MB): 4.75
-----------------------------------------------------------------------------
{'total_params': 1067776, 'trainable_params': 1065984}
class Discriminator(nn.Layer):
    def __init__(self):
        super(Discriminator, self).__init__()
        self.dis = nn.Sequential(
            # input [B, 1, 32, 32] -> [B, 64, 16, 16]
            nn.Conv2D(1, 64, 4, 2, 1, bias_attr=False),
            nn.LeakyReLU(0.2),
            # state size. [B, 64, 16, 16] -> [B, 128, 8, 8]
            nn.Conv2D(64, 64 * 2, 4, 2, 1, bias_attr=False),
            nn.BatchNorm2D(64 * 2),
            nn.LeakyReLU(0.2),
            # state size. [B, 128, 8, 8] -> [B, 256, 4, 4]
            nn.Conv2D(64 * 2, 64 * 4, 4, 2, 1, bias_attr=False),
            nn.BatchNorm2D(64 * 4),
            nn.LeakyReLU(0.2),
            # state size. [B, 256, 4, 4] -> [B, 1, 1, 1]
            nn.Conv2D(64 * 4, 1, 4, 1, 0, bias_attr=False),
            # This is the first LSGAN change: nn.Sigmoid() is replaced
            # with a LeakyReLU activation (the argument is the slope for
            # x < 0), so the output is no longer squashed into (0, 1)
            nn.LeakyReLU(0.02)
        )

    def forward(self, x):
        return self.dis(x)
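Note that the LSGAN paper simply leaves the discriminator output linear (no final activation at all); LeakyReLU with a small slope is nearly linear, so the choice above behaves similarly. If you prefer the paper's formulation, the last block would just end at the convolution, as in this sketch:

# Paper-style alternative (sketch, not used below): a strictly linear head
linear_head = nn.Sequential(
    nn.Conv2D(64 * 4, 1, 4, 1, 0, bias_attr=False)
    # no activation: the least-squares loss acts on the raw score
)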
netD = Discriminator()
netD.apply(weights_init)
print(netD)
Discriminator(
  (dis): Sequential(
    (0): Conv2D(1, 64, kernel_size=[4, 4], stride=[2, 2], padding=1, data_format=NCHW)
    (1): LeakyReLU(negative_slope=0.2)
    (2): Conv2D(64, 128, kernel_size=[4, 4], stride=[2, 2], padding=1, data_format=NCHW)
    (3): BatchNorm2D(num_features=128, momentum=0.9, epsilon=1e-05)
    (4): LeakyReLU(negative_slope=0.2)
    (5): Conv2D(128, 256, kernel_size=[4, 4], stride=[2, 2], padding=1, data_format=NCHW)
    (6): BatchNorm2D(num_features=256, momentum=0.9, epsilon=1e-05)
    (7): LeakyReLU(negative_slope=0.2)
    (8): Conv2D(256, 1, kernel_size=[4, 4], data_format=NCHW)
    (9): LeakyReLU(negative_slope=0.02)
  )
)
paddle.summary(netD, (1, 1, 32, 32))
---------------------------------------------------------------------------
Layer (type) Input Shape Output Shape Param #
===========================================================================
Conv2D-9 [[1, 1, 32, 32]] [1, 64, 16, 16] 1,024
LeakyReLU-8 [[1, 64, 16, 16]] [1, 64, 16, 16] 0
Conv2D-10 [[1, 64, 16, 16]] [1, 128, 8, 8] 131,072
BatchNorm2D-8 [[1, 128, 8, 8]] [1, 128, 8, 8] 512
LeakyReLU-9 [[1, 128, 8, 8]] [1, 128, 8, 8] 0
Conv2D-11 [[1, 128, 8, 8]] [1, 256, 4, 4] 524,288
BatchNorm2D-9 [[1, 256, 4, 4]] [1, 256, 4, 4] 1,024
LeakyReLU-10 [[1, 256, 4, 4]] [1, 256, 4, 4] 0
Conv2D-12 [[1, 256, 4, 4]] [1, 1, 1, 1] 4,096
LeakyReLU-11 [[1, 1, 1, 1]] [1, 1, 1, 1] 0
===========================================================================
Total params: 662,016
Trainable params: 660,480
Non-trainable params: 1,536
---------------------------------------------------------------------------
Input size (MB): 0.00
Forward/backward pass size (MB): 0.53
Params size (MB): 2.53
Estimated Total Size (MB): 3.06
---------------------------------------------------------------------------
{'total_params': 662016, 'trainable_params': 660480}
# Initialize the loss function
# This is the second LSGAN change:
# loss = nn.BCELoss() is replaced with the least-squares loss.
# You can also hand-write it from the formula with low-level APIs;
# a commented-out version is kept below.
loss = nn.MSELoss()
"""
import paddle.fluid as fluid
class Loss(fluid.dygraph.Layer):
def __init__(self, label):
super(Loss, self).__init__()
self.label = label
def forward(self, x):
if self.label==0:
loss = x
else:
loss = x-paddle.full(x.shape, self.label, dtype='float32')
loss = paddle.pow(loss,2)
loss = paddle.mean(loss)
loss = loss*0.5
return loss
"""
# Create batch of latent vectors that we will use to visualize
# the progression of the generator
fixed_noise = paddle.randn([32, 100, 1, 1], dtype='float32')
# Establish convention for real and fake labels during training
real_label = 1.
fake_label = 0.
# Setup Adam optimizers for both G and D
optimizerD = optim.Adam(parameters=netD.parameters(), learning_rate=0.0002, beta1=0.5, beta2=0.999)
optimizerG = optim.Adam(parameters=netG.parameters(), learning_rate=0.0002, beta1=0.5, beta2=0.999)
losses = [[], []]
#plt.ion()
now = 0
for pass_id in range(100):
    for batch_id, (data, target) in enumerate(dataloader):
        ############################
        # (1) Update D network: minimize
        #     0.5 * (D(x) - 1)^2 + 0.5 * (D(G(z)) - 0)^2
        ###########################
        optimizerD.clear_grad()
        real_img = data
        bs_size = real_img.shape[0]
        label = paddle.full((bs_size, 1, 1, 1), real_label, dtype='float32')
        real_out = netD(real_img)
        errD_real = loss(real_out, label)
        errD_real.backward()

        noise = paddle.randn([bs_size, 100, 1, 1], 'float32')
        fake_img = netG(noise)
        label = paddle.full((bs_size, 1, 1, 1), fake_label, dtype='float32')
        # detach so the generator receives no gradient from the D update
        fake_out = netD(fake_img.detach())
        errD_fake = loss(fake_out, label)
        errD_fake.backward()
        optimizerD.step()
        optimizerD.clear_grad()

        errD = errD_real + errD_fake
        losses[0].append(errD.numpy()[0])

        ############################
        # (2) Update G network: minimize 0.5 * (D(G(z)) - 1)^2,
        #     i.e. train G with the real label as target
        ###########################
        optimizerG.clear_grad()
        noise = paddle.randn([bs_size, 100, 1, 1], 'float32')
        fake = netG(noise)
        label = paddle.full((bs_size, 1, 1, 1), real_label, dtype='float32')
        output = netD(fake)
        errG = loss(output, label)
        errG.backward()
        optimizerG.step()
        optimizerG.clear_grad()

        losses[1].append(errG.numpy()[0])

        ############################
        # visualize
        ###########################
        if batch_id % 100 == 0:
            # note: fresh noise is sampled here; feeding fixed_noise instead
            # would show the generator's progression on the same latents
            generated_image = netG(noise).numpy()
            imgs = []
            plt.figure(figsize=(15, 15))
            try:
                for i in range(10):
                    image = generated_image[i].transpose()
                    image = np.where(image > 0, image, 0)
                    image = image.transpose((1, 0, 2))
                    plt.subplot(10, 10, i + 1)
                    plt.imshow(image[..., 0], vmin=-1, vmax=1)
                    plt.axis('off')
                    plt.xticks([])
                    plt.yticks([])
                    plt.subplots_adjust(wspace=0.1, hspace=0.1)
                msg = 'Epoch ID={0} Batch ID={1} \n\n D-Loss={2} G-Loss={3}'.format(
                    pass_id, batch_id, errD.numpy()[0], errG.numpy()[0])
                print(msg)
                plt.suptitle(msg, fontsize=20)
                plt.draw()
                plt.savefig('{}/{:04d}_{:04d}.png'.format('work', pass_id, batch_id),
                            bbox_inches='tight')
                plt.pause(0.01)
            except IOError as err:
                print(err)

paddle.save(netG.state_dict(), "work/generator.params")
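After training, the generator can be reloaded from the saved parameters and used to sample new digits; a minimal sketch, assuming the Generator class and save path above:

# Reload the generator and draw a few samples (illustrative sketch)
netG_test = Generator()
netG_test.set_state_dict(paddle.load("work/generator.params"))
netG_test.eval()
with paddle.no_grad():
    samples = netG_test(paddle.randn([4, 100, 1, 1], dtype='float32'))
print(samples.shape)  # [4, 1, 32, 32]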