Transformer Track的pytorch版本

最新推荐文章于 2025-04-18 15:33:16 发布

原创最新推荐文章于 2025-04-18 15:33:16 发布 · 841 阅读

0 ·

CC 4.0 BY-SA版权

文章标签：

#python #深度学习 #transformer

本文介绍如何使用pip安装与CUDA10.2兼容的PyTorch 1.10.1、torchvision 0.11.2及torchaudio 0.10.1版本。通过指定URL确保正确安装适用于CUDA10.2的PyTorch相关包。

# CUDA 10.2
pip install torch==1.10.1+cu102 torchvision==0.11.2+cu102 torchaudio==0.10.1 -f https://download.pytorch.org/whl/cu102/torch_stable.html

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

彼岸花使

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
分享

复制链接

分享到 QQ

分享到新浪微博

扫一扫
举报

举报

Pytorch详解-模型模块(RNN,CNN,FNN,LSTM,GRU,TCN,Transformer)

算法工程师

09-17

2187

Pytorch详解-模型模块(RNN,CNN,FNN,LSTM,GRU,TCN,Transformer)

PyTorch实战：音乐生成

AI天才研究院

02-19

538

音乐是一种独特的艺术形式，能够表达情感、讲述故事、抚慰心灵。随着人工智能技术的发展，使用深度学习模型自动生成音乐已经成为一个热门的研究方向。音乐生成不仅可以帮助音乐家和作曲家获得灵感，还能创造出全新的音乐风格和体验。PyTorch是一个流行的深度学习框架，提供了灵活的API和强大的GPU加速能力，非常适合用于音乐生成任务。本文将详细介绍如何使用PyTorch实现音乐生成，重点探讨几种经典的模型架构，如LSTM、Transformer和GAN。

参与评论您还未登录，请先登录后发表或查看评论

torch/transformers版本查看，transformers不同版本执行时，带来不同的bug

最新发布

weixin_47391305的博客

04-18

2355

大模型 Transformers 各依赖包对照

安装使用 pytorch 1.9.1, transformers 4.11.3

wangxiaosu的专栏

10-19

7955

以前跑实验用的pytorch和transformers的版本都比较低，最近的论文放出的代码使用的两个软件的版本都已经很高了，为了减少修代码的麻烦，决定升级这两个软件的版本。废了一番周折。 1、anaconda创建新的环境，安装上述两个包之前，先安装python，python不要安装当前的最高版本（估计最高版本还不被pytorch 1.9.1支持），我选择的是安装python 3.7.10 conda install python=3.7.10 2、安装pytorch 完成1后就可以使用pip安装，

深度学习学习——pytorch1.9 transformer

m0_37876745的博客

09-12

1357

参考文献： TRANSFORMERENCODERLAYER https://pytorch.org/docs/stable/generated/torch.nn.TransformerEncoderLayer.html pytorch多gpu并行训练 https://zhuanlan.zhihu.com/p/86441879 基于Transformer模型的智能选股策略https://bigquant.com/community/t/topic/199242 Transformer在量化投资的应用 http

torch对应版本torchvision和torchAudio

04-18

2万+

返回False，那么是无法使用gpu的，所以需要找到pytorch和cuda的对应关系，不能安装cpu版本，不然无法使用gpu的。

pytorch与python版本对应表_PyTorch 历史版本安装-祖传老代码运行刚需

weixin_39901571的博客

12-04

5392

【transfomers2.11.0】安装版本记录

Jack_Kuo的博客

04-13

3836

问题在安装transformers==2.11.0的时候总是出错，这里总结一个正确的版本匹配解决 python==3.6.10 # 注意3.6.0 会与安装的torch不兼容报错 pip install torch===1.4.0+cpu -f https://download.pytorch.org/whl/torch_stable.html # cpu版本的，其他的自行更改 pip install transformers==2.11.0 ...

tensorflow-gpu==2.6对应的 transformers 版本

wangmengmeng99的博客

04-11

655

tensorflow-gpu==2.6对应的 transformers 版本

查看torch是否可以调用GPU、CUDA版本、cuDNN版本、torch版本、transformers版本，更新pip版本

weixin_49899130的博客

03-07

1470

查看GPU是否可用、CUDA版本、cuDNN版本、torch版本、transformers版本

WGAN自动生成动漫头像PyTorch 代码

05-13

以下是使用WGAN生成动漫头像的PyTorch代码，其中使用了DCGAN的结构和WGAN的损失函数。首先需要导入必要的库： ```python import torch import torch.nn as nn import torch.optim as optim import torchvision.utils as vutils import torchvision.datasets as dset import torchvision.transforms as transforms from torch.utils.data import DataLoader from torch.autograd import Variable import numpy as np import matplotlib.pyplot as plt import matplotlib.animation as animation from IPython.display import HTML ``` 接下来定义一些超参数： ```python # Root directory for dataset dataroot = "./data" # Number of workers for dataloader workers = 2 # Batch size during training batch_size = 64 # Spatial size of training images. All images will be resized to this # size using a transformer. image_size = 64 # Number of channels in the training images. For color images this is 3 nc = 3 # Size of z latent vector (i.e. size of generator input) nz = 100 # Size of feature maps in generator ngf = 64 # Size of feature maps in discriminator ndf = 64 # Number of training epochs num_epochs = 5 # Learning rate for optimizers lr = 0.00005 # Beta1 hyperparam for Adam optimizers beta1 = 0.5 # Number of GPUs available. Use 0 for CPU mode. ngpu = 0 # Number of critic iterations per generator iteration n_critic = 5 # Clipping parameter for WGAN clip_value = 0.01 # Output directory for generated images output_dir = "./output" ``` 接下来定义数据加载器： ```python # Create the dataset dataset = dset.ImageFolder(root=dataroot, transform=transforms.Compose([ transforms.Resize(image_size), transforms.CenterCrop(image_size), transforms.ToTensor(), transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5)), ])) # Create the dataloader dataloader = torch.utils.data.DataLoader(dataset, batch_size=batch_size, shuffle=True, num_workers=workers) ``` 接下来定义生成器和判别器的结构： ```python # Generator Code class Generator(nn.Module): def __init__(self, ngpu): super(Generator, self).__init__() self.ngpu = ngpu self.main = nn.Sequential( # input is Z, going into a convolution nn.ConvTranspose2d(nz, ngf * 8, 4, 1, 0, bias=False), nn.BatchNorm2d(ngf * 8), nn.ReLU(True), # state size. (ngf*8) x 4 x 4 nn.ConvTranspose2d(ngf * 8, ngf * 4, 4, 2, 1, bias=False), nn.BatchNorm2d(ngf * 4), nn.ReLU(True), # state size. (ngf*4) x 8 x 8 nn.ConvTranspose2d(ngf * 4, ngf * 2, 4, 2, 1, bias=False), nn.BatchNorm2d(ngf * 2), nn.ReLU(True), # state size. (ngf*2) x 16 x 16 nn.ConvTranspose2d(ngf * 2, ngf, 4, 2, 1, bias=False), nn.BatchNorm2d(ngf), nn.ReLU(True), # state size. (ngf) x 32 x 32 nn.ConvTranspose2d(ngf, nc, 4, 2, 1, bias=False), nn.Tanh() # state size. (nc) x 64 x 64 ) def forward(self, input): return self.main(input) # Discriminator Code class Discriminator(nn.Module): def __init__(self, ngpu): super(Discriminator, self).__init__() self.ngpu = ngpu self.main = nn.Sequential( # input is (nc) x 64 x 64 nn.Conv2d(nc, ndf, 4, 2, 1, bias=False), nn.LeakyReLU(0.2, inplace=True), # state size. (ndf) x 32 x 32 nn.Conv2d(ndf, ndf * 2, 4, 2, 1, bias=False), nn.BatchNorm2d(ndf * 2), nn.LeakyReLU(0.2, inplace=True), # state size. (ndf*2) x 16 x 16 nn.Conv2d(ndf * 2, ndf * 4, 4, 2, 1, bias=False), nn.BatchNorm2d(ndf * 4), nn.LeakyReLU(0.2, inplace=True), # state size. (ndf*4) x 8 x 8 nn.Conv2d(ndf * 4, ndf * 8, 4, 2, 1, bias=False), nn.BatchNorm2d(ndf * 8), nn.LeakyReLU(0.2, inplace=True), # state size. (ndf*8) x 4 x 4 nn.Conv2d(ndf * 8, 1, 4, 1, 0, bias=False), ) def forward(self, input): return self.main(input).view(-1, 1).squeeze(1) ``` 接下来定义初始化生成器和判别器： ```python # Initialize generator and discriminator netG = Generator(ngpu).cuda() netD = Discriminator(ngpu).cuda() ``` 接下来定义优化器和损失函数： ```python # Initialize optimizer optimizerD = optim.RMSprop(netD.parameters(), lr=lr) optimizerG = optim.RMSprop(netG.parameters(), lr=lr) # Initialize loss functions criterion = nn.BCEWithLogitsLoss() ``` 接下来定义训练过程： ```python # Training Loop # Lists to keep track of progress img_list = [] G_losses = [] D_losses = [] iters = 0 print("Starting Training Loop...") # For each epoch for epoch in range(num_epochs): # For each batch in the dataloader for i, data in enumerate(dataloader, 0): ############################ # (1) Update D network ########################### for n in range(n_critic): # Initialize gradients netD.zero_grad() # Format batch real_cpu = data[0].cuda() b_size = real_cpu.size(0) label = torch.full((b_size,), 1, device=torch.device('cuda')) # Forward pass real batch through D output = netD(real_cpu).view(-1) # Calculate loss on real batch D_loss_real = -output.mean() # Calculate gradients for D in backward pass D_loss_real.backward() # Sample noise as input for G noise = torch.randn(b_size, nz, 1, 1, device=torch.device('cuda')) # Generate fake image batch with G fake = netG(noise) # Classify fake batch with D output = netD(fake.detach()).view(-1) # Calculate loss on fake batch D_loss_fake = output.mean() # Calculate gradients for D in backward pass D_loss_fake.backward() # Compute gradient penalty alpha = torch.rand(b_size, 1, 1, 1).cuda() x_hat = (alpha * real_cpu.data + (1 - alpha) * fake.data).requires_grad_(True) out = netD(x_hat).view(-1) grad = torch.autograd.grad(outputs=out, inputs=x_hat, grad_outputs=torch.ones(out.size()).cuda(), create_graph=True, retain_graph=True, only_inputs=True)[0] gp = ((grad.norm(2, dim=1) - 1) ** 2).mean() * 10 gp.backward() # Add the gradients from the all critic iterations D_loss = D_loss_real + D_loss_fake + gp Wasserstein_D = D_loss_real - D_loss_fake # Update D optimizerD.step() # Clip weights of D for p in netD.parameters(): p.data.clamp_(-clip_value, clip_value) ############################ # (2) Update G network ########################### netG.zero_grad() # Generate a batch of images noise = torch.randn(b_size, nz, 1, 1, device=torch.device('cuda')) fake = netG(noise) # Classify the generated batch with D output = netD(fake).view(-1) # Calculate G's loss based on this output G_loss = -output.mean() # Update G G_loss.backward() optimizerG.step() # Output training stats if i % 50 == 0: print('[%d/%d][%d/%d]\tLoss_D: %.4f\tLoss_G: %.4f\tWasserstein_D: %.4f' % (epoch, num_epochs, i, len(dataloader), D_loss.item(), G_loss.item(), Wasserstein_D.item())) # Save Losses for plotting later G_losses.append(G_loss.item()) D_losses.append(D_loss.item()) # Check how the generator is doing by saving G's output on fixed noise if (iters % 500 == 0) or ((epoch == num_epochs-1) and (i == len(dataloader)-1)): with torch.no_grad(): fake = netG(fixed_noise).detach().cpu() img_list.append(vutils.make_grid(fake, padding=2, normalize=True)) iters += 1 ``` 接下来定义输出结果： ```python # Output generated images fig = plt.figure(figsize=(8, 8)) plt.axis("off") ims = [[plt.imshow(np.transpose(i, (1, 2, 0)), animated=True)] for i in img_list] ani = animation.ArtistAnimation(fig, ims, interval=1000, repeat_delay=1000, blit=True) HTML(ani.to_jshtml()) # Save generated images as GIF file fig = plt.figure(figsize=(8, 8)) plt.axis("off") ims = [[plt.imshow(np.transpose(i, (1, 2, 0)), animated=True)] for i in img_list] ani = animation.ArtistAnimation(fig, ims, interval=1000, repeat_delay=1000, blit=True) ani.save(output_dir + "/anime.gif", writer='pillow', fps=2) ```