OpenAI GPT pytorch 实现微调 ROCStories 数据集

最新推荐文章于 2024-06-09 11:06:39 发布

DarrenXf

最新推荐文章于 2024-06-09 11:06:39 发布

阅读量1.4k

点赞数

分类专栏： pytorch NLP 深度学习人工智能 Deep Learning AI 文章标签： gpt pytorch nlp

本文链接：https://blog.csdn.net/DarrenXf/article/details/88695029

版权

人工智能同时被 3 个专栏收录

41 篇文章 0 订阅

订阅专栏

深度学习

26 篇文章 0 订阅

订阅专栏

Deep Learning

26 篇文章 0 订阅

订阅专栏

implement OpenAI gpt

papers

Gaussian Error Linear Units
translate to chinese

Attention Is All You Need
translate to chinese

Improving Language Understanding by Generative Pre-Training
translate to chinese

Language Models are Unsupervised Multitask Learners
translate to chinese

Dataset

use the dataset ROCStories

output

my device

31.4 GiB
Intel® Core™ i7-8700K CPU @ 3.70GHz × 12 
GeForce GTX 1080 Ti/PCIe/SSE2
64-bit

run three epochs

with 40 seconds an epoch for train and 23 seconds an epoch for eval,
so we need less than 3 minutes to get the results below.

show.py line:42 ***** Eval results *****
show.py line:44 eval_accuracy = 0.874933190807055
show.py line:44 eval_loss = 0.432198545569156
show.py line:44 train_loss = 2.201771383611565

run one epoch

with 40 seconds an epoch for train and 23 seconds an epoch for eval,
so we need about 1 minutes to get the results below.

show.py line:42 ***** Eval results *****
show.py line:44 eval_accuracy = 0.863174772848744
show.py line:44 eval_loss = 0.31887995107815814
show.py line:44 train_loss = 3.087455103540013

run 10 epochs

with 40 seconds an epoch for train and 23 seconds an epoch for eval,
so we need about 7 minutes to get the results below.

show.py line:42 ***** Eval results *****
show.py line:44 eval_accuracy = 0.8786745056119722
show.py line:44 eval_loss = 0.5693538990389142
show.py line:44 train_loss = 1.2477980831749418

run 30 epochs

with 40 seconds an epoch for train and 23 seconds an epoch for eval,
so we need about 21 minutes to get the results below.

show.py line:42 ***** Eval results *****
show.py line:44 eval_accuracy = 0.8727952966328166
show.py line:44 eval_loss = 0.6764590177271101
show.py line:44 train_loss = 0.23714334345780885

run directly

with 23 seconds an epoch for eval,
so we need about half a minutes to get the results below.

show.py line:42 ***** Eval results *****
show.py line:44 eval_accuracy = 0.5611972207375735
show.py line:44 eval_loss = 0.6895335352318919
show.py line:44 train_loss = 0.0

github

https://github.com/darr/gpt

DarrenXf

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
OpenAI GPT pytorch 实现微调 ROCStories 数据集

implement OpenAI gptpapersGaussian Error Linear Unitstranslate to chineseAttention Is All You Needtranslate to chineseImproving Language Understanding by Generative Pre-Trainingtranslate to chi...
复制链接

扫一扫