T5模型在OCNLI 训练微调 4

CSPhD-winston-杨帆

已于 2024-07-25 19:39:33 修改

阅读量401

点赞数 4

分类专栏： LLMs实战文章标签： T5

于 2024-05-31 02:11:50 首次发布

本文链接：https://blog.csdn.net/WhiffeYF/article/details/139333854

版权

LLMs实战专栏收录该内容

4 篇文章 1 订阅

订阅专栏

资料

我的代码：https://github.com/Whiffe/Bert-OCNLI/tree/main/T5-OCNLI-yf

过去的内容：
Bert 在 OCNLI 训练微调
 Bert 在 OCNLI 训练微调 2
BERT系列模型在OCNLI 训练微调 3

模型下载与训练测试

mt5-base

conda install -c conda-forge sentencepiece

https://huggingface.co/google/mt5-base/tree/main
在这里插入图片描述

调用t5模型的过程：https://blog.csdn.net/znevegiveup1/article/details/121300828

训练测试结果：

train.50k.json、max_length=128、batch_size=32、dropout=0.1、lr=5e-5、epochs=10
准确率：70.13
train.50k.json、max_length=128、batch_size=32、dropout=0.3、lr=5e-5、epochs=10
准确率：70.03
train.50k.json、max_length=128、batch_size=32、dropout=0.2、lr=5e-5、epochs=10
准确率：67.43
train.50k.json、max_length=128、batch_size=32、dropout=0.1、lr=1e-5、epochs=10
准确率：58.9%
train.50k.json、max_length=128、batch_size=32、dropout=0.1、lr=5e-5、epochs=15
准确率：70.7%
train.50k.json、max_length=128、batch_size=16、dropout=0.1、lr=5e-5、epochs=15
准确率：37.63%
train.50k.json、max_length=128、batch_size=64、dropout=0.1、lr=5e-5、epochs=15
准确率：37.8%

t5-base

https://huggingface.co/google-t5/t5-base/tree/main

在这里插入图片描述
训练测试准确率

train.50k.json、max_length=128、batch_size=32、dropout=0.1、lr=5e-5、epochs=15
准确率：37.6%

nlp_mt5_zero-shot-augment_chinese-base

全任务零样本学习-mT5分类增强版-中文-base · 模型库 (modelscope.cn)：https://www.modelscope.cn/models/iic/nlp_mt5_zero-shot-augment_chinese-base/files

在这里插入图片描述
训练测试准确率：

train.50k.json、max_length=128、batch_size=32、dropout=0.1、lr=5e-5、epochs=15
准确率：71.6%

mt5-large

https://huggingface.co/google/mt5-large/tree/main
在这里插入图片描述
训练测试准确率：

在这里插入代码片

CSPhD-winston-杨帆

关注

4
点赞
踩
7

收藏

觉得还不错? 一键收藏
0
评论
T5模型在OCNLI 训练微调 4

https://huggingface.co/google-t5/t5-base/tree/mainconda install -c conda-forge sentencepiece
复制链接

扫一扫

专栏目录