InstructGPT：Training language models to follow instructions with human feedback

最新推荐文章于 2024-11-07 17:15:19 发布

YingJingh

最新推荐文章于 2024-11-07 17:15:19 发布

阅读量1k

点赞数

分类专栏：论文记录文章标签：语言模型人工智能自然语言处理

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/Hekena/article/details/131163801

版权

论文记录专栏收录该内容

147 篇文章 9 订阅

订阅专栏

Training language models to follow instructions with human feedback

通过人类反馈的微调，在广泛的任务中使语言模型与用户的意图保持一致
aligning language models with user intent on a wide range of tasks by fine-tuning
with human feedback

实验动机

language models to be helpful (they should help the user solve their task), honest (they shouldn’t fabricate information or mislead the user), and harmless (they should not cause physical, psychological, or social harm to people or the environment).

实验过程

我们首先聘请了一个由40名承包商组成的团队，根据他们在筛选测试中的表现，为我们的数据贴上标签（详见3.4节和附录B.1）。
We then collect a dataset of human-written demonstrations of the
desired output behavior on (mostly English) prompts submitted to
the OpenAI API3 and some labeler-written prompts
we collect a dataset of human-labeled comparisons between
outputs from our models on a larger set of API prompts.
We then train a reward model (RM) on this dataset to predict
which model output our labelers would prefer.
Finally, we use this RM as a reward function and fine-tune our supervised learning baseline to maximize this reward using the PPO algorithm (Schulman et al., 2017).

在这里插入图片描述

YingJingh CSDN认证博客专家 CSDN认证企业博客

码龄5年

345: 原创

2万+: 周排名

1万+: 总排名

27万+: 访问

: 等级

4163: 积分

2071: 粉丝

241: 获赞

49: 评论

712: 收藏

私信

关注

热门文章

分类专栏

最新评论

关系抽取：传统：UniRel: Unified Representation and Interaction for Joint Relational
snacksix: 你好，请问换成中文后效果如何
论文复现_1：Chinese NER Using Lattice LSTM
Fɪɴᴀʟ: YJ使用的词典可以分享一下吗
word中避免无引用源的方法
hx0520: 摸索了一下mac系统锁定域,按command+fn+f11
PDF相关的处理操作
haakaa: csdn这段确实好用
EMNLP-21-Enhanced Language Representation with Label Knowledge for Span Extraction-NER-融入label knowl
小阳不一样666666: 请问作者你复现成功了嘛？我按照论文设置超参数，但是对于ace2005效果只有0.84没有论文的0.86，这是我设置的情况：--task_type=ner --task_save_name=ner111 --data_dir=./data/ace2005 --data_name=ace2005 --model_name_or_path=D:/YangCode/data/bert-large-cased --model_name=SERS --output_dir=./outmodel --result_dir=./result --do_lower_case=False --first_label_file=./data/ace2005/processed/label_map.json --train_set=./data/ace2005/processed/train.json --dev_set=./data/ace2005/processed/dev.json --test_set=./data/ace2005/processed/test.json --label_str_file=./data/ace2005/processed/label_annotation.txt --overwrite_output_dir=True --exist_nested=True --do_train=True --is_chinese=False --val_step=20 --use_attn=True --seed=42 --max_seq_length=128 --dropout_rate=0.1 --learning_rate=3e-5 --task_layer_lr=2 --num_train_epochs=20能帮忙看看问题所在嘛？

大家在看

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

打赏作者

YingJingh 你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20

扫码支付：¥1

获取中

扫码支付

您的余额不足，请更换扫码支付或充值

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。