ERNIE 3.0: LARGE-SCALE KNOWLEDGE ENHANCED PRE-TRAINING FOR LANGUAGE UNDERSTANDING AND GENERATION
Sun Y, Wang S, Feng S, et al. ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation[J]. arXiv preprint arXiv:2107.02137, 2021.
Keywords: 10-billion-parameter large model, Transformer-XL, knowledge graph
Pre-training incorporates knowledge-graph triples, and the basic model unit changes from the Transformer used in ERNIE 2.0 to Transformer-XL.
The model can be tried out via Baidu Wenxin: https://wenxin.baidu.com/wenxin/ernie
1、Basic characteristics of ERNIE 3.0
(1) Parameter scale: 10 billion
(2) Knowledge graph integration
- Large-scale knowledge-enhanced model: trained on a 4 TB corpus consisting of plain texts and a large-scale knowledge graph
(3) Fuses an auto-regressive network and an auto-encoding network
- Handles both natural language understanding and generation tasks via zero-shot learning, few-shot learning, or fine-tuning
(4) Model performance
- Outperforms state-of-the-art models on 54 Chinese NLP tasks
- The English version took first place on the SuperGLUE benchmark (July 3, 2021), surpassing human performance by 0.8% (90.6% vs. 89.8%)
2、ERNIE 3.0 framework
Continual Multi-Paradigm Unified Pre-training Framework
(1) Universal Representation Module: once pre-training is complete, the universal semantic representation layer is frozen and no longer updated (not even during fine-tuning).
(2) Task-specific Representation Modules: updated when fine-tuning on downstream tasks, which keeps fine-tuning efficient; the NLU and NLG modules do not share parameters.
- NLU-specific representation module (a bidirectional modeling network, auto-encoding)
- NLG-specific representation module (a unidirectional modeling network, auto-regressive)
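The split between a frozen universal backbone and trainable task-specific heads can be illustrated with a toy sketch (class names and parameter counts are hypothetical, not taken from the paper):

```python
# Toy sketch of the ERNIE 3.0 module layout: a shared universal backbone
# whose parameters are frozen after pre-training, plus separate NLU and
# NLG task-specific modules that remain trainable during fine-tuning.
# All names and sizes here are illustrative assumptions.

class Module:
    def __init__(self, name, n_params, trainable=True):
        self.name = name
        self.n_params = n_params
        self.trainable = trainable

class Ernie3Sketch:
    def __init__(self):
        # Large shared backbone: frozen once pre-training is done.
        self.universal = Module("universal_repr", n_params=9_000_000_000,
                                trainable=False)
        # Task-specific modules: NLU (bidirectional, auto-encoding) and
        # NLG (unidirectional, auto-regressive); no parameter sharing.
        self.nlu_head = Module("nlu_repr", n_params=500_000_000)
        self.nlg_head = Module("nlg_repr", n_params=500_000_000)

    def finetune_params(self, task):
        # During fine-tuning only the relevant task head is updated;
        # the frozen backbone is filtered out here.
        head = self.nlu_head if task == "nlu" else self.nlg_head
        return [m for m in (self.universal, head) if m.trainable]

model = Ernie3Sketch()
print([m.name for m in model.finetune_params("nlu")])  # → ['nlu_repr']
```

Because only the small task head is updated, fine-tuning touches a fraction of the 10-billion-parameter model.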
3、Pre-training tasks
Multiple pre-training tasks at different semantic levels are used:
(1) Word-aware: capture lexical information
(2) Structure-aware: capture syntactic information
(3) Knowledge-aware: improve knowledge memorization and reasoning
3.1 Word-aware
(1) Knowledge Masked Language Modeling: same as ERNIE 1.0
(2) Document Language Modeling: following ERNIE-Doc, unidirectional language-model pre-training over longer texts, mainly to improve text-generation ability
3.2 Structure-aware
(1) Sentence Reordering: same as ERNIE 2.0
(2) Sentence Distance: same as ERNIE 2.0
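A sentence-reordering training instance can be sketched as follows (assuming the ERNIE 2.0 recipe: shuffle k segments and classify which of the k! permutations was applied; the function name is illustrative):

```python
# Hedged sketch of building a Sentence Reordering instance: the segments
# of a paragraph are permuted, and the label is the index of the applied
# permutation, so the task is a k!-way classification problem.
import itertools
import random

def make_reordering_example(sentences, seed=0):
    k = len(sentences)
    perms = list(itertools.permutations(range(k)))
    perm = random.Random(seed).choice(perms)       # pick a permutation
    shuffled = [sentences[i] for i in perm]        # shuffled input
    return shuffled, perms.index(perm)             # label in [0, k!)

shuffled, label = make_reordering_example(["s1", "s2", "s3"])
```

The model sees `shuffled` and must predict `label`, from which the original order can be reconstructed.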
3.3 Knowledge-aware
Universal Knowledge-Text Prediction (parallel pre-training of text and knowledge)
(1) Training method: parallel pre-training on massive unsupervised text (unstructured) and a large-scale knowledge graph (structured).
(2) Training corpus: pairs formed from 50 million knowledge-graph triples and the related passages in the 4 TB corpus.
(3) Result: jointly feeding entity relations from the large-scale knowledge graph and large-scale text data into the model for joint masked training promotes information sharing between structured knowledge and unstructured text, greatly improving the model's knowledge memorization and reasoning ability.
(4) Concrete procedure of the parallel text-knowledge pre-training:
Step 1: run a knowledge-graph mining algorithm over a sentence to extract triples.
E.g.: The Nightingale is written by Danish author Hans Christian Andersen.
Step 2: obtain the knowledge-graph triple.
E.g.: <Andersen, Write, Nightingale>
Step 3: concatenate the triple and the original sentence as the model input.
E.g.: Andersen Write Nightingale [SEP] The Nightingale is written by Danish author Hans Christian Andersen [SEP]
To make the model learn knowledge-graph relations, two masking strategies are used:
(1) Remove an entity or the relation from the triple, then predict the masked part of segment A from segment B.
(2) Remove an entity from the sentence, then predict the masked part of segment B from segment A.
The triple (segment A) represents a pair of entities and their relation; the relation carries semantic information (e.g., a logical relation), which is generally regarded as knowledge.
The original sentence (segment B) represents the raw text (plain text).
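The two masking strategies above can be sketched as input construction (the `[MASK]`/`[SEP]` token conventions follow BERT-style formatting; the exact ERNIE 3.0 preprocessing is an assumption):

```python
# Hedged sketch of building a Universal Knowledge-Text Prediction (UKTP)
# input: concatenate the triple (segment A) with the source sentence
# (segment B), masking either a triple element (strategy 1) or an entity
# mention in the sentence (strategy 2).

def build_uktp_input(triple, sentence, mask_side="triple", mask_item=None):
    head, relation, tail = triple
    seg_a = [head, relation, tail]
    seg_b = sentence.split()
    if mask_side == "triple":
        # Strategy 1: mask part of the triple; predict it from the sentence.
        seg_a[seg_a.index(mask_item)] = "[MASK]"
    else:
        # Strategy 2: mask an entity in the sentence; predict it from the triple.
        seg_b = ["[MASK]" if tok == mask_item else tok for tok in seg_b]
    return " ".join(seg_a) + " [SEP] " + " ".join(seg_b) + " [SEP]"

example = build_uktp_input(
    ("Andersen", "Write", "Nightingale"),
    "The Nightingale is written by Danish author Hans Christian Andersen",
    mask_side="triple", mask_item="Write")
print(example)
# → "Andersen [MASK] Nightingale [SEP] The Nightingale is written by
#    Danish author Hans Christian Andersen [SEP]"
```

Masking the relation forces the model to infer "Write" from the plain text; masking "Andersen" in the sentence forces it to use the structured triple.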
4、ERNIE 3.0 training data
(1) Data scale: 4 TB of storage across 11 different categories
(2) Data sources
(2.1) General data:
- Baike, Wikipedia, feed
- Baidu Search (including Baijiahao, Zhidao, Tieba, Experience)
- Webtext, QA-long, QA-short, Poetry & Couplet
(2.2) Domain-specific data: the medical, law, and financial areas, plus the Baidu knowledge graph with more than 50 million facts
5、ERNIE 3.0 training details
(1) Pre-training algorithm
Progressive training improves training stability:
- Vanilla Transformer: learning-rate warm-up strategy
- ERNIE 3.0: progressive learning strategy that gradually increases training factors including the input sequence length, the batch size, the learning rate, and the dropout rate
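A minimal sketch of such a progressive ramp-up, assuming linear interpolation over a warm-up stage (the stage length, start values, and linear shape are assumptions, not the paper's settings):

```python
# Hedged sketch of progressive learning: ramp several training factors
# (sequence length, batch size, learning rate, dropout) together from a
# small starting value to the full target over a warm-up stage.

def progressive_schedule(step, warmup_steps=10_000,
                         seq_len=(128, 512), batch=(1024, 6144),
                         lr=(1e-5, 1e-4), dropout=(0.0, 0.1)):
    frac = min(step / warmup_steps, 1.0)           # progress in [0, 1]
    interp = lambda lo_hi: lo_hi[0] + frac * (lo_hi[1] - lo_hi[0])
    return {
        "seq_len": int(interp(seq_len)),
        "batch_size": int(interp(batch)),
        "learning_rate": interp(lr),
        "dropout": interp(dropout),
    }

print(progressive_schedule(0))       # start: short sequences, small batch
print(progressive_schedule(10_000))  # end: full 512 / 6144 / 1e-4 / 0.1
```

Starting small stabilizes early optimization; by the end of warm-up the run reaches the full settings listed in the parameter section below.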
(2) Data preprocessing
- Deduplication
  - Character level: a character repeated many times in a row is replaced by a single occurrence
  - Paragraph level: consecutive repetitions of the same paragraph are replaced by a single paragraph
  - Document level: duplicate documents are filtered by comparing the sum of the MD5 (Message Digest Algorithm 5) hashes of each document's 3 longest sentences
- Filtering: drop sentences with fewer than 10 words
- Sentence segmentation and word segmentation
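The character-level and document-level heuristics can be sketched like this (the run-length threshold and the naive sentence splitting are simplifying assumptions):

```python
# Hedged sketch of two of the dedup heuristics described in the notes.
import hashlib
import re

def collapse_char_runs(text):
    # Character-level dedup: a character repeated consecutively many
    # times is collapsed to one (the 4+ threshold is an assumption).
    return re.sub(r"(.)\1{3,}", r"\1", text)

def doc_dedup_key(document):
    # Document-level dedup: the sum of the MD5 digests (as integers) of
    # the document's three longest sentences serves as the duplicate key.
    sentences = [s.strip() for s in document.split(".") if s.strip()]
    longest3 = sorted(sentences, key=len, reverse=True)[:3]
    return sum(int(hashlib.md5(s.encode()).hexdigest(), 16)
               for s in longest3)

print(collapse_char_runs("soooooo good"))  # → "so good"
```

Two documents sharing the same three longest sentences get the same key, so one of them can be dropped without a full-text comparison.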
(3) Parameter settings
- Total parameters: 10 billion
- Activation function: GeLU
- Maximum context sequence length: 512
- Memory length for language generation: 128
- Batch size: 6144
- Optimizer: Adam with a learning rate of 1e-4
- Training tokens: 375 billion
- Hardware: 384 NVIDIA V100 GPU cards
- Deep-learning framework: PaddlePaddle
- Parameter sharding
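The 128-token memory length refers to the Transformer-XL style segment recurrence used for generation; the cache bookkeeping can be sketched as follows (a toy abstraction with string "hidden states", not a real implementation):

```python
# Hedged sketch of Transformer-XL segment-level recurrence: hidden
# states from the previous segment are cached and prepended to the
# current segment's context, extending the effective context window.

class SegmentMemory:
    def __init__(self, mem_len=128):
        self.mem_len = mem_len
        self.mem = []  # cached hidden states from earlier segments

    def extend(self, hidden_states):
        # Attention context = cached memory + current segment; the cache
        # then keeps only the last `mem_len` states (in a real model,
        # gradients would be stopped through the cache).
        context = self.mem + list(hidden_states)
        self.mem = context[-self.mem_len:]
        return context

mem = SegmentMemory(mem_len=4)
mem.extend(["h1", "h2", "h3"])        # first segment: no memory yet
ctx = mem.extend(["h4", "h5", "h6"])  # second segment also sees h1..h3
print(len(ctx), mem.mem)  # → 6 ['h3', 'h4', 'h5', 'h6']
```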
6、Experiments
54 NLP tasks in total.
6.1、Experiments on Fine-tuning Tasks
(1) Natural Language Understanding Tasks
45 tasks of 14 types, with an average improvement of about 5%; anaphora resolution improved from 69.7% to 95.4%.
- Sentiment Analysis
- Opinion Extraction
- Natural Language Inference
- Winograd Schema Challenge (anaphora resolution)
- Relation Extraction
- Event Extraction
- Semantic Similarity
- Chinese News Classification
- Closed-Book Question Answering
- Named Entity Recognition
- Machine Reading Comprehension
- Legal Documents Analysis
- Cant Understanding
- Document Retrieval
(2) Natural Language Generation Tasks
9 tasks of 7 types, with an average improvement of 7.4%.
- Text Summarization
- Question Generation
- Closed-Book Question Answering
- Math
- Advertisement Generation
- Translation
- Dialogue Generation
(3) LUGE benchmark
- 5.36% improvement
6.2、Experiments on Zero-shot Learning
- NLG: average accuracy improvement of 5.3%
6.3、Experiments on SuperGLUE
- English model
- Surpasses human performance by 0.8% (90.6% vs. 89.8%)
https://super.gluebenchmark.com/leaderboard
7 Key techniques
- The Effectiveness of the Task-specific Representation Modules
- Universal Knowledge-Text Prediction
- Progressive Learning to Speed up Convergence