A Brief Analysis of Baidu's ERNIE Series of Pre-trained Language Models (3): ERNIE 3.0

ERNIE 3.0: LARGE-SCALE KNOWLEDGE ENHANCED PRE-TRAINING FOR LANGUAGE UNDERSTANDING AND GENERATION
Sun Y, Wang S, Feng S, et al. ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation[J]. arXiv preprint arXiv:2107.02137, 2021.

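Keywords: 10-billion-parameter large model \ Transformer-XL \ Knowledge graph
Knowledge-graph triples are added during pre-training, and the model's basic building block changes from the Transformer used in ERNIE 2.0 to Transformer-XL.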
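
To make the idea of mixing knowledge-graph triples into pre-training more concrete, here is a minimal sketch. It assumes a knowledge-text prediction style objective in which a triple is concatenated with a sentence that mentions it, and either the relation or some of the sentence words are masked. The function name, the `[MASK]`/`[SEP]` token strings, and the probabilities are illustrative assumptions, not taken from the ERNIE 3.0 code.

```python
import random

MASK = "[MASK]"  # assumed mask token string
SEP = "[SEP]"    # assumed separator between the triple and the sentence


def build_knowledge_text_example(triple, sentence_tokens, rng,
                                 mask_relation_prob=0.5, word_mask_prob=0.15):
    """Concatenate a (head, relation, tail) triple with a sentence and mask part of it.

    Returns the token list and a dict mapping masked positions to original tokens.
    """
    head, relation, tail = triple
    triple_tokens = [head, relation, tail]
    text_tokens = list(sentence_tokens)
    labels = {}

    if rng.random() < mask_relation_prob:
        # Hide the relation: it has to be recovered from the sentence.
        labels[1] = relation
        triple_tokens[1] = MASK
    else:
        # Hide some sentence words: the triple can help recover them.
        offset = len(triple_tokens) + 1  # skip the triple and the separator
        for i, tok in enumerate(text_tokens):
            if rng.random() < word_mask_prob:
                labels[offset + i] = tok
                text_tokens[i] = MASK

    tokens = triple_tokens + [SEP] + text_tokens
    return tokens, labels


# Hypothetical triple and sentence, for illustration only.
tokens, labels = build_knowledge_text_example(
    ("Andersen", "born_in", "Odense"),
    ["Andersen", "was", "born", "in", "Odense"],
    random.Random(0),
)
print(tokens, labels)
```

Masking only one side at a time means the model has to use the other side as evidence, which is the point of pairing structured triples with plain text.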

The model can be tried out on Baidu Wenxin: https://wenxin.baidu.com/wenxin/ernie

1. Basic features of ERNIE 3.0

(1) Parameter scale: 10 billion
(2) Introduces a knowledge graph

  • Large-scale knowledge-enhanced model: trained on a 4TB corpus consisting of plain texts and a large-scale knowledge graph

(3) Fuses an auto-regressive network and an auto-encoding network

  • handle both natural language understanding and generation tasks with zero-shot learning, few-shot learning or fine-tuning.

(4) Model performance

  • outperforms the state-of-the-art models on 54 Chinese NLP tasks
  • English version achieves the first place on the SuperGLUE benchmark (July 3, 2021), surpassing the human performance by +0.8% (90.6% vs. 89.8%)
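
Feature (3) above combines an auto-encoding network (suited to understanding) with an auto-regressive network (suited to generation). One generic way to see the difference between the two is the attention mask each uses. The sketch below is a textbook illustration with assumed helper names, not ERNIE 3.0's actual implementation.

```python
def bidirectional_mask(n):
    """Auto-encoding style: every position may attend to every other position."""
    return [[1] * n for _ in range(n)]


def causal_mask(n):
    """Auto-regressive style: position i may attend only to positions j <= i."""
    return [[1 if j <= i else 0 for j in range(n)] for i in range(n)]


for row in causal_mask(3):
    print(row)
# [1, 0, 0]
# [1, 1, 0]
# [1, 1, 1]
```

With the causal mask a token never sees its future context, which is what lets the same kind of network be used for left-to-right generation.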

2. ERNIE 3.0 framework

Continual Multi-Paradigms Unified Pre-training Framework
(1) Universal Representation Module: once pre-training is complete, the universal semantic representation layer is no longer updated (not even during fine-tuning).
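
Following the description above, keeping the universal representation layer fixed during fine-tuning amounts to skipping its parameters in the update step. The toy sketch below uses plain Python floats in place of real tensors. The parameter names (`universal.w`, `task_nlu.w`) and the `sgd_step` helper are hypothetical and only illustrate the freezing idea.

```python
def sgd_step(params, grads, frozen, lr=0.1):
    """Return updated parameters, leaving every name in `frozen` untouched."""
    return {
        name: value if name in frozen else value - lr * grads[name]
        for name, value in params.items()
    }


# Hypothetical parameters: one shared (universal) weight, one task-specific weight.
params = {"universal.w": 1.0, "task_nlu.w": 2.0}
grads = {"universal.w": 0.5, "task_nlu.w": 0.5}

updated = sgd_step(params, grads, frozen={"universal.w"})
print(updated)
```

In a real framework the same effect is usually obtained by excluding the shared parameters from the optimizer or disabling their gradients.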
