BERT Model Analysis

Latest recommended article published on 2023-09-25 09:58:08

Dimension_


Key parameters in BertConfig

vocab_size: Vocabulary size of the input_ids passed into BertModel.
hidden_size: Size of the encoder layers and the pooler layer. This is also the embedding size.
num_hidden_layers: Number of hidden layers in the Transformer encoder.
num_attention_heads: Number of attention heads for each attention layer in the Transformer encoder.
intermediate_size: The size of the "intermediate" (i.e., feed-forward) layer in the Transformer encoder.
hidden_act: The non-linear activation function (function or string) in the encoder and pooler.
hidden_dropout_prob: The dropout probability for all fully connected layers in the embeddings, encoder, and pooler.
attention_probs_dropout_prob: The dropout ratio for the attention probabilities.
max_position_embeddings: The maximum sequence length that this model might ever be used with. Typically set this to something large just in case (e.g., 512 or 1024 or 2048).
type_vocab_size: The vocabulary size of the token_type_ids passed into BertModel.
initializer_range: The standard deviation of the truncated_normal_initializer used for initializing all weight matrices.
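
To see how these parameters interact, here is a minimal Python sketch. BertConfigSketch and count_parameters are hypothetical names used only for illustration; they are not the real BertConfig class. The default values are an assumption that mirrors the commonly published BERT-base configuration, and the parameter count assumes the standard encoder layout (a bias on every dense layer, LayerNorm after the embeddings and after each attention and feed-forward block, no task-specific head).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BertConfigSketch:
    """Illustrative stand-in for BertConfig (hypothetical, not the library class)."""
    vocab_size: int                            # size of the token vocabulary
    hidden_size: int = 768                     # encoder/pooler width (embedding size)
    num_hidden_layers: int = 12                # number of Transformer encoder layers
    num_attention_heads: int = 12              # heads per attention layer
    intermediate_size: int = 3072              # feed-forward layer width
    hidden_act: str = "gelu"                   # activation in the encoder and pooler
    hidden_dropout_prob: float = 0.1           # dropout on fully connected layers
    attention_probs_dropout_prob: float = 0.1  # dropout on attention probabilities
    max_position_embeddings: int = 512         # longest supported sequence
    type_vocab_size: int = 2                   # number of token_type_ids values (assumed)
    initializer_range: float = 0.02            # stddev for weight initialization

    def __post_init__(self):
        # Each head receives an equal slice of the hidden vector.
        if self.hidden_size % self.num_attention_heads != 0:
            raise ValueError("hidden_size must be divisible by num_attention_heads")

    @property
    def head_size(self) -> int:
        return self.hidden_size // self.num_attention_heads


def count_parameters(cfg: BertConfigSketch) -> int:
    """Count weights under the assumed standard BERT encoder layout."""
    h, i = cfg.hidden_size, cfg.intermediate_size
    # Token, position, and segment embedding tables plus one LayerNorm.
    embeddings = (cfg.vocab_size + cfg.max_position_embeddings
                  + cfg.type_vocab_size) * h + 2 * h
    # Query, key, value, and output projections plus one LayerNorm.
    attention = 4 * (h * h + h) + 2 * h
    # Two dense layers (h -> i -> h) plus one LayerNorm.
    feed_forward = (h * i + i) + (i * h + h) + 2 * h
    # Pooler: a single dense layer of width h.
    pooler = h * h + h
    return embeddings + cfg.num_hidden_layers * (attention + feed_forward) + pooler


if __name__ == "__main__":
    cfg = BertConfigSketch(vocab_size=30522)
    print("head size:", cfg.head_size)
    print("parameters:", count_parameters(cfg))
```

Under these assumptions, hidden_size and num_attention_heads together fix the per-head width, while vocab_size and intermediate_size account for most of the weights.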
