Advanced NLP: BERT + BiLSTM Sentiment Analysis in Practice

PDD工程师

Published 2024-06-24 15:30:44

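Tags: natural language processing, bert, deep learning

Copyright notice: This is an original article by the author, released under the CC 4.0 BY-SA license. When reposting, please include a link to the original source and this notice.

Article link: https://blog.csdn.net/m0_61408947/article/details/139927800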

    import torch
    import torch.nn as nn
    from transformers import BertModel

    USE_CUDA = torch.cuda.is_available()


    # The class header and __init__ signature are missing from this excerpt; they are
    # reconstructed from the parameter list below, and the default values are assumptions.
    class bert_lstm(nn.Module):
        def __init__(self, bertpath, hidden_dim, output_size, n_layers,
                     bidirectional=True, drop_prob=0.5):
            super().__init__()

            self.output_size = output_size
            self.n_layers = n_layers
            self.hidden_dim = hidden_dim
            self.bidirectional = bidirectional

            # BERT: the key point is that the BERT model is embedded in the custom model.
            self.bert = BertModel.from_pretrained(bertpath)
            # Leave every BERT weight trainable so it is fine-tuned with the LSTM.
            for param in self.bert.parameters():
                param.requires_grad = True

            # LSTM layers (768 is the hidden size of the BERT output)
            self.lstm = nn.LSTM(768, hidden_dim, n_layers, batch_first=True,
                                bidirectional=bidirectional)

            # dropout layer
            self.dropout = nn.Dropout(drop_prob)

            # linear and sigmoid layers
            if bidirectional:
                self.fc = nn.Linear(hidden_dim * 2, output_size)
            else:
                self.fc = nn.Linear(hidden_dim, output_size)

            # self.sig = nn.Sigmoid()

        def forward(self, x, hidden):
            batch_size = x.size(0)

            # Generate the BERT token vectors: [batch, seq_len, 768]
            x = self.bert(x)[0]

            # lstm_out
            # x = x.float()
            lstm_out, (hidden_last, cn_last) = self.lstm(x, hidden)
            # print(lstm_out.shape)     # [32, 100, 768]
            # print(hidden_last.shape)  # [4, 32, 384]
            # print(cn_last.shape)      # [4, 32, 384]

            # The bidirectional case needs separate handling.
            if self.bidirectional:
                # Forward direction, last layer, final time step
                hidden_last_L = hidden_last[-2]
                # print(hidden_last_L.shape)  # [32, 384]
                # Backward direction, last layer, its final state (reached at the first token)
                hidden_last_R = hidden_last[-1]
                # print(hidden_last_R.shape)  # [32, 384]
                # Concatenate the two directions
                hidden_last_out = torch.cat([hidden_last_L, hidden_last_R], dim=-1)
                # print(hidden_last_out.shape, 'hidden_last_out')  # [32, 768]
            else:
                hidden_last_out = hidden_last[-1]  # [32, 384]

            # dropout and fully-connected layer
            out = self.dropout(hidden_last_out)
            # print(out.shape)  # [32, 768]
            out = self.fc(out)

            return out

        def init_hidden(self, batch_size):
            weight = next(self.parameters()).data

            # A bidirectional LSTM keeps one state per direction in every layer.
            number = 2 if self.bidirectional else 1
            shape = (self.n_layers * number, batch_size, self.hidden_dim)

            # Zero-initialised (h_0, c_0) pair
            hidden = (weight.new_zeros(shape).float(),
                      weight.new_zeros(shape).float())
            if USE_CUDA:
                hidden = tuple(h.cuda() for h in hidden)

            return hidden

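bert_lstm takes six parameters in total:

–bertpath: path to the pretrained BERT model.

–hidden_dim: size of the LSTM hidden state (the number of hidden units, not the number of layers).

–output_size: number of classes.

–n_layers: number of LSTM layers.

–bidirectional: whether the LSTM is bidirectional.

–drop_prob: dropout probability.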
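The bidirectional branch of forward depends on how nn.LSTM lays out its final hidden state. The following is a minimal sketch, not part of the original article, that checks this layout on a bare nn.LSTM with random input standing in for the BERT output. All sizes except the 768 input width are arbitrary toy values chosen for illustration.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy sizes; 768 mirrors the BERT output width, the rest are arbitrary.
batch, seq_len, input_dim, hidden_dim, n_layers = 3, 5, 768, 4, 2

lstm = nn.LSTM(input_dim, hidden_dim, n_layers, batch_first=True, bidirectional=True)
x = torch.randn(batch, seq_len, input_dim)  # stand-in for BERT token vectors

lstm_out, (h_n, c_n) = lstm(x)  # an omitted initial state defaults to zeros

# h_n: [n_layers * 2, batch, hidden_dim]; the last two entries belong to the top layer.
forward_last = h_n[-2]   # top layer, forward direction, state after the last token
backward_last = h_n[-1]  # top layer, backward direction, state after the first token

# lstm_out: [batch, seq_len, 2 * hidden_dim], forward half first, backward half second.
same_forward = torch.allclose(forward_last, lstm_out[:, -1, :hidden_dim])
same_backward = torch.allclose(backward_last, lstm_out[:, 0, hidden_dim:])

# Same concatenation as in forward(): one vector of width 2 * hidden_dim per sample.
sentence_vec = torch.cat([forward_last, backward_last], dim=-1)
print(h_n.shape, sentence_vec.shape, same_forward, same_backward)
```

Both comparisons hold, which is why the classifier input has width hidden_dim*2 in the bidirectional case and why init_hidden allocates n_layers*2 state slices.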

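The model configuration is defined as follows:

    class ModelConfig:
        # The original listing also set batch_size = 2 here; the later
        # batch_size = 50 overrides it, so only the effective value is kept.
        output_size = 2
        hidden_dim = 384  # 768 / 2
        n_layers = 2
        lr = 2e-5
        bidirectional = True  # True selects a bidirectional LSTM

        # training params
        epochs = 10
        batch_size = 50
        print_every = 10
        clip = 5  # gradient clipping

        use_cuda = USE_CUDA  # USE_CUDA is torch.cuda.is_available()
        bert_path = 'bert-base-chinese'  # path to the pretrained BERT
        save_path = 'bert_bilstm.pth'  # path where the model is saved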

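batch_size: the batch size; set it according to the available GPU memory.

output_size: number of output classes, 2 in this example.

hidden_dim: size of the LSTM hidden state.

n_layers: number of LSTM layers.

bidirectional: whether the LSTM is bidirectional.

print_every: interval between progress printouts.

use_cuda: whether to use CUDA. It is enabled by default, because training without CUDA is too slow.

bert_path: folder that holds the pretrained model.

save_path: path where the model is saved.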
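The training loop is not part of this excerpt. As a rough sketch of how lr and clip from the configuration are typically used in one optimisation step, the example below runs a single step on a small stand-in model. The nn.Linear model, the Adam optimizer, the CrossEntropyLoss criterion and the random data are all assumptions made for illustration, not the article's code.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

clip = 5   # same value as ModelConfig.clip
lr = 2e-5  # same value as ModelConfig.lr

# Hypothetical stand-in for bert_lstm so the sketch runs without downloading BERT.
model = nn.Linear(8, 2)
optimizer = torch.optim.Adam(model.parameters(), lr=lr)  # assumed optimizer
criterion = nn.CrossEntropyLoss()                        # assumed loss for 2 classes

inputs = torch.randn(50, 8) * 1000.0  # large inputs to provoke large gradients
labels = torch.randint(0, 2, (50,))

optimizer.zero_grad()
loss = criterion(model(inputs), labels)
loss.backward()

# clip_grad_norm_ rescales the gradients in place so that their total norm
# does not exceed clip, and returns the total norm measured before clipping.
norm_before = nn.utils.clip_grad_norm_(model.parameters(), clip)
norm_after = torch.norm(torch.stack([p.grad.norm() for p in model.parameters()]))

optimizer.step()
print(float(norm_before), float(norm_after))
```

Clipping by total norm keeps the direction of the update and only shrinks its length, which helps against exploding gradients in recurrent layers.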

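Environment setup

===============================================================

transformers and sentencepiece need to be installed. Run the following commands:

    conda install sentencepiece

    conda install transformers

If your configured conda channels do not provide these packages, they are also available from PyPI:

    pip install transformers sentencepiece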
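After installing, a quick check that the packages can be found may save a failed run later. This is a small sketch using only the standard library. The helper name missing_packages and the list of required names are assumptions based on the imports used in this article.

```python
import importlib.util


def missing_packages(names):
    """Return the names that cannot be found as importable top-level modules."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumed list, based on the libraries this article's code imports.
required = ["torch", "transformers", "sentencepiece", "pandas"]
missing = missing_packages(required)
print("missing packages:", missing if missing else "none")
```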

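Splitting the dataset

================================================================

The dataset is first split 7:3 into a training set and a test set. That test set is then split 1:1 into a validation set and a test set, which gives 70% / 15% / 15% overall.

The code is as follows: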

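    import pandas as pd
    import torch
    from transformers import BertTokenizer

    model_config = ModelConfig()

    data = pd.read_csv('caipindianping.csv', encoding='utf-8')
    # pretreatment() is the article's text-cleaning helper; it is not shown in this excerpt.
    result_comments = pretreatment(list(data['comment'].values))

    tokenizer = BertTokenizer.from_pretrained(model_config.bert_path)
    # Pad to the longest comment, truncate at 200 tokens, and return PyTorch tensors.
    result_comments_id = tokenizer(result_comments,
                                   padding=True,
                                   truncation=True,
                                   max_length=200,
                                   return_tensors='pt')

    X = result_comments_id['input_ids']
    y = torch.from_numpy(data['sentiment'].values).float()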
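The listing stops before the split itself. As a minimal sketch of the 7:3 and then 1:1 scheme described above, the function below shuffles sample indices and cuts them into three parts. The name split_indices, the fixed seed and the sample count of 1000 are assumptions for illustration, and the article's actual splitting code may differ.

```python
import random


def split_indices(n_samples, seed=42):
    """Shuffle 0..n_samples-1, split 7:3, then split the 30% part 1:1."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)

    n_train = n_samples * 7 // 10       # 70% for training
    n_val = (n_samples - n_train) // 2  # half of the remainder for validation

    train_idx = indices[:n_train]
    val_idx = indices[n_train:n_train + n_val]
    test_idx = indices[n_train + n_val:]  # the rest is the final test set
    return train_idx, val_idx, test_idx


train_idx, val_idx, test_idx = split_indices(1000)
print(len(train_idx), len(val_idx), len(test_idx))
```

The returned lists can index the tensors directly, for example X[train_idx] and y[train_idx]. sklearn.model_selection.train_test_split, called twice, is a common alternative that can also stratify by label.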

