Bert系列模型情感分析任务对比实验

最新推荐文章于 2024-08-17 03:15:00 发布

福将～白鹿

最新推荐文章于 2024-08-17 03:15:00 发布

阅读量1.5k

点赞数

分类专栏： nlp 情感分析文章标签： bert 自然语言处理深度学习 Finbert Roberta

本文链接：https://blog.csdn.net/qq_41475067/article/details/122580278

版权

nlp 同时被 2 个专栏收录

5 篇文章 1 订阅

订阅专栏

情感分析

3 篇文章 0 订阅

订阅专栏

实验介绍

实验数据信息

实验数据来源：github
实验任务：情感分析，二分类任务
训练集大小：9600
验证集大小：1200
测试集大小：1200
样本均衡情况：均衡
参与对比的Bert系列模型包括：Bert、Finbert、Roberta

实验数据选型

文本长度
最小长度：4
最大长度：1992
平均长度：108

Bert

具体参数如下
训练命令及参数

python run_classifier.py --task_name=emlo --do_train=true --do_eval=true --data_dir=./ChnSentiCorp_data --vocab_file=./uncased/chine
se_L-12_H-768_A-12/vocab.txt --bert_config_file=./uncased/chinese_L-12_H-768_A-12/bert_config.json --init_checkpoint=./uncased/chinese_L-12_H-768_A-12/bert_model.ckpt --max_seq_length=64 --train_batch_size=16 --learning_rate=2e-5 --num_train_epochs=3.0 --output_dir=./tmp/bert_out/

预测命令及参数

python run_classifier.py --task_name=emlo --do_predict=true --data_dir=./ChnSentiCorp_data --vocab_file=./uncased/chinese_L-12_H-768_A-12/vocab.txt --bert_config_file=./uncased/chinese_L-12_H-768_A-12/bert_config.json --init_checkpoint=./tmp/bert_out/ --max_seq_length=64  --output_dir=./tmp/bert_emotion/

Finbert

具体参数如下
训练命令及参数

python run_classifier.py --task_name=emlo --do_train=true --do_eval=true --data_dir=./ChnSentiCorp_data --vocab_file=./uncased/FinBERT_L-12_H-768_A-12_tf/vocab.txt --bert_config_file=./uncased/FinBERT_L-12_H-768_A-12_tf/bert_config.json --init_checkpoint=./uncased/FinBERT_L-12_H-768_A-12_tf/bert_model.ckpt --max_seq_length=64 --train_batch_size=16 --learning_rate=2e-5 --num_train_epochs=3.0 --output_dir=./tmp/finbert_out/

预测命令及参数

python run_classifier.py --task_name=emlo --do_predict=true --data_dir=./ChnSentiCorp_data --vocab_file=./uncased/FinBERT_L-12_H-768_A-12_tf/vocab.txt --bert_config_file=./uncased/FinBERT_L-12_H-768_A-12_tf/bert_config.json --init_checkpoint=./tmp/finbert_out/ --max_seq_length=64  --output_dir=./tmp/finbert_emotion/

Roberta

具体参数如下
训练命令及参数

python run_classifier.py --task_name=emlo --do_train=true --do_eval=true --data_dir=./ChnSentiCorp_data --vocab_file=./uncased/roberta_zh_l12/vocab.txt --bert_config_file=./uncased/roberta_zh_l12/bert_config.json --init_checkpoint=./uncased/roberta_zh_l12/bert_model.ckpt --max_seq_length=64 --train_batch_size=16 --learning_rate=2e-5 --num_train_epochs=3.0 --output_dir=./tmp/roberta_out/

预测命令及参数

python run_classifier.py --task_name=emlo --do_predict=true --data_dir=./ChnSentiCorp_data --vocab_file=./uncased/roberta_zh_l12/vocab.txt --bert_config_file=./uncased/roberta_zh_l12/bert_config.json --init_checkpoint=./tmp/roberta_out/ --max_seq_length=64  --output_dir=./tmp/roberta_emotion/