Be aware, overflowing tokens are not returned for the setting you have chosen

最新推荐文章于 2023-02-25 15:20:17 发布

James-J

最新推荐文章于 2023-02-25 15:20:17 发布

阅读量4.6k

点赞数 6

分类专栏： transformer 文章标签： transformer

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/Jamesjjjjj/article/details/124325833

版权

transformer 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

原文提醒如下：

Be aware, overflowing tokens are not returned for the setting you have chosen, i.e. sequence pairs with the 'longest_first' truncation strategy. So the returned list will always be empty even if some tokens have been removed.

出现场景：

encode_tokens = tokenizer.encode_plus(
                            text="1 1 1 1 1", 
                            text_pair="2 2 2",
                            padding='max_length', 
                            max_length=10, 
                            truncation=True
                )

上面的两个文本段含有8个单词（5个1，3个2），加上1个CLS、2个SEP，一共切分出11个token，大于最大长度10，会被截断，所以提醒你。

解决方案：

1、你把上面的例子改成7个单词，不会截断，不会提醒。

2、其他博客提到加上下面一句话：

import transformers
transformers.logging.set_verbosity_error()

相当于改变报错级别。

mark一下，如有错误欢迎指正。

关注

6
点赞
踩
4

收藏

觉得还不错? 一键收藏
0
评论
Be aware, overflowing tokens are not returned for the setting you have chosen

原文提醒如下：Be aware, overflowing tokens are not returned for the setting you have chosen, i.e. sequence pairs with the 'longest_first' truncation strategy. So the returned list will always be empty even if some tokens have been removed.出现场景：encode_token
复制链接

扫一扫

专栏目录

James-J CSDN认证博客专家 CSDN认证企业博客

码龄7年

21: 原创

10万+: 周排名

84万+: 总排名

13万+: 访问

: 等级

1235: 积分

26: 粉丝

152: 获赞

26: 评论

295: 收藏

私信

关注

分类专栏

最新评论

Pytorch LSTM
余槿&流年: 您好可是我使用原本数据维度为[240,60,1]的股票价格数据，出来的数据还是[240,60,1] 我的网络代码如下： class LSTMNet(nn.Module): def __init__(self): super(LSTMNet, self).__init__() # 这里input_size是1是因为数据集为股票价格其中特征数量只有1个即为当前日期前60天的股票价格（就只有股票价格，所以是一个特征数量） self.lstm1 = nn.LSTM(input_size=1, hidden_size=80,dropout=0.2) # 这里输出[240,60,80] # 没太搞懂batch_first是啥意思 self.lstm2 = nn.LSTM(input_size=80, hidden_size=100,dropout=0.2) # 这里输出[240,60,100] self.linear1 = nn.Linear(100, 1) # 经过这里输出[240,60,1] def forward(self, x): print(x.shape) # 原始维度：[240,60,1] x, _ = self.lstm1(x) # 只保留最后一个时间步的输出，忽略hidden state print(x.shape) # 原始维度：[240,60,80] x, _ = self.lstm2(x) # 只保留最后一个时间步的输出，忽略hidden state print(x.shape) # 原始维度：[240,60,100] x = torch.squeeze(x, dim=0) # 去除维度为1的维度 print(x.shape) output = self.linear1(x) return output[:,-1,:]
Pytorch LSTM
James-J: 也可以用每个时间步的输出，如果是做股票预测的话是输入历史序列预测明天的未知结果，所以只需要取最后一个结果
Pytorch LSTM
嗳galaxy: 作者你好，我不太明白为什么forward函数总是返回最后一个时间步，预测出来的output不应该都是预测值吗，为什么只要每个batch的最后一个时间步？
简单理解LSTM
James-J: 输入维度的定义有些API可以自己选择，默认第一维度是数据条数，也就是5000，一般输入三维的形式，比如（数据条数，时间步长，一条数据的特征数量）
简单理解LSTM
AI兴趣爱好者: 5000,打错了

大家在看

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。