Warning during multi-GPU training with the allennlp framework: `UserWarning: RNN module weights are not part of single contiguous chunk of memory`

风筝迷了向

Published 2020-06-27 16:39:21

Category: Natural Language Processing. Tags: deep learning, pytorch, natural language processing

Article link: https://blog.csdn.net/nihao1621/article/details/106984580

Problem 1

During multi-GPU training with the allennlp framework, the following warning is printed: UserWarning: RNN module weights are not part of single contiguous chunk of memory. This means they need to be compacted at every call, possibly greately increasing memory usage. To compact weights again call flatten_parameters()

Solution 1

1. For an ordinary pytorch model, call flatten_parameters() inside the forward function:
def forward...
    if not hasattr(self, '_flattened'):
        self.rnn.flatten_parameters()  # rnn is the user-defined RNN module
        setattr(self, '_flattened', True)
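2. For a multi-GPU model in allennlp, self.ner_encoder is wrapped by the framework, so the underlying RNN has to be reached through its _module attribute:
def forward...
    # Note: the '_flattened' flag is never set here, so both calls run on every forward pass.
    if not hasattr(self.ner_encoder._module, '_flattened'):
        self.ner_encoder._module.flatten_parameters()
    if not hasattr(self.ner_decoder._module, '_flattened'):
        self.ner_decoder._module.flatten_parameters()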
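
Both variants above do the same thing: they find the RNN inside the model and call flatten_parameters() on it. A minimal sketch of a generic helper follows. The function name flatten_rnn_parameters is hypothetical and is not part of pytorch or allennlp. It assumes only that the model behaves like torch.nn.Module, that is, it exposes modules(), which yields the module itself and all registered submodules, so an RNN stored under a wrapper's _module attribute should be reached as well.

```python
def flatten_rnn_parameters(module):
    """Call flatten_parameters() on every submodule that defines it.

    `module` is assumed to behave like torch.nn.Module: modules() yields
    the module itself plus all of its descendants.
    Returns the number of submodules that were flattened.
    """
    flattened = 0
    for submodule in module.modules():
        # Only RNN-style modules (e.g. nn.LSTM, nn.GRU) define this method;
        # everything else is skipped.
        flatten = getattr(submodule, "flatten_parameters", None)
        if callable(flatten):
            flatten()
            flattened += 1
    return flattened
```

Under these assumptions, calling flatten_rnn_parameters(self) as the first statement of forward would replace the per-attribute checks, at the cost of walking the module tree on every call.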