snips示例tdnn训练报错

最新推荐文章于 2023-10-22 22:36:44 发布

一片橡树叶子的故事

最新推荐文章于 2023-10-22 22:36:44 发布

阅读量394

点赞数

分类专栏： Kaldi snips

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/hdm314/article/details/108337117

版权

Kaldi 同时被 2 个专栏收录

23 篇文章 1 订阅

订阅专栏

1 篇文章 0 订阅

订阅专栏

问题：

当训练tdnn时迭代到110次时报错

查看对应的log文件，显示

ERROR (nnet3-chain-train[5.5.0-]:AllocateNewRegion():cu-allocator.cc:519) Failed to allocate a memory region of 2502950912 bytes. Possibly this is due to sharing the GPU. Try switching the GPUs to exclusive mode (nvidia-smi -c 3) and using the option --use-gpu=wait to scripts like steps/nnet3/chain/train.py. Memory info: free:4773M, used:6244M, total:11018M, free/total:0.433275 CUDA error: 'out of memory'

解决办法：

修改GPU模式：

sudo nvidia-smi -c 3

修改run_e2e_tdnn.sh

然后重新运行脚本。

解决。

一片橡树叶子的故事

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
1
评论
snips示例tdnn训练报错

ERROR (nnet3-chain-train[5.5.0-]:AllocateNewRegion():cu-allocator.cc:519) Failed to allocate a memory region of 2502950912 bytes. Possibly this is due to sharing the GPU. Try switching the GPUs to exclusive mode (nvidia-smi -c 3) and using the opti...
复制链接

扫一扫

专栏目录

评论 1

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。