Python/Pytorch debugging

记录我的pytorch debug之路,持续更新。如果您google到了此文章,请使用Ctrl+F查找您的关键字。

1. OMP Error#15

OMP: Error #15: Initializing libiomp5md.dll, but found libiomp5md.dll already initialized.
OMP: Hint This means that multiple copies of the OpenMP runtime have been linked into the program. That is dangerous, since it can degrade performance or cause incorrect results. The best thing to do is to ensure that only a single OpenMP runtime is linked into the process, e.g. by avoiding static linking of the OpenMP runtime in any library. As an unsafe, unsupported, undocumented workaround you can set the environment variable KMP_DUPLICATE_LIB_OK=TRUE to allow the program to continue to execute, but that may cause crashes or silently produce incorrect results. For more information, please see http://www.intel.com/software/products/support/.

解决办法

        在文件开头插入这两句,

import os
os.environ['KMP_DUPLICATE_LIB_OK']='True'

原因

        不知道

参考

Training spaCy project file in Anaconda - OMP: Error #15: Initializing libiomp5md.dll, but found libiomp5md.dll already initialized. · Issue #7664 · explosion/spaCy (github.com)

File "/home/zhxk/.local/bin/yolo", line 8, in <module> sys.exit(entrypoint()) File "/home/zhxk/.local/lib/python3.8/site-packages/ultralytics/yolo/cfg/__init__.py", line 249, in entrypoint getattr(model, mode)(verbose=True, **overrides) File "/home/zhxk/.local/lib/python3.8/site-packages/ultralytics/yolo/engine/model.py", line 207, in train self.trainer.train() File "/home/zhxk/.local/lib/python3.8/site-packages/ultralytics/yolo/engine/trainer.py", line 183, in train self._do_train(int(os.getenv("RANK", -1)), world_size) File "/home/zhxk/.local/lib/python3.8/site-packages/ultralytics/yolo/engine/trainer.py", line 302, in _do_train self.loss, self.loss_items = self.criterion(preds, batch) File "/home/zhxk/.local/lib/python3.8/site-packages/ultralytics/yolo/v8/detect/train.py", line 76, in criterion return self.compute_loss(preds, batch) File "/home/zhxk/.local/lib/python3.8/site-packages/ultralytics/yolo/v8/detect/train.py", line 174, in __call__ _, target_bboxes, target_scores, fg_mask, _ = self.assigner( File "/home/zhxk/.local/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1051, in _call_impl return forward_call(*input, **kwargs) File "/home/zhxk/.local/lib/python3.8/site-packages/torch/autograd/grad_mode.py", line 28, in decorate_context return func(*args, **kwargs) File "/home/zhxk/.local/lib/python3.8/site-packages/ultralytics/yolo/utils/tal.py", line 97, in forward target_gt_idx, fg_mask, mask_pos = select_highest_overlaps(mask_pos, overlaps, self.n_max_boxes) File "/home/zhxk/.local/lib/python3.8/site-packages/ultralytics/yolo/utils/tal.py", line 44, in select_highest_overlaps if fg_mask.max() > 1: # one anchor is assigned to multiple gt_bboxes RuntimeError: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. Sentry is attempting to send 2 pending error messages Waiting up to 2 seconds Press Ctrl-C to quit THCudaCheck FAIL file=/pytorch/aten/src/THC/THCCachingHostAllocator.cpp line=278 error=710 : device-side assert triggered
07-13
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值