[debug] segmentation fault (core dumped) 错误
本文记录我在coding过程中遇到的一个比较没有头绪的错误。
一、背景
我的代码中使用了pytorch3d
、torch_geometric
,以及最基本的pytorch
。因为pytorch3d
、torch_geometric
对pytorch
环境都有要求,我先安装好了pyg
,然后又切换了pytorch
版本,所以就出现了未知的问题。
(partAssembly)
└─(22:10:17 on main ✹)──> python ./train_dynamic.py
[1] 3591666 segmentation fault (core dumped) python ./train_dynamic.py
二、解决办法
# 在python加上参数`-X faulthandler`
python -X faulthandler ./train_dynamic.py
这样就能看到具体的错误在哪里了
T640:pts/10)─┐
└─(22:12:12 on main ✹)──> python -X faulthandler ./train_dynamic.py \ ──(四,3月31)─┘
--exp_suffix '_Chair_new' \
--model_version 'model_score_based' \
--category 'Chair' \
--train_data_fn 'Chair.train.npy' \
--val_data_fn 'Chair.val.npy' \
--loss_weight_trans_l2 1.0 \
--loss_weight_rot_l2 0.0 \
--loss_weight_rot_cd 10 \
--loss_weight_shape_cd 1.0 \
--device cuda:0 \
--num_epoch_every_visu 200 \
--epochs 2000 \
--level 3 \
--overwrite \
--lr 1e-4 \
--batch_size 1 \
--num_workers 8 \
--num_batch_every_visu 0 \
--obj_png "no" \
--snr 0.20 \
--t0 1.0 \
--cor_steps 4 \
--cor_final_steps 4 \
--sampler 'PC'
Fatal Python error: Segmentation fault
Current thread 0x00001463d04d8740 (most recent call first):
File "/devdata/zry/anaconda3/envs/partAssembly/lib/python3.8/ctypes/__init__.py", line 381 in __init__
File "/devdata/zry/anaconda3/envs/partAssembly/lib/python3.8/site-packages/torch/_ops.py", line 110 in load_library
File "/devdata/zry/anaconda3/envs/partAssembly/lib/python3.8/site-packages/torch_sparse/__init__.py", line 19 in <module>
File "<frozen importlib._bootstrap>", line 219 in _call_with_frames_removed
File "<frozen importlib._bootstrap_external>", line 783 in exec_module
File "<frozen importlib._bootstrap>", line 671 in _load_unlocked
File "<frozen importlib._bootstrap>", line 975 in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 991 in _find_and_load
File "/devdata/zry/anaconda3/envs/partAssembly/lib/python3.8/site-packages/torch_geometric/data/data.py", line 9 in <module>
File "<frozen importlib._bootstrap>", line 219 in _call_with_frames_removed
File "<frozen importlib._bootstrap_external>", line 783 in exec_module
File "<frozen importlib._bootstrap>", line 671 in _load_unlocked
File "<frozen importlib._bootstrap>", line 975 in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 991 in _find_and_load
File "/devdata/zry/anaconda3/envs/partAssembly/lib/python3.8/site-packages/torch_geometric/data/__init__.py", line 1 in <module>
File "<frozen importlib._bootstrap>", line 219 in _call_with_frames_removed
File "<frozen importlib._bootstrap_external>", line 783 in exec_module
File "<frozen importlib._bootstrap>", line 671 in _load_unlocked
File "<frozen importlib._bootstrap>", line 975 in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 991 in _find_and_load
File "/devdata/zry/anaconda3/envs/partAssembly/lib/python3.8/site-packages/torch_geometric/__init__.py", line 4 in <module>
File "<frozen importlib._bootstrap>", line 219 in _call_with_frames_removed
File "<frozen importlib._bootstrap_external>", line 783 in exec_module
File "<frozen importlib._bootstrap>", line 671 in _load_unlocked
File "<frozen importlib._bootstrap>", line 975 in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 991 in _find_and_load
File "<frozen importlib._bootstrap>", line 219 in _call_with_frames_removed
File "<frozen importlib._bootstrap>", line 961 in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 991 in _find_and_load
File "./train_dynamic.py", line 27 in <module>
[1] 3593728 segmentation fault (core dumped) python -X faulthandler ./train_dynamic.py --exp_suffix '_Chair_new' 'Chair