CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
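Today, after re-collecting the data and rebuilding the dataset, I ran into this error during training. This time the cause was that the labels did not start at 0 and were not contiguous: they were 23-27 and 62-97 (41 distinct values).
The error was raised at this line: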
loss = loss_fn(out, labels.long())
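A quick check on the CPU before the loss call can surface this kind of problem with a readable message instead of a device-side assert. The sketch below assumes that loss_fn is a classification loss that expects class indices in the range [0, num_classes), and that the model has 41 outputs; neither is stated in the post, and find_invalid_labels is a hypothetical helper name.

```python
def find_invalid_labels(labels, num_classes):
    """Return the sorted distinct label values outside [0, num_classes)."""
    return sorted({int(l) for l in labels if not 0 <= int(l) < num_classes})


# A few raw label values taken from the ranges in the post (23-27, 62-97).
labels = [23, 27, 62, 97]
num_classes = 41  # assumption: one model output per distinct label

invalid = find_invalid_labels(labels, num_classes)
if invalid:
    print(f"labels outside [0, {num_classes}): {invalid}")
```

With a tensor of labels, the same check could be run on labels.tolist() before calling the loss.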
I later wrote a dictionary that maps each original label one-to-one onto 0-40, and that solved the problem.
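A mapping like the one described could look roughly like this. It is a minimal sketch and not the original code: the names label_to_index, index_to_label and remap are made up for illustration, and the raw labels are assumed to be every integer in 23-27 and 62-97.

```python
# Assumption: the raw labels are all integers in 23-27 and 62-97.
raw_labels = list(range(23, 28)) + list(range(62, 98))

# Map each distinct raw label to a contiguous index starting at 0.
label_to_index = {label: idx for idx, label in enumerate(sorted(set(raw_labels)))}

# Inverse mapping, to turn predicted indices back into the original labels.
index_to_label = {idx: label for label, idx in label_to_index.items()}


def remap(labels):
    """Convert raw labels to contiguous class indices."""
    return [label_to_index[int(l)] for l in labels]


print(len(label_to_index))        # 41
print(remap([23, 27, 62, 97]))    # [0, 4, 5, 40]
```

Applying remap when the dataset is built (rather than inside the training loop) keeps the labels passed to the loss within 0-40, and index_to_label recovers the original label from a prediction.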