报错:
Traceback (most recent call last):
File "/home/bingxing2/ailab/group/ai4agr/wzf/LLM/models/ChatGLM-Finetuning/train.py", line 20, in <module>
from utils import print_trainable_parameters, print_rank_0, to_device, set_random_seed, save_model
File "/home/bingxing2/ailab/group/ai4agr/wzf/LLM/models/ChatGLM-Finetuning/utils.py", line 15, in <module>
from transformers import set_seed
File "<frozen importlib._bootstrap>", line 1055, in _handle_fromlist
File "/home/bingxing2/ailab/scxlab0069/.conda/envs/llm_test/lib/python3.9/site-packages/transformers/utils/import_utils.py", line 1174, in __getattr__
module = self._get_module(self._class_to_module[name])
File "/home/bingxing2/ailab/scxlab0069/.conda/envs/llm_test/lib/python3.9/site-packages/transformers/utils/import_utils.py", line 1186, in _get_module
raise RuntimeError(
RuntimeError: Failed to import transformers.trainer_utils because of the following error (look up to see its traceback):
Traceback (most recent call last):
File "/home/bingxing2/ailab/scxlab0069/.conda/envs/llm_test/lib/python3.9/site-packages/tensorflow/python/pywrap_tensorflow.py", line 60, in <module>
from tensorflow.python._pywrap_tensorflow_internal import *
ImportError: libnccl.so.2: cannot open shared object file: No such file or directory
Failed to load the native TensorFlow runtime.
原因:没安装nccl库