subprocess.CalledProcessError: Command '['/home/***/anaconda3/envs/***/bin/python', '-u', 'pretrain_distribute.py', '--local_rank=0']' returned non-zero exit status 1.
Solution: pass find_unused_parameters=True when wrapping the model in DistributedDataParallel:
model = nn.parallel.DistributedDataParallel(model, device_ids=[self.local_rank], output_device=self.local_rank, find_unused_parameters=True)
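Note that the CalledProcessError line only reports that a worker process exited with a non-zero status. The worker's own traceback, printed before that line, is what names the actual cause, so it is worth checking that the fix above matches it. The sketch below shows this behaviour using only the standard library. The crashing worker command and the run_worker helper are hypothetical stand-ins for illustration and are not part of pretrain_distribute.py or of the PyTorch launcher.

```python
import subprocess
import sys

# Hypothetical stand-in for a training worker that crashes: it writes a
# message to stderr and exits with status 1.
worker_cmd = [
    sys.executable,
    "-c",
    "import sys; sys.stderr.write('worker failed\\n'); sys.exit(1)",
]


def run_worker(cmd):
    """Run cmd and return (returncode, stderr) instead of raising."""
    try:
        # check=True makes subprocess raise CalledProcessError on a
        # non-zero exit status, which is the error type seen above.
        subprocess.run(cmd, check=True, capture_output=True, text=True)
    except subprocess.CalledProcessError as err:
        # The exception carries only the exit status and captured output;
        # the real reason for the failure is in the child's stderr.
        return err.returncode, err.stderr
    return 0, ""


returncode, stderr = run_worker(worker_cmd)
print(returncode)
print(stderr.strip())
```

Here the exception itself says nothing beyond "exit status 1"; the useful information is whatever the child process wrote to stderr.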