Pytorch DDP would fail when using the parameters directly to calculate the loss.
These are my scripts:
# train.py:
class Model(nn.Module):
def __init__(self, params):
...
self.xnli_proj = nn.Linear
Pytorch DDP would fail when using the parameters directly to calculate the loss.
These are my scripts:
# train.py:
class Model(nn.Module):
def __init__(self, params):
...
self.xnli_proj = nn.Linear