torch.nn.parallel.DistributedDataParallel #33

Open · TousenKaname opened this issue Oct 31, 2023 · 0 comments

Comments

@TousenKaname

Excellent work! :)
But I hit a bug: when I run the first_stage code on multiple GPUs, the run blocks at the line below. I found that the hang is caused by model desynchronization across ranks.

criterion = torch.nn.parallel.DistributedDataParallel(criterion, device_ids=[device], broadcast_buffers=False, find_unused_parameters=True)
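For reference, here is a minimal sketch of the setup I mean, assuming one process per GPU launched via torchrun. `LearnableCriterion` is a hypothetical stand-in for the repo's criterion module, not its actual name:

```python
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

class LearnableCriterion(torch.nn.Module):
    """Hypothetical criterion with its own parameters (hence the DDP wrap)."""
    def __init__(self, dim=128):
        super().__init__()
        self.proj = torch.nn.Linear(dim, dim)

    def forward(self, feats, targets):
        return torch.nn.functional.mse_loss(self.proj(feats), targets)

# torchrun sets RANK/WORLD_SIZE/MASTER_ADDR, so the default env:// init works.
dist.init_process_group(backend="nccl")
local_rank = dist.get_rank() % torch.cuda.device_count()
device = torch.device("cuda", local_rank)

criterion = LearnableCriterion().to(device)
# The line that blocks for me: the DDP constructor broadcasts the module's
# parameters from rank 0, so every rank must reach it together; if the ranks
# have drifted apart (desynchronization), this collective call deadlocks.
criterion = DDP(criterion, device_ids=[device],
                broadcast_buffers=False, find_unused_parameters=True)
```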
