Fix PyTorchBackend
TP vs DP inputs distribution across replicas and shards
#48
Job | Run time |
---|---|
4m 2s | |
4m 2s |
PyTorchBackend
TP vs DP inputs distribution across replicas and shards
#48
Job | Run time |
---|---|
4m 2s | |
4m 2s |