📉 Optimize GRPO memory usage by redefining per_device_batch_size as generations per device
#6443
Annotations
2 errors
Initialize containers
The operation was canceled.