Skip to content

📉 Optimize GRPO memory usage by redefining per_device_batch_size as generations per device #6443

📉 Optimize GRPO memory usage by redefining per_device_batch_size as generations per device

📉 Optimize GRPO memory usage by redefining per_device_batch_size as generations per device #6443

Annotations

2 errors

build  /  build_pr_documentation

cancelled Feb 5, 2025 in 2m 3s