Skip to content

📉 Optimize GRPO memory usage by redefining per_device_batch_size as generations per device #6467

📉 Optimize GRPO memory usage by redefining per_device_batch_size as generations per device

📉 Optimize GRPO memory usage by redefining per_device_batch_size as generations per device #6467

build  /  build_pr_documentation

succeeded Feb 6, 2025 in 3m 37s