📉 Optimize GRPO memory usage by redefining per_device_batch_size as generations per device
#7292
Annotations
1 error
Test with pytest
Process completed with exit code 2.