📉 Optimize GRPO memory usage by redefining per_device_batch_size
as generations per device
#7296
Annotations
2 errors
The run was canceled by @qgallouedec.
|
Test with pytest
The operation was canceled.
|
Loading