📉 Optimize GRPO memory usage by redefining per_device_batch_size
as generations per device#2776
Merged
qgallouedec merged 18 commits intomainfrom distribute_batch_grpoFeb 6, 2025
+107-68
Commits
Commits on Feb 5, 2025
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
Commits on Feb 6, 2025
- committed
- committed
- authored
Merge branch 'distribute_batch_grpo' of https://github.com/huggingface/trl into distribute_batch_grpo
committed- committed
- committed
- committed
- committed
- committed