Skip to content

Actions: huggingface/trl

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,280 workflow runs
4,280 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Dynamically load LoRA weights when using vLLM
Tests #7198: Pull request #2730 opened by tgaddair
February 1, 2025 23:38 Action required tgaddair:fix-peft-vllm-grpo-lora
February 1, 2025 23:38 Action required
GRPO: Set max_model_len when initializing vLLM instance
Tests #7197: Pull request #2728 synchronize by mirceapricop
February 1, 2025 18:58 27m 4s mirceapricop:patch-1
February 1, 2025 18:58 27m 4s
GRPO: Set max_model_len when initializing vLLM instance
Tests #7196: Pull request #2728 opened by mirceapricop
February 1, 2025 16:01 Action required mirceapricop:patch-1
February 1, 2025 16:01 Action required
fix: Fix typo in filename in ultrafeedback-prompt.py (#2716)
Tests #7195: Commit a325a0e pushed by qgallouedec
February 1, 2025 13:53 25m 16s main
February 1, 2025 13:53 25m 16s
February 1, 2025 13:52 25m 59s
⚠️ Fix Attention Masking in GRPO
Tests #7192: Pull request #2708 synchronize by qgallouedec
January 31, 2025 20:57 Action required andyl98:fix-grpo-logits-calc
January 31, 2025 20:57 Action required
WIP: RLOOV2
Tests #7188: Pull request #2724 opened by mnoukhov
January 31, 2025 19:53 Action required mnoukhov:rloov2
January 31, 2025 19:53 Action required
🏰 num_logits_to_keep to logits_to_keep (#2721)
Tests #7187: Commit 1c35a48 pushed by qgallouedec
January 31, 2025 19:19 27m 4s main
January 31, 2025 19:19 27m 4s
⚠️ Fix Attention Masking in GRPO
Tests #7186: Pull request #2708 synchronize by andyl98
January 31, 2025 18:57 24m 50s andyl98:fix-grpo-logits-calc
January 31, 2025 18:57 24m 50s
⚠️ Fix Attention Masking in GRPO
Tests #7185: Pull request #2708 synchronize by andyl98
January 31, 2025 18:27 Action required andyl98:fix-grpo-logits-calc
January 31, 2025 18:27 Action required
🏰 num_logits_to_keep to logits_to_keep
Tests #7184: Pull request #2721 synchronize by qgallouedec
January 31, 2025 17:44 25m 9s logits_to_keep
January 31, 2025 17:44 25m 9s
⚠️ Fix Attention Masking in GRPO
Tests #7183: Pull request #2708 synchronize by andyl98
January 31, 2025 17:42 Action required andyl98:fix-grpo-logits-calc
January 31, 2025 17:42 Action required
⚠️ Fix Attention Masking in GRPO
Tests #7182: Pull request #2708 synchronize by andyl98
January 31, 2025 16:45 Action required andyl98:fix-grpo-logits-calc
January 31, 2025 16:45 Action required
⚠️ Fix Attention Masking in GRPO
Tests #7181: Pull request #2708 synchronize by andyl98
January 31, 2025 16:42 Action required andyl98:fix-grpo-logits-calc
January 31, 2025 16:42 Action required
⚠️ Fix Attention Masking in GRPO
Tests #7180: Pull request #2708 synchronize by andyl98
January 31, 2025 16:42 Action required andyl98:fix-grpo-logits-calc
January 31, 2025 16:42 Action required
📖 Nit fix in SFT Documentation (#2722)
Tests #7179: Commit 2ce36ae pushed by qgallouedec
January 31, 2025 15:46 26m 21s main
January 31, 2025 15:46 26m 21s
⚠️ Fix Attention Masking in GRPO
Tests #7178: Pull request #2708 synchronize by qgallouedec
January 31, 2025 15:38 Action required andyl98:fix-grpo-logits-calc
January 31, 2025 15:38 Action required
🏰 num_logits_to_keep to logits_to_keep
Tests #7177: Pull request #2721 opened by qgallouedec
January 31, 2025 14:43 27m 14s logits_to_keep
January 31, 2025 14:43 27m 14s
Improve GRPO example (#2717)
Tests #7175: Commit bf69191 pushed by lewtun
January 31, 2025 11:04 24m 48s main
January 31, 2025 11:04 24m 48s
⚠️ Fix Attention Masking in GRPO
Tests #7174: Pull request #2708 synchronize by kashif
January 31, 2025 09:41 Action required andyl98:fix-grpo-logits-calc
January 31, 2025 09:41 Action required
📖 Add GRPOTrainer to README.md (#2713)
Tests #7172: Commit 265663a pushed by qgallouedec
January 31, 2025 09:30 24m 22s main
January 31, 2025 09:30 24m 22s