Skip to content

Actions: huggingface/trl

Build PR Documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,698 workflow runs
3,698 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

🫷 Include stop token in policy model's generation_config
Build PR Documentation #6104: Pull request #2528 synchronize by qgallouedec
January 22, 2025 09:49 3m 24s dawidm:ppo-stop-token
January 22, 2025 09:49 3m 24s
🫷 Include stop token in policy model's generation_config
Build PR Documentation #6103: Pull request #2528 synchronize by qgallouedec
January 22, 2025 08:34 3m 28s dawidm:ppo-stop-token
January 22, 2025 08:34 3m 28s
🫷 Include stop token in policy model's generation_config
Build PR Documentation #6102: Pull request #2528 synchronize by qgallouedec
January 22, 2025 08:28 3m 14s dawidm:ppo-stop-token
January 22, 2025 08:28 3m 14s
🫷 Include stop token in policy model's generation_config
Build PR Documentation #6101: Pull request #2528 synchronize by qgallouedec
January 22, 2025 08:16 Action required dawidm:ppo-stop-token
January 22, 2025 08:16 Action required
🫷 Include stop token in policy model's generation_config
Build PR Documentation #6100: Pull request #2528 synchronize by qgallouedec
January 22, 2025 08:10 Action required dawidm:ppo-stop-token
January 22, 2025 08:10 Action required
🫷 Include stop token in policy model's generation_config
Build PR Documentation #6099: Pull request #2528 synchronize by qgallouedec
January 22, 2025 08:05 Action required dawidm:ppo-stop-token
January 22, 2025 08:05 Action required
🫷 Include stop token in policy model's generation_config
Build PR Documentation #6098: Pull request #2528 synchronize by qgallouedec
January 22, 2025 08:05 Action required dawidm:ppo-stop-token
January 22, 2025 08:05 Action required
add "_prepare_fsdp" for DPOTrainer
Build PR Documentation #6097: Pull request #2539 synchronize by faaany
January 22, 2025 01:19 Action required faaany:prepare-fsdp
January 22, 2025 01:19 Action required
add "_prepare_fsdp" for DPOTrainer
Build PR Documentation #6096: Pull request #2539 synchronize by faaany
January 22, 2025 00:54 Action required faaany:prepare-fsdp
January 22, 2025 00:54 Action required
⚡ Add uv installation instructions
Build PR Documentation #6091: Pull request #2601 synchronize by qgallouedec
January 21, 2025 21:04 4m 13s stevhliu:patch-1
January 21, 2025 21:04 4m 13s
[SFT VLM] Added support for Molmo models via standalone script sft_vlm_molmo
Build PR Documentation #6090: Pull request #2236 synchronize by sergiopaniego
January 21, 2025 20:56 Action required sergiopaniego:sft_vlm_molmo
January 21, 2025 20:56 Action required
⚡ Add uv installation instructions
Build PR Documentation #6089: Pull request #2601 synchronize by qgallouedec
January 21, 2025 20:52 3m 48s stevhliu:patch-1
January 21, 2025 20:52 3m 48s
⚡ Add uv installation instructions
Build PR Documentation #6088: Pull request #2601 synchronize by qgallouedec
January 21, 2025 20:49 2m 36s stevhliu:patch-1
January 21, 2025 20:49 2m 36s
vLLM for GRPO
Build PR Documentation #6087: Pull request #2600 synchronize by qgallouedec
January 21, 2025 19:17 3m 25s grpo_vllm
January 21, 2025 19:17 3m 25s
⚡ Add uv installation instructions
Build PR Documentation #6086: Pull request #2601 opened by stevhliu
January 21, 2025 19:02 3m 33s stevhliu:patch-1
January 21, 2025 19:02 3m 33s
vLLM for GRPO
Build PR Documentation #6085: Pull request #2600 opened by qgallouedec
January 21, 2025 18:21 3m 46s grpo_vllm
January 21, 2025 18:21 3m 46s
[Not meant to be merged] Support branch for Trainer refactor
Build PR Documentation #6084: Pull request #2594 synchronize by qgallouedec
January 21, 2025 16:13 3m 46s trainer-refactor-support-branch
January 21, 2025 16:13 3m 46s
Add generation caching in TextEnvironment and fix bugs in TextEnvironment
Build PR Documentation #6083: Pull request #2556 synchronize by konrad-gerlach
January 21, 2025 15:08 Action required konrad-gerlach:text_environment_caching
January 21, 2025 15:08 Action required
[Liger] liger DPO support
Build PR Documentation #6082: Pull request #2568 synchronize by kashif
January 21, 2025 13:58 3m 28s liger-dpo
January 21, 2025 13:58 3m 28s
[SFT] add token accuracy metric
Build PR Documentation #6080: Pull request #2597 synchronize by kashif
January 21, 2025 12:33 3m 33s mean_token_accuracy
January 21, 2025 12:33 3m 33s
[SFT] add token accuracy metric
Build PR Documentation #6079: Pull request #2597 synchronize by kashif
January 21, 2025 09:39 3m 39s mean_token_accuracy
January 21, 2025 09:39 3m 39s
[SFT] add token accuracy metric
Build PR Documentation #6078: Pull request #2597 opened by kashif
January 21, 2025 09:17 3m 16s mean_token_accuracy
January 21, 2025 09:17 3m 16s