Actions: vllm-project/vllm

pre-commit

3,526 workflow runs

[V1][Spec Decode] Ngram Spec Decode (#12193)
pre-commit #3526: Commit 80f63a3 pushed by WoosukKwon
February 16, 2025 02:05 4m 54s main
[V1] Get input tokens from scheduler
pre-commit #3525: Pull request #13339 synchronize by WoosukKwon
February 16, 2025 01:18 4m 42s v1-scheduler-input
[V1] Optimize handling of sampling metadata and req_ids list
pre-commit #3524: Pull request #13244 synchronize by njhill
February 16, 2025 00:29 5m 4s njhill:sampler-streamline
[Model][Speculative Decoding] DeepSeek MTP spec decode
pre-commit #3523: Pull request #12755 synchronize by luccafong
February 15, 2025 22:45 4m 43s luccafong:ds_mtp
[V1][Spec Decode] Ngram Spec Decode
pre-commit #3522: Pull request #12193 synchronize by LiuXiaoxuanPKU
February 15, 2025 22:14 4m 51s LiuXiaoxuanPKU:ngram
[V1][Spec Decode] Ngram Spec Decode
pre-commit #3521: Pull request #12193 synchronize by LiuXiaoxuanPKU
February 15, 2025 22:01 4m 43s LiuXiaoxuanPKU:ngram
[V1][Spec Decode] Ngram Spec Decode
pre-commit #3520: Pull request #12193 synchronize by LiuXiaoxuanPKU
February 15, 2025 21:48 4m 43s LiuXiaoxuanPKU:ngram
[V1] Get input tokens from scheduler
pre-commit #3519: Pull request #13339 opened by WoosukKwon
February 15, 2025 21:10 4m 45s v1-scheduler-input
[Bugfix] Pin xgrammar to 0.1.11
pre-commit #3518: Pull request #13338 opened by mgoin
February 15, 2025 20:51 4m 38s xgrammar-0.1.11
[V1][Metrics] Handle preemptions
pre-commit #3515: Pull request #13169 synchronize by markmc
February 15, 2025 17:12 4m 37s markmc:metrics-v1-preemptions
[Kernel] moe wna16 cuda kernel
pre-commit #3513: Pull request #13321 synchronize by jinzhen-lin
February 15, 2025 17:09 4m 35s jinzhen-lin:moe_wna16_cuda_kernel
[V1][Metrics] Support vllm:cache_config_info
pre-commit #3512: Pull request #13299 synchronize by markmc
February 15, 2025 17:08 4m 41s markmc:metrics-v1-cache-config-info
[Kernel] moe wna16 cuda kernel
pre-commit #3511: Pull request #13321 synchronize by jinzhen-lin
February 15, 2025 16:51 4m 32s jinzhen-lin:moe_wna16_cuda_kernel
[Kernel] moe wna16 cuda kernel
pre-commit #3510: Pull request #13321 synchronize by jinzhen-lin
February 15, 2025 16:48 4m 52s jinzhen-lin:moe_wna16_cuda_kernel
[Bugfix]: DeepseekR1 model load fails with weights tied error
pre-commit #3509: Pull request #13335 synchronize by cennn
February 15, 2025 16:40 4m 46s cennn:deepseek-r1-offload
[VLM] Support multimodal inputs for Florence-2 models
pre-commit #3506: Pull request #13320 synchronize by Isotr0py
February 15, 2025 16:22 2m 33s Isotr0py:florence-2
[Kernel] moe wna16 cuda kernel
pre-commit #3505: Pull request #13321 synchronize by jinzhen-lin
February 15, 2025 16:21 4m 47s jinzhen-lin:moe_wna16_cuda_kernel
[Bugfix]: DeepseekR1 model load fails with weights tied error
pre-commit #3503: Pull request #13335 opened by cennn
February 15, 2025 15:08 4m 37s cennn:deepseek-r1-offload
[Doc] [2/N] Add Fuyu E2E example for multimodal processor (#13331)
pre-commit #3502: Commit 367cb8c pushed by simon-mo
February 15, 2025 15:06 4m 45s main