[DRAFT] Vllm integration #1628
base: main
Conversation
I'm really looking forward to this integration! Just out of curiosity, do you think using optimum or torch.compile as a generation backend is possible? @vwxyzjn
Yes, I think torch.compile would be an option, but with the caveat that currently only a few model architectures are supported.
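For reference, a minimal sketch of what torch.compile as a generation path could look like (illustrative only, not code from this PR; the checkpoint name is just an example):

```python
# Minimal sketch of torch.compile as a generation path (illustrative only;
# "gpt2" is just an example checkpoint).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Compile the forward pass; generate() then runs the compiled forward at each
# decoding step (it may retrace as the sequence length grows).
model.forward = torch.compile(model.forward)

inputs = tokenizer("The quick brown fox", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=20, do_sample=False)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```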
Hi, are there any updates? Thanks!
UPDATE 7/7/2024: After chatting with @lewtun, we'd like to see if vLLM is willing to support vllm-project/vllm#6189 officially before merging this PR, as it may cause confusion for users.
This PR adds a vLLM backend for generation. Preliminary testing shows it is ~8x faster: given 80 minutes of training, the run with HF generation completed 2,650 episodes, whereas the run with vLLM generation completed 16k episodes.
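For context, generation with the vLLM backend boils down to something like the following (a minimal sketch; the checkpoint, prompt, and sampling settings are illustrative, not the exact values used in this PR):

```python
# Minimal sketch of vLLM-based generation (illustrative; checkpoint and sampling
# settings are examples, not the exact values used in this PR).
from vllm import LLM, SamplingParams

llm = LLM(model="EleutherAI/pythia-1b-deduped")
sampling_params = SamplingParams(temperature=0.7, top_p=1.0, max_tokens=53)

prompts = ["SUBREDDIT: r/AskReddit\nTITLE: ...\nPOST: ...\nTL;DR:"]
outputs = llm.generate(prompts, sampling_params)
completions = [output.outputs[0].text for output in outputs]
```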
Note that your mileage may vary with different hardware / generation lengths. For example, with 1B models on the TL;DR task, vLLM does not seem to provide much of a speed benefit, likely due to the short generation length.
Note that we have to use our custom vLLM build to achieve precise device placement (so that we can place the vLLM instance on the 8th GPU). See vwxyzjn/vllm#1
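For illustration, the intended topology looks roughly like this; the `device` keyword below is an assumption about what the custom build enables, since stock vLLM does not expose precise per-GPU placement:

```python
# Hypothetical sketch: the trainer occupies GPUs 0-6 and the vLLM engine is pinned
# to GPU 7. The `device` argument is an assumption about what the custom build
# (vwxyzjn/vllm#1) provides; stock vLLM does not let you pick an exact GPU this way.
from vllm import LLM

llm = LLM(
    model="EleutherAI/pythia-1b-deduped",  # example checkpoint, for illustration
    device="cuda:7",                       # hypothetical: put the engine on the 8th GPU
    gpu_memory_utilization=0.9,
)
```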