Add example of integration with vLLM #435

rlouf · 2023-12-14T12:36:29Z

This example currently does not work with multiple prompts, because self.fsm_state will be updated every time self.__call__ is called. So with several prompts, self.fsm_states will be updated, at each step, as many times as there are sequences. This can be avoided by having self.fsm_states as a DefaultDict and passing the seq_id to logits_processor.

Looking at the code, it might be a good idea to revert back to the original tokenizer interface.

rlouf force-pushed the vllm-integration branch 4 times, most recently from 844f467 to e0f9e76 Compare December 15, 2023 17:51

rlouf added 3 commits December 17, 2023 09:20

Add example of integration with vLLM

7e5c524

Exclude examples from pre-commit mypy check

1282f08

Remove unused packages from dependencies

5eada6a

rlouf force-pushed the vllm-integration branch from e0f9e76 to 5eada6a Compare December 17, 2023 08:20

rlouf merged commit 7ee827f into main Dec 17, 2023
4 checks passed

rlouf deleted the vllm-integration branch December 17, 2023 08:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add example of integration with vLLM #435

Add example of integration with vLLM #435

rlouf commented Dec 14, 2023 •

edited

Loading

Add example of integration with vLLM #435

Add example of integration with vLLM #435

Conversation

rlouf commented Dec 14, 2023 • edited Loading

rlouf commented Dec 14, 2023 •

edited

Loading