Constrained decoding #1243

viktor-ferenczi · 2023-10-01T17:40:39Z

Changes:

Added allowed_token_ids to SamplingParams (configuration)
Enforced allowed_token_ids in Sampler.forward by squashing the logits of disallowed tokens

It allows the user to generate only specific tokens.

Please note, that it is the caller's responsibility to add the EOS and and additional stop tokens to the list of allowed tokens, but this is required only for open ended generations (not limited to 1 or a few tokens). It may be error prone, so we may want to add them automatically to the allowed tokens list.

Idea is to make a separate call for each segment of the generation which has different allowed_token_ids. The tokens known for sure can be efficiently "skipped" by appending them at the end of prompt for the next call (segment). The end of segments can be detected by adding them temporarily to stop_token_ids or by detecting them on the fly from a streaming generation. It gives the caller maximum control over the schema.

TODO:

Write an example.
Add support for regexp based constraint on the text returned (prompt+generated or generated only). It would alleviate the need for multiple generation calls in most cases, which is also more friendly with REST API calls. Regexp validation is the basis of the outlines library, study that library and figure out how can we implement this efficiently.
Consider OpenAI API integration: Unsure whether we should do it or how to do it.

Constrained generation libraries we may want to provide adapters for:

The adapters could be separate libraries or just examples. They should go into separate PRs (or repo).

Issue: #288

jpeig · 2023-10-05T12:24:31Z

Did you check out LMQL?

viktor-ferenczi · 2023-10-05T21:17:09Z

Did you check out LMQL?

Please give me a link or some hint where to find it.

iiLaurens · 2023-10-10T16:49:21Z

Did you check out LMQL?

Please give me a link or some hint where to find it.

https://lmql.ai/

viktor-ferenczi · 2023-10-17T00:51:38Z

LMQL sounds like a great library to support. I was thinking about regex, but that's very crude and difficult to work with.

viktor-ferenczi · 2023-10-17T01:31:10Z

See the related LMQL ticket

I guess best would be to work together with them to finish LMQL support.

DarkLight1337 · 2024-11-09T06:49:58Z

Closing as superseded by #8252.

Allowed tokens

4e2bbd3

viktor-ferenczi mentioned this pull request Oct 1, 2023

Support for Constrained decoding #288

Closed

Formatting and coding style fixes

2bec500

viktor-ferenczi marked this pull request as draft October 2, 2023 19:20

DarkLight1337 closed this Nov 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Constrained decoding #1243

Constrained decoding #1243

viktor-ferenczi commented Oct 1, 2023 •

edited

Loading

jpeig commented Oct 5, 2023

viktor-ferenczi commented Oct 5, 2023

iiLaurens commented Oct 10, 2023

viktor-ferenczi commented Oct 17, 2023

viktor-ferenczi commented Oct 17, 2023

DarkLight1337 commented Nov 9, 2024

Constrained decoding #1243

Constrained decoding #1243

Conversation

viktor-ferenczi commented Oct 1, 2023 • edited Loading

jpeig commented Oct 5, 2023

viktor-ferenczi commented Oct 5, 2023

iiLaurens commented Oct 10, 2023

viktor-ferenczi commented Oct 17, 2023

viktor-ferenczi commented Oct 17, 2023

DarkLight1337 commented Nov 9, 2024

viktor-ferenczi commented Oct 1, 2023 •

edited

Loading