Add `Outline`s #1364

yvan-sraka · 2025-01-08T08:23:04Z

Fix #1346, WDYT about the integration with outlines.prompt? (Here, the `template` argument is just a function that returns a `str`, so it can be either an `@outlines.prompt`-decorated function or a regular one.)

tests/test_outline.py

outlines/outline.py

rlouf · 2025-01-08T13:00:05Z

> Fix #1346, WDYT about the integration with outlines.prompt?

We should allow functions that are not decorated, such as

def build_prompt(a: int) -> str:
    return f"What is {a} squared?"

Because we do not want to force the use of templates on users.

rlouf · 2025-01-08T13:01:24Z

Also you can deprecate the code in function.py as this is going to replace it.

outlines/outline.py

rlouf · 2025-01-08T13:06:25Z

outlines/outline.py

+        prompt = self.template(*args)
+
+        # Generate the response using the model
+        response = self.model.generate(prompt)


This should use one of `outlines.generate.json`, `outlines.generate.regex`, etc. The interface is necessarily going to be brittle for now. You can take a look at the v1 branch to see how the ongoing refactor will make this more robust.

Hmm, okay, but which one of those should I use by default? And how should users switch to another one? By adding a fourth optional argument, e.g. `generation_kind`? WDYT?

I just realized that I may have misunderstood the issue: this PR should probably infer the right regex for the user's requested output type. E.g., if a user asks for an `int`, I should pass `outlines.generate.regex` something like `^[+-]?[0-9]+$`. If that's right, I'm unsure how to scale it to arbitrary return types. Should we define a list of supported basic return types? Using `ast.literal_eval` was the most flexible solution I came up with, but I'm not sure how well it handles custom data structures. In those cases (when the return type isn't one of the straightforward Python builtins), I think we should check that users provide types that implement deserialization from JSON, and then use `outlines.generate.json`. WDYT?
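To make the "list of supported basic return types" idea concrete, here is a minimal, stdlib-only sketch. `TYPE_PATTERNS` and `parse_as` are hypothetical names, not part of Outlines; only the `int` pattern comes from the comment above, and the `bool` entry is an illustrative assumption. It only shows how a type could map to a regex and a parser once text has been generated.

```python
import re

# Hypothetical table: output type -> (regex constraining the text, parser).
# The int pattern is the one proposed above; the bool entry is an assumption.
TYPE_PATTERNS = {
    int: (r"^[+-]?[0-9]+$", int),
    bool: (r"^(True|False)$", lambda text: text == "True"),
}


def parse_as(output_type, text):
    """Check `text` against the pattern for `output_type`, then parse it."""
    if output_type not in TYPE_PATTERNS:
        raise TypeError(f"Unsupported output type: {output_type!r}")
    pattern, parser = TYPE_PATTERNS[output_type]
    if re.fullmatch(pattern, text) is None:
        raise ValueError(f"{text!r} does not match {pattern!r}")
    return parser(text)
```

One design note: the table uses `int` rather than `ast.literal_eval` as the parser because the proposed pattern accepts leading zeros such as `007`, which `ast.literal_eval` rejects as a syntax error.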

Let's only accept Pydantic models / JSON Schema strings for now and use outlines.generate.json. We'll be able to easily generalise after we push the refactor in the v1 branch.

I've updated the PR (and the usage example in the docstring). Does it still look like what we want?

yvan-sraka · 2025-01-08T13:49:48Z

> Fix #1346, WDYT about the integration with outlines.prompt?

> We should allow functions that are not decorated, such as
>
>     def build_prompt(a: int) -> str:
>         return f"What is {a} squared?"
>
> Because we do not want to force the use of templates on users.

That's already how this PR works. I just wanted to confirm that this was the intended design and not an omission!

> Also you can deprecate the code in function.py as this is going to replace it.

Should I handle that in this PR or a separate one? By deprecate, do you mean adding a user-facing warning when it's used, rather than removing it entirely?

rlouf · 2025-01-08T15:14:32Z

> Should I handle that in this PR or a separate one? By deprecate, do you mean adding a user-facing warning when it's used, rather than removing it entirely?

We can do it in this PR. Let's add a user-facing warning and announce that it will be removed in favour of the `Outline` interface in v1.0.

And the `output_type` should now be a Pydantic model or a JSON Schema str

torymur · 2025-01-10T16:01:30Z

outlines/function.py

@@ -10,6 +10,9 @@
    from outlines.generate.api import SequenceGenerator
    from outlines.prompts import Prompt

+# Raising a warning here caused all the tests to fail…
+print("The 'function' module is deprecated and will be removed in a future release.")
+


Maybe a better place would be to put the warning in `__post_init__` of `Function`?

Moving my previous comment here for visibility:

It would be nice to specify which release (1.0.0) so people can pin their version below it.
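A minimal sketch of the two suggestions above, combined: the warning is emitted from `__post_init__` and names the release. The `Function` class here is a stand-in with an illustrative `name` field, not the real class in `outlines/function.py`, and the exact message wording is an assumption based on this thread.

```python
import warnings
from dataclasses import dataclass


@dataclass
class Function:
    """Stand-in for the class in outlines/function.py; the field is illustrative."""

    name: str

    def __post_init__(self):
        # Warn when an instance is created, not when the module is imported.
        # stacklevel=3 attributes the warning to the caller's line rather than
        # to __post_init__ or the dataclass-generated __init__.
        warnings.warn(
            "The 'function' module is deprecated and will be removed in "
            "outlines 1.0.0. Use the `Outline` interface instead.",
            DeprecationWarning,
            stacklevel=3,
        )
```

With this placement, only code that actually instantiates `Function` sees the warning, so merely importing the module stays silent.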

torymur · 2025-01-10T16:16:16Z

outlines/outline.py

+    """
+
+    def __init__(self, model, template, output_type):
+        if not (isinstance(output_type, str) or issubclass(output_type, BaseModel)):


`isinstance(output_type, str)` seems like an oversimplified check that the string is a JSON Schema. Even though v1 will handle this much more gracefully, perhaps it's worth verifying, even in a temporary solution, that the `str` is an actual JSON Schema?

It's surprisingly difficult to find a tool that validates a JSON Schema. I would be very happy if you found one!
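Short of full validation, a cheap sanity check could at least reject strings that are not JSON objects. The sketch below uses only the standard library; `parse_schema_string` is a hypothetical helper name, and it does not check the schema against a metaschema. As far as I know, the third-party `jsonschema` package offers a `check_schema` class method on its validator classes for that purpose, but it is deliberately not used here so the example stays dependency-free.

```python
import json


def parse_schema_string(schema: str) -> dict:
    """Return the decoded schema, or raise ValueError if it cannot be one."""
    try:
        parsed = json.loads(schema)
    except json.JSONDecodeError as exc:
        raise ValueError(f"output_type is not valid JSON: {exc}") from exc
    # A useful JSON Schema document decodes to a JSON object.
    if not isinstance(parsed, dict):
        raise ValueError("a JSON Schema string must decode to a JSON object")
    return parsed
```

This catches the most likely mistake (passing a prompt or an arbitrary string as `output_type`) without claiming the schema itself is well-formed.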

torymur · 2025-01-10T16:20:02Z

tests/test_outline.py

+        )
+        result = outline_instance(3)
+
+    assert result["result"] == 6


I understand this is still a draft, but it's worth mentioning: tests for the various error flows are missing.
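As one possible shape for such tests, here is a self-contained sketch of error-flow checks for the `output_type` validation. Everything in it is an assumption for illustration: `BaseModel` is a stand-in for `pydantic.BaseModel` so the example runs without Pydantic, `check_output_type` is a hypothetical helper mirroring the constructor check in the diff, and raising `TypeError` is a guess at the intended behaviour.

```python
class BaseModel:
    """Stand-in for pydantic.BaseModel so the sketch runs without Pydantic."""


def check_output_type(output_type):
    """Hypothetical helper mirroring the constructor check in the diff."""
    # issubclass raises its own TypeError when its first argument is not a
    # class, so guard with isinstance(output_type, type) first.
    is_model = isinstance(output_type, type) and issubclass(output_type, BaseModel)
    if not (isinstance(output_type, str) or is_model):
        raise TypeError("output_type must be a Pydantic model or a JSON Schema string")


class Answer(BaseModel):
    pass


def test_accepts_model_and_schema_string():
    check_output_type(Answer)
    check_output_type('{"type": "object"}')


def test_rejects_instances_and_unrelated_classes():
    for bad in (42, Answer(), int, None):
        try:
            check_output_type(bad)
        except TypeError as exc:
            assert "output_type" in str(exc)
        else:
            raise AssertionError(f"{bad!r} should have been rejected")
```

The test functions use plain `assert` statements, so a runner such as pytest can collect them, and they can also be called directly.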

outlines/function.py

rlouf · 2025-01-10T16:19:54Z

outlines/outline.py

+    """
+
+    def __init__(self, model, template, output_type):
+        if not (isinstance(output_type, str) or issubclass(output_type, BaseModel)):


It's surprisingly difficult to find a tool that validates a JSON Schema. I would be very happy if you found one!

torymur · 2025-01-10T16:20:34Z

outlines/outline.py

+        self.generator = generate.json(model, output_type)
+
+    def __call__(self, *args):
+        prompt = self.template(*args)


Maybe worth supporting `**kwargs` as well?
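A minimal sketch of what forwarding keyword arguments could look like. The class below is an illustrative stand-in, not the `Outline` from this PR: it takes any callable as `generator` instead of building one with `generate.json`, and `echo` and the `unit` parameter are hypothetical, added so the example runs offline.

```python
class Outline:
    """Illustrative stand-in: `generator` is any callable taking a prompt string."""

    def __init__(self, generator, template):
        self.generator = generator
        self.template = template

    def __call__(self, *args, **kwargs):
        # Forward positional and keyword arguments unchanged to the template.
        prompt = self.template(*args, **kwargs)
        return self.generator(prompt)


def build_prompt(a: int, unit: str = "") -> str:
    return f"What is {a}{unit} squared?"


def echo(prompt: str) -> str:
    """Stands in for a real generator; returns the prompt unchanged."""
    return prompt


outline = Outline(echo, build_prompt)
```

With this signature, `outline(3)` and `outline(a=3, unit=" m")` both reach `build_prompt` with the arguments the caller supplied.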

yvan-sraka requested review from torymur and rlouf January 8, 2025 08:23

yvan-sraka self-assigned this Jan 8, 2025

yvan-sraka linked an issue Jan 8, 2025 that may be closed by this pull request

Add Outlines #1346

Open

yvan-sraka commented Jan 8, 2025

tests/test_outline.py (outdated)

yvan-sraka force-pushed the 1346-add-outlines branch from 8cb052d to 26b14b9 Compare January 8, 2025 12:47

rlouf reviewed Jan 8, 2025

outlines/outline.py

rlouf reviewed Jan 8, 2025

yvan-sraka force-pushed the 1346-add-outlines branch from 26b14b9 to e328144 Compare January 8, 2025 13:45

yvan-sraka requested a review from rlouf January 8, 2025 13:53

yvan-sraka force-pushed the 1346-add-outlines branch 3 times, most recently from 9a2a79a to 1faa76f Compare January 10, 2025 11:02

Add Outlines

ac77adb

yvan-sraka force-pushed the 1346-add-outlines branch 2 times, most recently from ddd5409 to 214a2f3 Compare January 10, 2025 11:07

yvan-sraka marked this pull request as ready for review January 10, 2025 11:07

yvan-sraka force-pushed the 1346-add-outlines branch from 214a2f3 to b2512e9 Compare January 10, 2025 11:09

Use generate.json in Outlines

9a9acba

And the `output_type` should now be a Pydantic model or a JSON Schema str

yvan-sraka force-pushed the 1346-add-outlines branch from b2512e9 to 9a9acba Compare January 10, 2025 11:43

torymur reviewed Jan 10, 2025

rlouf reviewed Jan 10, 2025

torymur reviewed Jan 10, 2025


