response_schema is not working when using typing.TypedDict #560

mwigh · 2024-09-18T10:36:19Z

Description of the bug:

When using response_schema in generate_content the response schema is not respected if the response_schema is set using a <class 'typing_extensions._TypedDictMeta'> object

Actual vs expected behavior:

I expect the response schema to be respected, since according to the documentation it should:
https://ai.google.dev/gemini-api/docs/structured-output?lang=python#generate-json

The issue is probably that the fields are not set as "required".
Can that be done somehow?

Any other information you'd like to share?

Code where I explicit set the schema but also explicit asks it to not respect the schema (but it still should, according to documentation):

import typing_extensions as typing
import google.generativeai as genai
import os

class Recipe(typing.TypedDict):
    recipe_name: str
    ingredients: list[str]

genai.configure(api_key=os.environ["API_KEY"])
model = genai.GenerativeModel("gemini-1.5-pro-latest")
result = model.generate_content(
    "List one popular cookie recipe. Response should be a JSON string on format {receipt_name: str}. There should only be one key in the response.",
    generation_config=genai.GenerationConfig(
        response_mime_type="application/json", response_schema=Recipe
    ),
)
print(result.text)

The text was updated successfully, but these errors were encountered:

Hamza-nabil · 2024-09-18T17:55:16Z

This issue seems related to #541

Gunand3043 · 2024-09-27T05:11:56Z

@MarkDaoust ,there is an issue with passing text prompt schema and model configuration schema at the same time, Please find the gist here for reference .Thanks!

mwigh · 2024-09-27T07:34:17Z

@Gunand3043
The problem is probably that GenerationConfig does not set the fields as required.

Your: "Example 2: (working) - Supply a schema through model configuration"
is not always working for all cases, that is how I found this issue

kinto-b · 2024-09-27T22:03:42Z

Yep there's a reprex of the Example 2 type failing here: #541 (comment)

HNx1 · 2024-10-02T01:08:00Z

EDIT: Edited to reflect below conversation

Hi, just wanted to share an example: https://colab.research.google.com/drive/1ZdyrENeHWh30Ijws0o8l2Od5M4i56HjD?usp=sharing

In this case, I tested with both the python SDK and the REST API. The REST API call works all the time, and the python SDK does not work.

I tested with both gemini pro and flash and neither produced structured outputs in the sdk, but did with the rest api.

Extra test if excluding response_schema as suggested:
This now works 100% of the time, on both the sdk and the rest api. .

Needs to set required fields as required

mwigh · 2024-10-02T12:22:21Z

I only mean that it seems that it is loosing required field when creating the response schema, and that code is also in this SDK.

See this commit that fixes it:
mwigh@f30c36d

HNx1 · 2024-10-02T13:07:48Z

Ah I see. So if you supply all properties as 'required' to the REST API, it works.

Which suggests you are correct that this is a local issue to the SDK missing required flags.

I corrected the openapi schema builder function in my colab to now set required on all properties in the object, and now it works again for the REST API.

MarkDaoust · 2024-10-02T16:08:48Z

@mwigh - Nice, I'll have to investigate this deeper. Can you send that commit as a PR?

MarkDaoust · 2024-10-02T16:22:35Z

It's interesting that Schema supports both "required" and "nullable" flags. That's a subtle distinction.

mwigh · 2024-10-02T16:56:06Z

Yes I noticed that too. But I think this was the cleanest way of solving the issue, without creating new functions or modifying _build_schema(). And it is pretty obvious what it does so someone can change it further if needed in future

Needs to set required fields as required Handle cases dataclasses, TypedDict and Pydantic models

mwigh · 2024-10-03T21:29:41Z

@MarkDaoust
Ref your comment in the PR.
I think you need to decide on what should be allowed or not, e.g.:

class RecipeTypeDict(typing.TypedDict):
    recipe_name: str
    ingredient: str
    yyy: Optional[str]
    xxx: NotRequired[str]


@dataclass
class RecipeDataClass():
    recipe_name: str
    ingredient: str
    xxx: Optional[str]  #  should = None be allowed?


class ReceiptPydantic(pydantic.BaseModel):
    recipe_name: str
    ingredient: str
    xxx: Optional[str]  # should = None be allowed?

In none of RecipeDataClass or ReceiptPydantic are xxx considered optional. And if you make them optional by setting default values, then the code fail d/t 'default' exists in the dict. I showed it in this commit: https://github.com/mwigh/generative-ai-python/commit/2801c74190b3385572db0a02fc5cf747aa952558

bakeryproducts · 2024-10-21T15:09:51Z

Im not sure where exactly is the issue with pydantic schema, but @ MarkDaoust pop's required keys out of the schema here . In _schema_for_function they eventually get back there but in _schema_for_class it seems they are not?

MaKTaiL · 2024-10-25T23:24:46Z

I'm also having an issue with response_schema not working as intended. It appears to ignore all keys of a Dict except the first one. This is my class:

class SubtitleObject(typing.TypedDict):
    index: str
    content: str

This is the response I'm getting:

[{'index': '0'}, {'index': '1'}, {'index': '2'}, {'index': '3'}, {'index': '4'}, {'index': '5'}, {'index': '6'}, {'index': '7'}, {'index': '8'}, {'index': '9'}, {'index': '10'}]

Even the official docs show the same bug (it's missing the ingredients key):

popovidis · 2024-10-29T12:38:23Z

Having this issue also.

antoo05-11 · 2024-11-03T13:41:57Z

I recently got this issue too :((

PeterMinin · 2024-11-19T23:09:20Z

Here's a workaround for now, before #580 is merged. You can tweak it if some of your fields are optional.

from google.generativeai.types import generation_types


def force_required_fields(generation_config) -> dict:
    """
    Returns a copy with all fields in the schema marked as required.
    Workaround for https://github.com/google-gemini/generative-ai-python/issues/560.
    """
    generation_config = generation_types.to_generation_config_dict(generation_config)
    schema = generation_config["response_schema"]
    schema.required = list(schema.properties)
    return generation_config


# Usage:
generation_config = genai.GenerationConfig(
    response_mime_type="application/json",
    response_schema=MyClass,
)
generation_config = force_required_fields(generation_config)

Gunand3043 self-assigned this Sep 19, 2024

Gunand3043 added status:triaged Issue/PR triaged to the corresponding sub-team component:python sdk Issue/PR related to Python SDK type:help Support-related issues type:bug Something isn't working and removed type:help Support-related issues labels Sep 19, 2024

Gunand3043 assigned MarkDaoust and unassigned Gunand3043 Sep 27, 2024

mwigh pushed a commit to mwigh/generative-ai-python that referenced this issue Oct 2, 2024

Issue google-gemini#560

f30c36d

Needs to set required fields as required

mwigh mentioned this issue Oct 2, 2024

Issue #560 #580

Open

6 tasks

mwigh pushed a commit to mwigh/generative-ai-python that referenced this issue Oct 3, 2024

Issue google-gemini#560

2801c74

Needs to set required fields as required Handle cases dataclasses, TypedDict and Pydantic models

Surya2k1 linked a pull request Nov 28, 2024 that will close this issue

Fix response_schema with TypedDict #640

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

response_schema is not working when using typing.TypedDict #560

response_schema is not working when using typing.TypedDict #560

mwigh commented Sep 18, 2024

Hamza-nabil commented Sep 18, 2024

Gunand3043 commented Sep 27, 2024

mwigh commented Sep 27, 2024

kinto-b commented Sep 27, 2024

HNx1 commented Oct 2, 2024 •

edited

Loading

mwigh commented Oct 2, 2024

HNx1 commented Oct 2, 2024

MarkDaoust commented Oct 2, 2024 •

edited

Loading

MarkDaoust commented Oct 2, 2024

mwigh commented Oct 2, 2024

mwigh commented Oct 3, 2024

bakeryproducts commented Oct 21, 2024

MaKTaiL commented Oct 25, 2024 •

edited

Loading

popovidis commented Oct 29, 2024

antoo05-11 commented Nov 3, 2024

PeterMinin commented Nov 19, 2024 •

edited

Loading

response_schema is not working when using typing.TypedDict #560

response_schema is not working when using typing.TypedDict #560

Comments

mwigh commented Sep 18, 2024

Description of the bug:

Actual vs expected behavior:

Any other information you'd like to share?

Hamza-nabil commented Sep 18, 2024

Gunand3043 commented Sep 27, 2024

mwigh commented Sep 27, 2024

kinto-b commented Sep 27, 2024

HNx1 commented Oct 2, 2024 • edited Loading

mwigh commented Oct 2, 2024

HNx1 commented Oct 2, 2024

MarkDaoust commented Oct 2, 2024 • edited Loading

MarkDaoust commented Oct 2, 2024

mwigh commented Oct 2, 2024

mwigh commented Oct 3, 2024

bakeryproducts commented Oct 21, 2024

MaKTaiL commented Oct 25, 2024 • edited Loading

popovidis commented Oct 29, 2024

antoo05-11 commented Nov 3, 2024

PeterMinin commented Nov 19, 2024 • edited Loading

HNx1 commented Oct 2, 2024 •

edited

Loading

MarkDaoust commented Oct 2, 2024 •

edited

Loading

MaKTaiL commented Oct 25, 2024 •

edited

Loading

PeterMinin commented Nov 19, 2024 •

edited

Loading