genai: Fixed nested pydantic structures recursion #658

Shahar-Y · 2024-12-19T16:18:24Z

PR Description

While working with gregpr07/browser-use, @MahlerTom and I found several bugs in langchain handling of nested Pydantic structured in with_structured_output. Specifically, the issues occurred in _get_properties_from_schema method in libs\genai\langchain_google_genai\_function_utils.py.

The current implementation did not recursively handle cases of nested glm.Type.OBJECT and only took care of the first layer.
In cases where pydantic fields are required (no default value is provided), the method did not properly add these fields to the required list.
In cases where an object of type glm.Type.OBJECT had empty properties, there was an error that properties of type glm.Type.OBJECT must not be empty.

Relevant issues

Fixes #657,
May be related to langchain-ai/langchain#24225

Type

🐛 Bug Fix

Changes

As mentioned above:

The previous implementation did not recursively handle cases of nested glm.Type.OBJECT and only took care of the first layer. We fixed the recursion with:

if properties_item.get("type_") == glm.Type.OBJECT:
    if v.get('anyOf') and isinstance(v['anyOf'], list) and isinstance(v['anyOf'][0], dict):
        v = v['anyOf'][0]
    v_properties = v.get("properties")
    if v_properties:
        properties_item["properties"] = _get_properties_from_schema_any(v_properties)

However, we found that when sending nested pedantic objects with required fields as expected output structure, like the case in the following method from gregpr07/browser-use below:

@time_execution_async('--get_next_action')
	async def get_next_action(self, input_messages: list[BaseMessage]) -> AgentOutput:
		"""Get next action from LLM based on current state"""

		structured_llm = self.llm.with_structured_output(self.AgentOutput, include_raw=True)
		response: dict[str, Any] = await structured_llm.ainvoke(input_messages)  # type: ignore

		parsed: AgentOutput = response['parsed']

		self._log_response(parsed)
		self.n_steps += 1

		return parsed

Here, when we observed the response['parsed'] was often None and response['parsing_error'] included validation errors for missing fields that were required.

To fix that error, we realized we needed to add the required list when parsing the pydantic objects. We added the nested if to our fix:

if properties_item.get("type_") == glm.Type.OBJECT:
    if v.get('anyOf') and isinstance(v['anyOf'], list) and isinstance(v['anyOf'][0], dict):
        v = v['anyOf'][0]
    v_properties = v.get("properties")
    if v_properties:
        properties_item["properties"] = _get_properties_from_schema_any(v_properties)
        if isinstance(v_properties, dict):
            properties_item["required"] = [
                k for k, v in v_properties.items() if "default" not in v
            ]

But still got this error:

Unexpected error: Invalid argument provided to Gemini: 400 * GenerateContentRequest.tools[0].function_declarations[0].parameters.properties[action].properties[go_back].properties: should be non-empty for OBJECT type

This was because in cases where an object of type glm.Type.OBJECT had empty properties, as was the case with the 'go_back' action in gregpr07/browser-use, there was an error that properties of type glm.Type.OBJECT must not be empty. We changed that by adding dummy type in such cases adding the last else statement:

if properties_item.get("type_") == glm.Type.OBJECT:
    if v.get('anyOf') and isinstance(v['anyOf'], list) and isinstance(v['anyOf'][0], dict):
        v = v['anyOf'][0]
    v_properties = v.get("properties")
    if v_properties:
        properties_item["properties"] = _get_properties_from_schema_any(v_properties)
        if isinstance(v_properties, dict):
            properties_item["required"] = [
                k for k, v in v_properties.items() if "default" not in v
            ]
    else:
        # Providing dummy type for object without properties
        properties_item["type_"] = glm.Type.STRING

Testing

Tested on the code described in #657

Note

PR title and description are appropriate
Necessary tests and documentation have been added
Lint and tests pass successfully
The following additional guidelines are adhered to:
- Optional dependencies are imported within functions
- No unnecessary dependencies added to pyproject.toml files (except those required for unit tests)
- PR doesn't touch more than one package
- Changes are backwards compatible

…ctured output of the llm

MagMueller · 2024-12-20T16:31:53Z

Great work!

Shahar-Y · 2024-12-21T13:59:18Z

Hi @lkuligin , could you please review this pull request when you get a chance?
Let me know if there's anything that needs clarification. Thanks!

lkuligin · 2024-12-24T05:33:56Z

thanks for your contribution! could you add a unit test for a nested Pydantic, please?

Shahar-Y · 2024-12-25T09:22:17Z

thanks for your contribution! could you add a unit test for a nested Pydantic, please?

Hi, @MahlerTom and I added several tests regarding the PR.
Please let us know if anything else is missing 👍

MagMueller · 2024-12-27T08:13:15Z

Thank you @lkuligin!
Could you create a new release for the pip package?

MahlerTom · 2024-12-27T09:56:53Z

Thanks @lkuligin! Could you please do a rebase and merge instead of squash and merge? I'd like my commits to be included :)

Shahar-Y and others added 4 commits December 19, 2024 15:31

fix: adding "required" to the sub properties in order to enforce stru…

befcbdf

…ctured output of the llm

added fix to deal with anyof

d07a09c

Fix: Providing dummy type for object without properties

0adac0e

after make format

c246124

Shahar-Y mentioned this pull request Dec 19, 2024

[BUG] Gemini does not work with Browser-Use browser-use/browser-use#104

Open

Merge branch 'langchain-ai:main' into main

140ab2e

Shahar-Y mentioned this pull request Dec 22, 2024

genai: Fix handling of optional arrays in tool input #661

Open

Shahar-Y and others added 2 commits December 25, 2024 10:00

Added tests for _get_properties_from_schema

eb1c23f

fixed linting errors in tests

dbaf111

Merge branch 'main' into main

4c3c209

lkuligin approved these changes Dec 27, 2024

View reviewed changes

lkuligin merged commit 4a0a2d3 into langchain-ai:main Dec 27, 2024
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

genai: Fixed nested pydantic structures recursion #658

genai: Fixed nested pydantic structures recursion #658

Shahar-Y commented Dec 19, 2024

MagMueller commented Dec 20, 2024

Shahar-Y commented Dec 21, 2024 •

edited

Loading

lkuligin commented Dec 24, 2024

Shahar-Y commented Dec 25, 2024

MagMueller commented Dec 27, 2024

MahlerTom commented Dec 27, 2024

genai: Fixed nested pydantic structures recursion #658

genai: Fixed nested pydantic structures recursion #658

Conversation

Shahar-Y commented Dec 19, 2024

PR Description

Relevant issues

Type

Changes

Testing

Note

MagMueller commented Dec 20, 2024

Shahar-Y commented Dec 21, 2024 • edited Loading

lkuligin commented Dec 24, 2024

Shahar-Y commented Dec 25, 2024

MagMueller commented Dec 27, 2024

MahlerTom commented Dec 27, 2024

Shahar-Y commented Dec 21, 2024 •

edited

Loading