feat: support LLM process document file #10966

hjlarry · 2024-11-22T03:48:00Z

Summary

Tip

Close issue syntax: Fixes #<issue number> or Resolves #<issue number>, see documentation for more details.

Currently, lots of LLM( gemini, sonnet ...) can directly process document, and make user's chat context based on these documents. This PR aimed to support this feature in a dify agent app. For the chatflow app, maybe this PR can resolve.

ChangeList

Backend

add DocumentPromptMessageContent and more model features, copied from feat: Allow using file variables directly in the LLM node and support more file types. #10679
support gemini series models receive document
add video feature to support video models, document feature to support document models

Frontend

add a switch button to control whether allow upload document
for not support video feature's LLM, open vision config will not allow upload video
chat page whether display file upload button depends on file.allowed_file_types has any value

remaining issues

the vision settings is strange, Resolution only affect Image type files, Upload Method and Upload Limit affect all type files.

Screenshots

Checklist

Important

Please review the checklist below before submitting your pull request.

This change requires a documentation update, included: Dify Document
I understand that this PR may be closed in case there was no previous discussion or issues. (This doesn't apply to typos!)
I've added a test for each change that was introduced, and I tried as much as possible to make a single atomic change.
I've updated the documentation accordingly.
I ran dev/reformat(backend) and cd web && npx lint-staged(frontend) to appease the lint gods

laipz8200 · 2024-11-22T07:34:14Z

Thank you for this awesome contribution! Some changes in #10679 are still in processing, I'll review this PR after #10679 is merged.

laipz8200 · 2024-11-22T07:52:42Z

the vision settings is strange, Resolution only affect Image type files, Upload Method and Upload Limit affect all type files.

I think this resolution config should be removed in the future.

laipz8200 · 2024-11-22T09:06:46Z

Hi @hjlarry! #10679 is merged, could you please sync the code with the main branch?

hjlarry · 2024-11-22T09:34:13Z

Hi @hjlarry! #10679 is merged, could you please sync the code with the main branch?

Done :)

laipz8200 · 2024-11-22T09:57:18Z

Screen.Recording.2024-11-22.at.5.56.05.PM.mov

hjlarry · 2024-11-22T10:14:10Z

Screen.Recording.2024-11-22.at.5.56.05.PM.mov

seems the icon has been overwrite by the merge action, please try again

hjlarry added 4 commits November 22, 2024 11:27

support LLM process document file

5c81370

support LLM process document file

c565180

fix yaml features config

93556ec

fix document icon

5041300

hjlarry marked this pull request as ready for review November 22, 2024 06:04

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. ⚙️ feat:model-runtime 💪 enhancement New feature or request labels Nov 22, 2024

crazywoola requested review from laipz8200 and zxhlyh November 22, 2024 07:06

hjlarry added 2 commits November 22, 2024 17:28

Merge branch 'p138' into p137

8292f2f

remove duplicated DocumentPromptMessageContent

95ccc9f

hjlarry and others added 5 commits November 22, 2024 18:14

add icon

6717ab7

feat(token_buffer_memory): Allow all types of file in memory.

b122e69

feat(llm_node): raise an error if model not support the file.

ed52139

fix(message_entities): Use str instead of StrEnum

e760870

test(llm_node): Remove unused tests

5465a8d

laipz8200 approved these changes Nov 22, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Nov 22, 2024

laipz8200 merged commit 08ac368 into langgenius:main Nov 22, 2024
7 checks passed

fdb02983rhy mentioned this pull request Nov 22, 2024

fix(gpt-4o-audio-preview): Remove the vision feature #10932

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support LLM process document file #10966

feat: support LLM process document file #10966

hjlarry commented Nov 22, 2024 •

edited

Loading

laipz8200 commented Nov 22, 2024

laipz8200 commented Nov 22, 2024

laipz8200 commented Nov 22, 2024

hjlarry commented Nov 22, 2024

laipz8200 commented Nov 22, 2024

hjlarry commented Nov 22, 2024

feat: support LLM process document file #10966

feat: support LLM process document file #10966

Conversation

hjlarry commented Nov 22, 2024 • edited Loading

Summary

ChangeList

Backend

Frontend

remaining issues

Screenshots

Checklist

laipz8200 commented Nov 22, 2024

laipz8200 commented Nov 22, 2024

laipz8200 commented Nov 22, 2024

hjlarry commented Nov 22, 2024

laipz8200 commented Nov 22, 2024

hjlarry commented Nov 22, 2024

hjlarry commented Nov 22, 2024 •

edited

Loading