
feat: Allow using file variables directly in the LLM node and support more file types. #10679

Merged
merged 36 commits into main from refactor/prompts-convert-in-llm-node
Nov 22, 2024

Conversation

laipz8200
Member

@laipz8200 laipz8200 commented Nov 14, 2024

Summary

Close #10681
Fixes #10179

Screenshots

Before: After:
... ...

Checklist

Important

Please review the checklist below before submitting your pull request.

  • This change requires a documentation update, included in the Dify Document.
  • I understand that this PR may be closed in case there was no previous discussion or issues. (This doesn't apply to typos!)
  • I've added a test for each change that was introduced, and I tried as much as possible to make a single atomic change.
  • I've updated the documentation accordingly.
  • I ran `dev/reformat` (backend) and `cd web && npx lint-staged` (frontend) to appease the lint gods

@laipz8200 laipz8200 force-pushed the refactor/prompts-convert-in-llm-node branch 3 times, most recently from a567869 to acbb678 Compare November 18, 2024 07:35
@laipz8200 laipz8200 requested a review from iamjoel November 18, 2024 07:58
@laipz8200 laipz8200 marked this pull request as ready for review November 18, 2024 07:58
@dosubot dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. ⚙️ feat:model-runtime 💪 enhancement New feature or request labels Nov 18, 2024
iamjoel
iamjoel previously approved these changes Nov 18, 2024
@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Nov 18, 2024
@laipz8200 laipz8200 marked this pull request as draft November 18, 2024 08:20
@laipz8200 laipz8200 force-pushed the refactor/prompts-convert-in-llm-node branch from 4f1dd5c to 2d8a720 Compare November 18, 2024 10:13
@hjlarry hjlarry mentioned this pull request Nov 19, 2024
- Changed input type from list to Sequence for prompt messages to allow more flexible input types.
- Improved compatibility with functions expecting different iterable types.
- Replaced list with Sequence for more flexible content type.
- Improved type consistency by importing from collections.abc.
- Simplified app configuration by removing the 'frozen' parameter since it is no longer needed.
- Ensures more flexible handling of config attributes.
- Changed the Faker version from caret constraint to tilde constraint for compatibility.
- Updated poetry.lock for changes in pyproject.toml content.
- Improved flexibility by using Sequence instead of list, allowing for broader compatibility with different types of sequences.
- Helps future-proof the method signature by leveraging the more generic Sequence type.
- Changed 'prompt_messages' parameter from list to Sequence for broader input type compatibility.
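The list-to-`Sequence` changes listed above can be sketched as follows. The function and the use of plain strings are simplified stand-ins, not Dify's actual signatures; the point is the parameter annotation:

```python
from collections.abc import Sequence


def count_tokens(prompt_messages: Sequence[str]) -> int:
    # Annotating with Sequence (from collections.abc) instead of list
    # lets callers pass lists, tuples, or any other sequence type
    # without converting first, which is the compatibility gain the
    # commits describe.
    return sum(len(message.split()) for message in prompt_messages)
```

Both `count_tokens(["a b", "c"])` and `count_tokens(("a b", "c"))` type-check and run, which is exactly what a `list[str]` annotation would forbid for the tuple case.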
Updated the log and text properties in segments to return empty strings instead of the segment value. This change prevents potential leakage of sensitive data by ensuring only non-sensitive information is logged or transformed into text, addressing potential security and privacy concerns.
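A minimal sketch of that behavior. The class shape is illustrative, not Dify's actual segment hierarchy:

```python
class FileSegment:
    """Holds a file value (e.g. a signed URL or file payload).

    The log and text properties return empty strings so the raw value
    never leaks into logs or rendered prompt text, per the commit above.
    """

    def __init__(self, value):
        self.value = value

    @property
    def log(self) -> str:
        # Deliberately empty: do not echo the underlying file value.
        return ""

    @property
    def text(self) -> str:
        # Deliberately empty: file content is not representable as text.
        return ""
```

Callers that need the real value still read `segment.value` explicitly; anything that stringifies segments for display gets nothing.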
Replaced redundant variables in test setup to streamline and align usage of fake data, enhancing readability and maintainability. Adjusted image URL variables to utilize consistent references, ensuring uniformity across test configurations. Also, corrected context variable naming for clarity. No functional impact, purely a refactor for code clarity.
laipz8200 and others added 21 commits November 22, 2024 15:30
Refactored LLM node tests to enhance clarity and maintainability by creating test scenarios for different file input combinations. This restructuring replaces repetitive code with a more concise approach, improving test coverage and readability.

No functional code changes were made.

References: #123, #456
Refactor test scenarios in LLMNode unit tests by introducing a new `LLMNodeTestScenario` class to enhance readability and consistency. This change simplifies the test case management by encapsulating scenario data and reduces redundancy in specifying test configurations. Improves test clarity and maintainability by using a structured approach.
Ensure that messages are only created from non-empty text segments, preventing potential issues with empty content.
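The non-empty-segment guard can be sketched like this (a hypothetical helper, not the actual Dify function):

```python
def to_messages(text_segments):
    # Skip empty strings so no message is created with empty content,
    # which is the failure mode the commit above guards against.
    return [
        {"role": "user", "content": segment}
        for segment in text_segments
        if segment
    ]
```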

test: add scenario for file variable handling

Introduce a test case for scenarios involving prompt templates with file variables, particularly images, to improve reliability and test coverage. Updated `LLMNodeTestScenario` to use `Sequence` and `Mapping` for more flexible configurations.

Closes #123, relates to #456.
Updated image processing logic to check for model support of vision features, preventing errors when handling images with models that do not support them. Added a test scenario to validate behavior when vision features are absent. This ensures robust image handling and avoids unexpected behavior during image-related prompts.
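The vision-feature check can be sketched as follows. The enum and function are simplified assumptions about the shape of the real code:

```python
from enum import Enum


class ModelFeature(Enum):
    VISION = "vision"


def build_contents(text, image_contents, model_features):
    # Only attach image content when the model declares vision support;
    # text-only models receive just the text part, avoiding runtime
    # errors on image-related prompts.
    contents = [{"type": "text", "text": text}]
    if ModelFeature.VISION in model_features:
        contents.extend(image_contents)
    return contents
```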
Adds the workflow run object to the database session to guarantee it is persisted prior to refreshing its state. This change resolves potential issues with data consistency and integrity when the workflow run is accessed after operations. References issue #123 for more context.
Expanded the system to handle document types across different modules and introduced video and audio content handling in model features. Adjusted the prompt message logic to conditionally process content based on available features, enhancing flexibility in media processing. Added comprehensive error handling in `LLMNode` for better runtime resilience. Updated YAML configuration and unit tests to reflect these changes.
Added a check to ensure that files have an extension before processing to avoid potential errors. Updated unit tests to reflect this requirement by including extensions in test data. This prevents exceptions from being raised due to missing file extension information.
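The extension guard might look like this (field names are illustrative, not the exact Dify file model):

```python
def ensure_extension(file: dict) -> str:
    # Fail fast with a clear error when a file has no extension,
    # instead of raising an obscure exception deeper in processing.
    extension = file.get("extension")
    if not extension:
        raise ValueError(
            f"file {file.get('filename')!r} has no extension"
        )
    return extension
```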
Extended the `ConfigPromptItem` component to support file variables by including the `isSupportFileVar` prop. Updated `useConfig` hooks to accept `arrayFile` variable types for both input and memory prompt filtering. This enhancement allows handling of file data types seamlessly, improving flexibility in configuring prompts.
Removed the `_render_basic_message` function and integrated its logic directly into the `LLMNode` class. This reduces redundancy and simplifies the handling of message templates by utilizing `convert_template` more directly. This change enhances code readability and maintainability.
Moved prompt handling functions out of the `LLMNode` class to improve modularity and separation of concerns. This refactor allows better reuse and testing of prompt-related functions. Adjusted existing logic to fetch queries and handle context and memory configurations more effectively. Updated tests to align with the new structure and ensure continued functionality.
Introduce `filterJinjia2InputVar` to enhance variable filtering, specifically excluding `arrayFile` types from Jinja2 input variables. This adjustment improves the management of variable types, aligning with expected input capacities and ensuring more reliable configurations. Additionally, support for file variables is enabled in relevant components, broadening functionality and user options.
…sion management

Replaces direct database operations with SQLAlchemy Session context to manage workflow_run more securely and effectively.
Introduces a new DocumentPromptMessageContent class to extend the variety of supported prompt message content types. This enhancement allows encoding document data with specific formats and handling them as part of prompt messages, improving versatility in content manipulation.
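A content type along those lines might be shaped like this. The field names are assumptions for illustration, not the exact attributes of Dify's `DocumentPromptMessageContent`:

```python
from dataclasses import dataclass


@dataclass
class DocumentPromptMessageContent:
    # Illustrative fields: how the payload is encoded, what kind of
    # document it is, and the encoded bytes themselves.
    encode_format: str  # e.g. "base64"
    mime_type: str      # e.g. "application/pdf"
    data: str           # encoded document payload
```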
Introduces support for document files in prompt message content conversion. Refactors encoding logic by unifying base64 encoding, simplifying it and removing redundancy.

Improves flexibility and maintainability of file handling in preparation for expanded multimedia support.
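The unified encoding step reduces to a single helper of roughly this shape (a sketch; the real function name and call sites are assumptions):

```python
import base64


def encode_file_base64(raw: bytes) -> str:
    # One shared base64 step for every file category (image, audio,
    # document) instead of duplicated per-type encoding paths.
    return base64.b64encode(raw).decode("ascii")
```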
Extends file type handling to include documents in message processing.
This enhances the application's ability to process a wider range of files.
Introduces support for handling document content, specifically PDFs, within prompt messages, enhancing model capabilities with a new feature.

Allows dynamic configuration of headers based on document presence in prompts, improving flexibility for user interactions.
Removes the exception message content duplication in the logger to prevent unnecessary redundancy, since the exception details are already captured by logger.exception.
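The logging change follows the standard-library behavior that `logger.exception` appends the full traceback (including the exception message) automatically, so the log call only needs a short context string:

```python
import logging

logger = logging.getLogger(__name__)


def run_node(node):
    try:
        return node()
    except Exception:
        # logger.exception logs at ERROR level and appends the current
        # exception's type, message, and traceback, so repeating str(e)
        # in the message would be redundant.
        logger.exception("Failed to run node")
        raise
```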
@laipz8200 laipz8200 force-pushed the refactor/prompts-convert-in-llm-node branch from 93f5b64 to 0619e9a Compare November 22, 2024 07:30
@laipz8200 laipz8200 marked this pull request as ready for review November 22, 2024 07:31
@laipz8200 laipz8200 changed the title Refactor/prompts-convert-in-llm-node feat: Allow using file variables directly in the LLM node and support more file types. Nov 22, 2024
@crazywoola crazywoola merged commit c5f7d65 into main Nov 22, 2024
15 checks passed
@crazywoola crazywoola deleted the refactor/prompts-convert-in-llm-node branch November 22, 2024 08:30