A huge of fail request indexing-status when upload knowledge file #3207

trongnichi · 2024-04-09T04:43:33Z

Self Checks

This is only for bug reports; if you would like to ask a question, please head to Discussions.
I have searched for existing issues, including closed ones.
I confirm that I am using English to submit this report (I have read and agree to the Language Policy).
Please do not modify this template :) and fill in all the required fields.

Dify version

0.6.0-fix1

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

Go to Knowledge -> Upload file

✔️ Expected Behavior

The knowledge file can be uploaded with success message.

❌ Actual Behavior

It sent many indexing-status requests in parallel at the same time, and they then failed.

The knowledge upload stopped, but indexing continued and error messages kept appearing on the screen.

dosubot · 2024-04-09T04:44:12Z

Hello @trongnichietsu! Great to see you here again. I hope all is well with you!

Based on the information you've provided and the context from similar issues, it seems like the problem might be related to the handling of parallel requests and the management of long-running operations in the Dify platform. Here are some suggestions on how to potentially resolve this issue:

AbortController for Uploads: Ensure that any ongoing upload or related requests are properly aborted if a new upload starts or if the user navigates away. This can prevent unnecessary parallel requests.
Error Handling in Stream Processing: Make sure that error handling within the function that processes server-sent events (SSE) is robust, including catching network errors and handling incomplete JSON messages more gracefully.
Timeout Management: Consider increasing the timeout or implementing a more dynamic way of handling long-running operations, especially for large file uploads or slow network conditions.
Content-Type Handling in Uploads: Ensure that the Content-Type is set to multipart/form-data, which is necessary for file uploads. If there's any manipulation of headers that could interfere with this, it could cause issues with how the server processes the request.
Server-Side Handling: Check that the server-side implementation correctly handles multipart/form-data requests and does not have issues that could lead to failed uploads or multiple indexing-status requests.
Network and Infrastructure Check: Since this is a self-hosted Docker environment, check for any network issues or Docker-specific configurations that might affect long-running connections or large uploads.
Client-Side Fetch Error Handling: Ensure that the client-side code is prepared to handle fetch errors gracefully and retry or alert the user as appropriate.
Debugging and Logging: Add more detailed logging on both the client and server sides to capture the sequence of events leading up to the failure.
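
As an illustration of the AbortController suggestion above, here is a minimal sketch of keeping at most one status request in flight. It is not taken from the Dify codebase: the `LatestOnly` class, the `fetchIndexingStatus` function, and the `url` parameter are hypothetical names, and the real indexing-status path is not given in this thread.

```typescript
// Hypothetical helper (not part of Dify): keeps at most one request in flight.
class LatestOnly {
  private controller: AbortController | null = null

  // Abort the previous request, if any, and return a signal for the new one.
  next(): AbortSignal {
    this.controller?.abort()
    this.controller = new AbortController()
    return this.controller.signal
  }

  // Call when the user navigates away to cancel whatever is still pending.
  cancel(): void {
    this.controller?.abort()
    this.controller = null
  }
}

const statusRequests = new LatestOnly()

// `url` is a placeholder for the indexing-status endpoint (an assumption).
async function fetchIndexingStatus(url: string): Promise<unknown> {
  const res = await fetch(url, { signal: statusRequests.next() })
  if (!res.ok)
    throw new Error(`indexing-status request failed: ${res.status}`)
  return res.json()
}
```

With this shape, a second call to `fetchIndexingStatus` cancels the first one instead of running alongside it, so overlapping calls cannot pile up.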

These suggestions are based on common issues related to file uploads, SSE, and fetch API usage in web applications. Implementing more robust error handling, ensuring proper use of headers, and reviewing server-side processing are key steps in resolving the issue. You can refer to the web/service/base.ts file for more details.

Regarding your question about the 'indexing-status' endpoint, it is designed to provide the indexing status of documents, either individually or in batches. It handles multiple parallel requests by relying on the Flask framework and its underlying Werkzeug server, which can handle multiple requests concurrently. Each request is processed in isolation, ensuring that the retrieval and calculation of document statuses do not interfere with one another. You can find more details in the api/controllers/console/datasets/datasets_document.py file.

Please try these suggestions and let me know if they help resolve the issue or if you need further assistance.

About Dosu

This response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research.

Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant.

If you want to continue the conversation, start your reply with @dosu-bot.

trongnichi · 2024-04-09T06:45:42Z

Also happens in Cloud!!

JohnJyong · 2024-04-09T07:30:04Z

Are you using the economy index type? @trongnichietsu

trongnichi · 2024-04-09T07:38:02Z

Are you using the economy index type? @trongnichietsu

I used the High Quality index type so that I could use a re-rank model.

trongnichi · 2024-04-09T07:46:24Z

I tested again with Economical index type, but it also has the same error.

coswind · 2024-04-09T10:59:05Z

I have the same problem when I upgrade Dify from 0.5.11 to 0.6.0-fix1

trongnichi · 2024-04-10T03:31:01Z

I have the same problem when I upgrade Dify from 0.5.11 to 0.6.0-fix1

I don't know why they released the new version 0.6.1 without checking this. I tested again on the cloud today, and the issue is still there.

JohnJyong · 2024-04-11T09:26:26Z

Could you please provide more details, such as your files and embedding model? @trongnichi

trongnichi · 2024-04-11T09:39:03Z

Could you please provide more details, such as your files and embedding model? @trongnichi

My files are all Excel files of 300-700 KB. I use the default OpenAI embedding model, text-embedding-3-small.
The error happens when I click "Go to document" after uploading a file and then click a file that is still indexing; after 1-2 s, a flood of requests is sent.

JohnJyong · 2024-04-11T11:32:40Z

@JzoNgKVO

iamjoel · 2024-04-12T08:03:21Z

Fixed: #3411
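
The linked PR is titled "chore: replace all set interval"; its actual diff is not shown in this thread. As a general illustration only, and not the PR's code, the sketch below shows one common way to avoid overlapping polls: schedule the next request with `setTimeout` only after the previous one has finished. The names `pollUntilDone`, `fetchStatus`, `onUpdate`, and the `Status` shape are hypothetical.

```typescript
// Hypothetical status shape; the real response fields are not given here.
type Status = { completed: boolean }

// Polls sequentially: the next request is scheduled only after the previous
// one has settled, so a slow or failing endpoint cannot cause requests to
// stack up. Returns a function that stops the polling.
function pollUntilDone(
  fetchStatus: () => Promise<Status>,
  delayMs: number,
  onUpdate: (status: Status) => void,
): () => void {
  let stopped = false
  let timer: ReturnType<typeof setTimeout> | undefined

  const tick = async (): Promise<void> => {
    if (stopped) return
    try {
      const status = await fetchStatus()
      if (stopped) return
      onUpdate(status)
      if (status.completed) return // indexing finished, stop polling
    } catch {
      return // stop on error instead of hammering a failing endpoint
    }
    timer = setTimeout(tick, delayMs)
  }

  void tick()

  return () => {
    stopped = true
    if (timer !== undefined) clearTimeout(timer)
  }
}
```

Unlike `setInterval`, which fires on a fixed schedule whether or not earlier requests have returned, this pattern has at most one request outstanding at a time. Whether to retry after an error, rather than stop as above, is a design choice left open here.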

dosubot bot added the 🐞 bug Something isn't working label Apr 9, 2024

trongnichi changed the title ~~Multiple request indexing-status when upload knowledge file~~ A huge of fail request indexing-status when upload knowledge file Apr 9, 2024

crazywoola assigned JohnJyong Apr 10, 2024

dosubot bot mentioned this issue Apr 10, 2024

model embedding credentials is not initialized #3295

Closed

4 tasks

dosubot bot mentioned this issue Apr 11, 2024

Segmented tasks unable to complete after platform idle #3369

Closed

4 tasks

iamjoel mentioned this issue Apr 12, 2024

chore: replace all set interval #3411

Merged

14 tasks

iamjoel closed this as completed Apr 12, 2024

dosubot bot mentioned this issue Apr 18, 2024

cannot release a lock that's no longer owned #3588

Closed

4 tasks

dosubot bot mentioned this issue May 11, 2024

upload a file to a knowledge but the indexing_status always waiting and the progress is 0% #4295

Closed

4 tasks

dosubot bot mentioned this issue Oct 3, 2024

Retrieval Failure After Indexing New Files into Knowledge Base #8996

Closed

5 tasks
