[Question]: How can I improve the concurrency performance of the ragflow_stream_output api？ #3641

Weishaoya · 2024-11-25T16:39:52Z

Describe your problem

For ragflow_streaming_output api, when I set the number of concurrent requests to 1, 10, and 100, the first token latency was 0.6719s, 4.7593s, and 41.9158s, respectively. Due to the existence of the retrieval link, the concurrency performance of ragflow_stream_output api is weak, which is not conducive to large-scale applications. How can I improve the concurrency performance of ragflow_stream_output api?

KevinHuSh · 2024-11-26T01:19:10Z

Change the run_simple function in api/ragflow_server.py.

Weishaoya · 2024-11-26T16:00:31Z

Change the run_simple function in api/ragflow_server.py.

I have changed the run_simple function to the Gunicorn. The concurrency performance of ragflow_stream_output improves when I set workers to 10, but it has a problem that the embedding model will be loaded 10 times on gpu-0. Can you provide a better way to improve the concurrency performance? Thank you!

Weishaoya added the question Further information is requested label Nov 25, 2024

Weishaoya changed the title ~~[Question]: How can I improve the concurrency performance of the ragflow stream output api？~~ [Question]: How can I improve the concurrency performance of the ragflow_stream_output api？ Nov 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question]: How can I improve the concurrency performance of the ragflow_stream_output api？ #3641

[Question]: How can I improve the concurrency performance of the ragflow_stream_output api？ #3641

Weishaoya commented Nov 25, 2024

KevinHuSh commented Nov 26, 2024

Weishaoya commented Nov 26, 2024

[Question]: How can I improve the concurrency performance of the ragflow_stream_output api？ #3641

[Question]: How can I improve the concurrency performance of the ragflow_stream_output api？ #3641

Comments

Weishaoya commented Nov 25, 2024

Describe your problem

KevinHuSh commented Nov 26, 2024

Weishaoya commented Nov 26, 2024