
Optimization of Performance for Large Text Segmentation in the Knowledge Base #10881

Open · xiaohuihuige opened this issue Nov 20, 2024 · 0 comments

Labels: 💪 enhancement (New feature or request)
Self Checks

  • I have searched for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (I have read and agree to the Language Policy).
  • [FOR CHINESE USERS] Please be sure to submit issues in English; otherwise they will be closed. Thank you! :)
  • Please do not modify this template :) and fill in all the required fields.

1. Is this request related to a challenge you're experiencing? Tell me about your story.

When using the knowledge base, the uploaded text is read and split during the text segmentation and cleaning step. When "Save and Process" is clicked, the same split operation is repeated. If the text is large, this repeated operation takes a long time and performance suffers. Splitting very large texts is already a significant performance challenge on its own. Would it be possible to use CUDA to accelerate the processing?
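Part of the cost comes from the same text being split twice. One possible direction, sketched below in Python (the `split_text` placeholder and the cache layout are hypothetical illustrations, not Dify's actual code), would be to cache the result of the first split keyed by a hash of the document content and the segmentation settings, so that "Save and Process" can reuse the preview result instead of splitting again:

```python
import hashlib
import json

# Hypothetical in-process cache keyed by (document content, split settings).
_split_cache: dict[str, list[str]] = {}

def _cache_key(text: str, settings: dict) -> str:
    # Hash both the text and the segmentation settings, since changing
    # either one must invalidate the cached segments.
    payload = text.encode("utf-8") + json.dumps(settings, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()

def split_text(text: str, settings: dict) -> list[str]:
    # Placeholder for the real splitter; a naive fixed-size chunker here.
    size = settings.get("chunk_size", 500)
    return [text[i:i + size] for i in range(0, len(text), size)]

def split_with_cache(text: str, settings: dict) -> list[str]:
    # Compute the segments only on a cache miss; reuse them otherwise.
    key = _cache_key(text, settings)
    if key not in _split_cache:
        _split_cache[key] = split_text(text, settings)
    return _split_cache[key]
```

With something along these lines, the preview step and "Save and Process" would compute the segments once and share them, regardless of document size.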

2. Additional context or comments

When using the knowledge base, the uploaded text is read and split during text segmentation and cleaning. When "Save and Process" is clicked, the split operation above is repeated; if the text is large, this repeated operation is very time-consuming and performance is poor. Splitting very large texts puts considerable strain on performance. Could CUDA be used to accelerate this processing?
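On the CUDA suggestion: splitting is largely string handling, so before reaching for a GPU it may be worth checking whether the work parallelizes across CPU cores. A minimal sketch, again with a hypothetical `split_document` helper rather than Dify's real splitter:

```python
from concurrent.futures import ProcessPoolExecutor

def split_document(text: str, chunk_size: int = 500) -> list[str]:
    # Placeholder splitter: fixed-size chunks.
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

def split_many(documents: list[str]) -> list[list[str]]:
    # Split several uploaded documents in parallel worker processes.
    with ProcessPoolExecutor() as pool:
        return list(pool.map(split_document, documents))

if __name__ == "__main__":
    docs = ["lorem ipsum " * 10_000, "dolor sit amet " * 10_000]
    print([len(chunks) for chunks in split_many(docs)])
```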

3. Can you help us with this feature?

  • I am interested in contributing to this feature.