Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Ability to upload text,csv or pdf by endusers #2763

Closed
mahmoudsorf opened this issue Jul 4, 2024 · 5 comments
Closed

[FEATURE] Ability to upload text,csv or pdf by endusers #2763

mahmoudsorf opened this issue Jul 4, 2024 · 5 comments
Labels
enhancement New feature or request

Comments

@mahmoudsorf
Copy link

Currently with simple text file uploads (applicable to pdf and csv files as well) The text needs to be converted to embeddings, stored to a vector DB then the chatbot can be used/published. If we can make this first step executed automatically if a user uploaded a text file within the embed-chat itself it would really be a huge improvement (of course file size limits should apply).

image

@HenryHengZJ HenryHengZJ added the enhancement New feature or request label Jul 12, 2024
@HenryHengZJ
Copy link
Contributor

same as - #2444

@haimco50
Copy link

Same as #2084 as well :-)
@HenryHengZJ - would it work by calling the Flowise API ? (i.e instead of passing base64 encoded image will pass PDF url or whatever Claude needs to support this) - any work around you could think of?
btw, with the multi modality ramping up quickly this should be an higher priority item.

@haimco50
Copy link

@HenryHengZJ - I took a look at Anthropic Claude API doc and it looks like the Create message API accepts either text or based64 encoded image which means that in order to support PDF/None image format they had to do some pre-processing extraction activity of text/images before calling the API so is this the way to plan to support this or maybe you plan on using some kind of embedding technique etc.. - I have a use case of PDF files that have only images in it (scanned documents) so I need to figure out the base way to support this - I can extract images from the PDF (one image for each PDF page) and then call the API with the extracted images but I am not sure this is the most effective way dealing with this request.

@HenryHengZJ
Copy link
Contributor

Closing as #3059 merged!

@haimco50
Copy link

Wow - finally !! this is really amazing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants