Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to use gemini instead of class OpenAILLMProvider LLMProvider? #8

Open
jjstory24 opened this issue Jan 20, 2025 · 8 comments
Open

Comments

@jjstory24
Copy link

how to use gemini instead of class OpenAILLMProvider LLMProvider?

@sshh12
Copy link
Owner

sshh12 commented Jan 20, 2025

Hey! you'll need to implement an instance of "LLMProvider" and then add it to LLM_PROVIDERS.

@dchoward9
Copy link

@jjstory24 if you were successful, could you kindly share your edits? I have tried to update the files to use both deepseek and gemini and were unsuccessful.

@sshh12
Copy link
Owner

sshh12 commented Jan 24, 2025

Definitely considering adding deepseek to have an open-source backend thats supported. How do folks typically use these (like the API)? OpenRouter?

@dchoward9
Copy link

@sshh12 if you are able to do so and have the time, please do :)

I was able to use deepseek-chat with openrouter but each message and response took at least 10 minutes.

I'm not sure which all models support tool use / function calling, if that matters.

@sshh12
Copy link
Owner

sshh12 commented Jan 25, 2025

oh darn deepseek-ai/DeepSeek-R1#9 yeah deepseek doesn't have function calling, will have to think about this a bit more

@gabelul
Copy link

gabelul commented Feb 4, 2025

would be great to be able to use custom openai compatible endpoints and to be able to use a model from there

@sshh12
Copy link
Owner

sshh12 commented Feb 4, 2025

Makes sense, will add a config for a custom base url

@sshh12
Copy link
Owner

sshh12 commented Feb 5, 2025

Image

Added

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants