Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Roadmap] AzureAI support for large language models #608

Open
wxwatcher2004 opened this issue Jul 26, 2024 · 2 comments
Open

[Roadmap] AzureAI support for large language models #608

wxwatcher2004 opened this issue Jul 26, 2024 · 2 comments

Comments

@wxwatcher2004
Copy link

Why
Llama 3.1 405b just released and one of the only way to get access to it is through AzureAI hosting it.

Description
Support AzureAI for non OpenAI models as a cloud service.

Requirements
Here is the current token error when I try to use AzureAI on my Llama 3.1 server. [Issue] OpenAI/azure: Bad Request - "status": "Auth token must be passed as a header called Authorization"

@enricoros
Copy link
Owner

@wxwatcher2004 to replicate this, how do I setup a service, and where do you see the error? Please add some details so I can reproduce easily.

@wxwatcher2004
Copy link
Author

So the issue occurs in the models page and I think is an issue with how Azure works. They have a connection side to OpenAI, I followed the instructions you have and found that to work correctly. The other open source models use serverless options. Best I can tell these are accessed through the model deployment URL and Key. I also tried https://x.cognitiveservices.azure.com/ configuration but that only showed the GPT4 model as well.

working OpenAI connection
image
Not working Llama 3.1 connection
image
Azure model setting screen
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants