Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Model link for Llama-3-instruct 70B is wrong #1369

Closed
slobentanzer opened this issue Apr 24, 2024 · 3 comments · Fixed by #1370
Closed

Model link for Llama-3-instruct 70B is wrong #1369

slobentanzer opened this issue Apr 24, 2024 · 3 comments · Fixed by #1370
Labels
Milestone

Comments

@slobentanzer
Copy link

Describe the bug

Trying to use the builtin Llama-3-instruct 70B (gguf) from the Python client (.launch_model()) fails with this Traceback (excerpt):

Traceback (most recent call last):
  File "/Users/slobentanzer/miniforge3/envs/xinference/lib/python3.10/site-packages/huggingface_hub/utils/_errors.py", line 286, in hf_raise_for_status
    response.raise_for_status()
  File "/Users/slobentanzer/miniforge3/envs/xinference/lib/python3.10/site-packages/requests/models.py", line 1021, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://huggingface.co/lmstudio-community/Meta-Llama-3-70B-Instruct-GGUF/resolve/main/Meta-Llama-3-8B-Instruct-Q4_K_M.gguf

I guess it should be /Meta-Llama-3-70B-Instruct-Q4_K_M.gguf in the file path, right?

I did (xinference 0.3.10):

client.launch_model(
            model_name="llama-3-instruct",
            model_size_in_billions=70,
            model_format="ggufv2",
            quantization="Q4_K_M",
        )

I don't think other details matter in this issue, but feel free to correct me in that. :)

@ChengjieLi28
Copy link
Contributor

@slobentanzer Thanks for reporting. Fix this by PR #1370 for the next release.

@XprobeBot XprobeBot modified the milestones: v0.11.0, v0.11.1, v0.11.2 May 11, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.2, v0.11.3 May 24, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.3, v0.11.4, v0.12.0, v0.12.1 May 31, 2024
@XprobeBot XprobeBot modified the milestones: v0.12.1, v0.12.2 Jun 14, 2024
@XprobeBot XprobeBot modified the milestones: v0.12.2, v0.12.4, v0.13.0, v0.13.1 Jun 28, 2024
@XprobeBot XprobeBot modified the milestones: v0.13.1, v0.13.2 Jul 12, 2024
@XprobeBot XprobeBot modified the milestones: v0.13.2, v0.13.4 Jul 26, 2024
Copy link

github-actions bot commented Aug 6, 2024

This issue is stale because it has been open for 7 days with no activity.

@github-actions github-actions bot added the stale label Aug 6, 2024
Copy link

This issue was closed because it has been inactive for 5 days since being marked as stale.

@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Aug 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants