Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Request] Preview model disk size usage #88

Open
czlr opened this issue Aug 3, 2024 · 6 comments
Open

[Request] Preview model disk size usage #88

czlr opened this issue Aug 3, 2024 · 6 comments
Assignees

Comments

@czlr
Copy link

czlr commented Aug 3, 2024

Request: As a casual user without much knowledge in LLMs, it would be nice to know upfront how much disk space the models need.

Currently: The various posts and docs only mention that llama 3.1 comes in different variants: 8B, 70B, 405B; regular vs instruct; etc. But they don't mention things like required disk size usage, or minimum system specs to run things smoothly. (Unless I accidentally missed those details)
I can only start a download blindly and monitor while it's going. The 70B model also seems to come in 17GB .pth parts -- I can't see how many parts are remaining either.

@DAOZHENREN
Copy link

For the 70B model, it may take 131GB

@DAOZHENREN
Copy link

And 15GB for the 8B model. The regular and instruct are equal in size, but the latter is finetuned toward chat and is safer(less toxic).

@samuelselvan samuelselvan self-assigned this Aug 7, 2024
@vonpetersenn
Copy link

I have a related question: if the model has 70B parameters, wouldn't we expect the size of the model to be 70*4=280 GB?
A Float32 parameter takes up 4 bytes of storage. Are the parameters of LLaMA Float16-numbers?

@DAOZHENREN
Copy link

I think that you have misunderstood the 'B' in the 70B. Actually, it means billion.

@ashwinb
Copy link
Contributor

ashwinb commented Nov 5, 2024

@vonpetersenn Yes they are float16 -- specifically "bf16" (Brain Float 16) format.

@vonpetersenn
Copy link

thank you very much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants