Hello!
I am currently doing research into quantizing Llama models.
However, I have noticed that the publicly released Llama checkpoints are always in BF16, including the embeddings.
Although BF16 is suitable for most use cases, my current research may be more precision-sensitive than typical applications.
Is there any way to access the master FP32 weights? It is my understanding that training keeps the master copy of every weight in FP32 and only performs certain operations in BF16, so the released BF16 checkpoints discard the low-order mantissa bits of those master copies. I would be most grateful if the FP32 checkpoints could be made accessible.
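For context, here is a minimal sketch of the workaround I have tried, assuming a locally downloaded safetensors shard (the filename below is hypothetical). Inspecting the stored dtypes confirms BF16, and while upcasting to FP32 is exact, it cannot restore the mantissa bits that were truncated when the checkpoint was saved:

```python
import torch
from safetensors import safe_open

# Hypothetical path to one downloaded checkpoint shard; adjust as needed.
path = "model-00001-of-00002.safetensors"

with safe_open(path, framework="pt", device="cpu") as f:
    for name in f.keys():
        t = f.get_tensor(name)
        print(name, t.dtype)          # typically torch.bfloat16
        t_fp32 = t.to(torch.float32)  # exact upcast, but the low-order
                                      # mantissa bits dropped at save time
                                      # are unrecoverable
```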