HF Model loading fixes #59

farzadab · 2024-07-26T00:04:16Z

This PR fixes the following:

- Gets rid of the unexpected weights warnings
- Makes HF model loading faster by skipping random initialization

Example of the unexpected weights warning that this PR is supposed to resolve:

Some weights of the model checkpoint at fixie-ai/ultravox_dev were not used when initializing UltravoxModel: ['language_model.lm_head.weight', 'language_model.model.embed_tokens.weight', 'language_model.model.layers.0.input_layernorm.weight',

...

- This IS expected if you are initializing UltravoxModel from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing UltravoxModel from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
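
For context, Transformers models can declare regex patterns in `_keys_to_ignore_on_load_unexpected`, and checkpoint keys that match them are left out of this warning. The sketch below imitates that filtering in plain Python so it can be run without loading a model. The helper name `filter_unexpected_keys`, the extra key, and the pattern `^language_model\.` are hypothetical and are not taken from this PR's diff.

```python
import re


def filter_unexpected_keys(unexpected_keys, ignore_patterns):
    """Return only the keys that match none of the ignore patterns.

    Each pattern is treated as a regex and matched with re.search.
    """
    return [
        key
        for key in unexpected_keys
        if not any(re.search(pattern, key) for pattern in ignore_patterns)
    ]


# The first three keys are copied from the warning above;
# the last one is a hypothetical key added for contrast.
unexpected = [
    "language_model.lm_head.weight",
    "language_model.model.embed_tokens.weight",
    "language_model.model.layers.0.input_layernorm.weight",
    "some_other_module.weight",  # hypothetical
]

# Hypothetical pattern: the patterns actually used in the PR are not shown here.
ignore_patterns = [r"^language_model\."]

remaining = filter_unexpected_keys(unexpected, ignore_patterns)
print(remaining)
```

Only keys that survive the filter would still be reported, so a pattern that is too broad could hide a genuinely mismatched checkpoint. Keeping the patterns narrow avoids that.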

ultravox/model/ultravox_model.py

farzadab added 2 commits July 25, 2024 17:00

fix unexpected keys warning

631273f

fix slow model loading due to random init

47adebf

farzadab requested review from juberti and zqhuang211 July 26, 2024 00:08

farzadab commented Jul 26, 2024

ultravox/model/ultravox_model.py (outdated, resolved)

zqhuang211 approved these changes Jul 26, 2024

juberti approved these changes Jul 26, 2024

ultravox/model/ultravox_model.py (outdated, resolved)

comments for unexpected and missing keys_to_ignore

35f7c86

farzadab enabled auto-merge (squash) July 26, 2024 23:24

farzadab merged commit 4212376 into main Jul 26, 2024
1 check passed

farzadab deleted the farzad/hf-model-load-fixes branch July 26, 2024 23:28
