Fix saving of generation_config for Llama-3 #1134

eldarkurtic · 2024-04-24T11:40:35Z

The existing version of HuggingFaceCheckpointer does not respect model's generation_config.json and during saving it initializes it from the config.json. In the case of Llama-3 model, there is a discrepancy with eos_token_id which is set to 128001 in config.json but to [128001, 128009] in generation_config.json. This means that Llama-3 models saved with HuggingFaceCheckpointer have "eos_token_id": 128001 instead of "eos_token_id": [128001, 128009],.

This creates problems when we want to use Llama-3 models produced with llm-foundry as they will most likely always generate text until the max number of tokens is exhausted instead of stopping at 128009 token.

Applying the same fix as here: mosaicml#1134

dakinggg

Thanks for the fix! Please run pre-commit and then we can merge

eldarkurtic added a commit to IST-DASLab/llm-foundry that referenced this pull request Apr 24, 2024

Fix saving of generation_config for Llama-3

6a9ce04

Applying the same fix as here: mosaicml#1134

dakinggg approved these changes Apr 24, 2024

View reviewed changes

eldarkurtic added 2 commits April 24, 2024 22:42

Fix saving of generation_config for Llama-3

a86abb0

apply pre-commit changes

d6deebb

eldarkurtic force-pushed the patch-4 branch from 639e7bc to d6deebb Compare April 24, 2024 20:42

Merge branch 'main' into patch-4

266af8c

dakinggg merged commit 15abf8c into mosaicml:main Apr 25, 2024
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix saving of generation_config for Llama-3 #1134

Fix saving of generation_config for Llama-3 #1134

eldarkurtic commented Apr 24, 2024

dakinggg left a comment

Fix saving of generation_config for Llama-3 #1134

Fix saving of generation_config for Llama-3 #1134

Conversation

eldarkurtic commented Apr 24, 2024

dakinggg left a comment

Choose a reason for hiding this comment