Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Set pretrained model name correctly, if provided, in HF Checkpointer #1407

Merged
merged 7 commits into from
Jul 29, 2024

Conversation

snarayan21
Copy link
Contributor

@snarayan21 snarayan21 commented Jul 29, 2024

Sets the pretrained model name correctly (if provided via mlflow config) in HF Checkpointer. This also addresses the case where lora adapters may get saved out without the correct base model name.

Manual test worked: adapter-config-composer-llama3-1-8b-base-new-PvcjIr and adapter-config-composer-llama3-1-8b-base-new-0HGUfd
Registered models have "base_model_name_or_path": "meta-llama/Meta-Llama-3.1-8B", as expected

@snarayan21 snarayan21 requested a review from a team as a code owner July 29, 2024 15:46
@dakinggg dakinggg enabled auto-merge (squash) July 29, 2024 17:03
@dakinggg dakinggg merged commit 6f4aa8c into main Jul 29, 2024
9 checks passed
@dakinggg dakinggg deleted the saaketh/base_model_name branch August 6, 2024 18:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants