Added support for Llama 2 and CodeLlama in weight conversion for issue #28241 #28767
What does this PR do?
This PR adds support for Llama 2 and CodeLlama, while maintaining backwards compatibility with Llama 1, in the Llama-to-HuggingFace weight conversion script `src/transformers/models/llama/convert_llama_weights_to_hf.py`. It sets `max_position_embeddings` to 4096 for Llama 2 and to 16384 for CodeLlama, while keeping the default of 2048 for Llama 1.

Fixes #28241
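For reference, a minimal sketch of how version-dependent `max_position_embeddings` defaults like these could be selected in a conversion script. The `--llama_version` flag, the `CONTEXT_LENGTH` mapping, and the helper name are illustrative assumptions, not necessarily the exact interface this PR adds to `convert_llama_weights_to_hf.py`:

```python
# Illustrative sketch only; flag and helper names are assumptions,
# not necessarily what the PR's conversion script actually uses.
import argparse

# Default context lengths per model family, per the values in this PR.
CONTEXT_LENGTH = {
    "1": 2048,      # Llama 1 (previous default, kept for backwards compatibility)
    "2": 4096,      # Llama 2
    "code": 16384,  # CodeLlama
}


def get_max_position_embeddings(llama_version: str) -> int:
    """Return the max_position_embeddings for the given model family."""
    try:
        return CONTEXT_LENGTH[llama_version]
    except KeyError:
        raise ValueError(
            f"Unknown llama_version {llama_version!r}; "
            f"expected one of {sorted(CONTEXT_LENGTH)}"
        )


if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--llama_version",
        default="1",
        choices=sorted(CONTEXT_LENGTH),
        help="Which model family is being converted.",
    )
    args = parser.parse_args()
    print(get_max_position_embeddings(args.llama_version))
```

Defaulting to the Llama 1 value of 2048 means existing v1 conversion invocations keep their old behavior unchanged, which is the backwards-compatibility property described above.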
Who can review?
@ArthurZucker @amyeroberts