
[InternLM] Add support for InternLM #26302

Merged: 3 commits into main on Sep 26, 2023

Conversation

Rocketknight1 (Member)

InternLM is based on the LLaMA code but adds a config.bias parameter. We can support those models by adding config.bias to LLaMA, preserving backward compatibility by defaulting it to False.
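A minimal sketch of the idea, assuming toy stand-ins (ToyConfig and ToyAttentionProjections are illustrative, not the merged transformers code; the flag was later renamed to attention_bias, as the commit messages below show):

```python
from dataclasses import dataclass

from torch import nn


@dataclass
class ToyConfig:
    # Stand-in for LlamaConfig with only the fields this sketch needs.
    hidden_size: int = 64
    attention_bias: bool = False  # defaults to False to preserve plain-LLaMA behaviour


class ToyAttentionProjections(nn.Module):
    """Illustrates the change: bias on the q/k/v/o projections is gated
    by the config flag instead of being hard-coded off."""

    def __init__(self, config: ToyConfig):
        super().__init__()
        h = config.hidden_size
        # InternLM-style checkpoints set attention_bias=True; existing LLaMA
        # checkpoints keep the default False, so their state dicts load unchanged.
        self.q_proj = nn.Linear(h, h, bias=config.attention_bias)
        self.k_proj = nn.Linear(h, h, bias=config.attention_bias)
        self.v_proj = nn.Linear(h, h, bias=config.attention_bias)
        self.o_proj = nn.Linear(h, h, bias=config.attention_bias)


# LLaMA-style config: no bias tensors are created.
assert ToyAttentionProjections(ToyConfig()).q_proj.bias is None
# InternLM-style config: bias tensors exist.
assert ToyAttentionProjections(ToyConfig(attention_bias=True)).q_proj.bias is not None
```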

@HuggingFaceDocBuilderDev commented Sep 20, 2023

The documentation is not available anymore as the PR was closed or merged.

@ArthurZucker (Collaborator) left a comment

Thanks! Quick question: do they use dropout after attention?
Let's also add an InternLM.md, add it to the docs, etc. 😉

Review thread on src/transformers/models/llama/configuration_llama.py (outdated, resolved)
@Rocketknight1 changed the title from "Add config.bias to LLaMA for InternLM" to "[InternLM] Add support for InternLM" on Sep 21, 2023
Rocketknight1 (Member, Author)

@ArthurZucker Also, to answer the question, there is no Dropout anywhere in the LLaMA code or the InternLM code.

@Rocketknight1 merged commit 6ba63ac into main on Sep 26, 2023
@Rocketknight1 deleted the port_internlm_as_llama branch on Sep 26, 2023 at 15:52
blbadger pushed a commit to blbadger/transformers that referenced this pull request on Nov 8, 2023:
* Add config.bias to LLaMA to allow InternLM models to be ported as LLaMA checkpoints

* Rename bias -> attention_bias and add docstring
EduardoPach pushed a commit to EduardoPach/transformers that referenced this pull request on Nov 18, 2023:
* Add config.bias to LLaMA to allow InternLM models to be ported as LLaMA checkpoints

* Rename bias -> attention_bias and add docstring
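After the rename, loading an InternLM-style model as a LLaMA checkpoint should reduce to flipping the config flag. A hedged usage sketch (the default-False check reflects the backward-compatibility goal stated above; the small config values are arbitrary and no real checkpoint name is used):

```python
from transformers import LlamaConfig, LlamaForCausalLM

# Existing LLaMA configs are unaffected: the renamed flag defaults to False.
assert LlamaConfig().attention_bias is False

# An InternLM-style model enables bias on the attention projections.
config = LlamaConfig(
    hidden_size=64,
    intermediate_size=128,
    num_hidden_layers=2,
    num_attention_heads=4,
    attention_bias=True,
)
model = LlamaForCausalLM(config)
```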