Commit c7ffe93

Alnusjaponica, nopperl, and Cecilwang authored
[Model] Support TP/PP/mamba2 kernel for PLaMo2 (#19674)
Signed-off-by: Shinichi Hemmi <shemmi@preferred.jp>
Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
Co-authored-by: Calvin Metzger <metzger@preferred.jp>
Co-authored-by: Sixue Wang <cecilwang@preferred.jp>
1 parent 15a72ac commit c7ffe93

4 files changed: +365 -213 lines changed

docs/models/supported_models.md

Lines changed: 1 addition & 1 deletion
@@ -389,7 +389,7 @@ th {
 | `PhiMoEForCausalLM` | Phi-3.5-MoE | `microsoft/Phi-3.5-MoE-instruct`, etc. | ✅︎ | ✅︎ | ✅︎ |
 | `Phi4FlashForCausalLM` | Phi-4-mini-flash-reasoning | `microsoft/microsoft/Phi-4-mini-instruct`, etc. | | | |
 | `PersimmonForCausalLM` | Persimmon | `adept/persimmon-8b-base`, `adept/persimmon-8b-chat`, etc. | | ✅︎ | ✅︎ |
-| `Plamo2ForCausalLM` | PLaMo2 | `pfnet/plamo-2-1b`, `pfnet/plamo-2-8b`, etc. | | | |
+| `Plamo2ForCausalLM` | PLaMo2 | `pfnet/plamo-2-1b`, `pfnet/plamo-2-8b`, etc. | | ✅︎ | |
 | `QWenLMHeadModel` | Qwen | `Qwen/Qwen-7B`, `Qwen/Qwen-7B-Chat`, etc. | ✅︎ | ✅︎ | ✅︎ |
 | `Qwen2ForCausalLM` | QwQ, Qwen2 | `Qwen/QwQ-32B-Preview`, `Qwen/Qwen2-7B-Instruct`, `Qwen/Qwen2-7B`, etc. | ✅︎ | ✅︎ | ✅︎ |
 | `Qwen2MoeForCausalLM` | Qwen2MoE | `Qwen/Qwen1.5-MoE-A2.7B`, `Qwen/Qwen1.5-MoE-A2.7B-Chat`, etc. | ✅︎ | ✅︎ | ✅︎ |

tests/distributed/test_pipeline_parallel.py

Lines changed: 1 addition & 0 deletions
@@ -175,6 +175,7 @@ def iter_params(self, model_id: str):
 "internlm/internlm2-chat-7b": PPTestSettings.fast(),
 "inceptionai/jais-13b-chat": PPTestSettings.fast(),
 "ai21labs/Jamba-tiny-dev": PPTestSettings.fast(),
+"pfnet/plamo-2-1b": PPTestSettings.fast(),
 "meta-llama/Llama-3.2-1B-Instruct": PPTestSettings.detailed(),
 # Tests TransformersForCausalLM
 "hmellor/Ilama-3.2-1B": PPTestSettings.fast(),

tests/quantization/test_experts_int8.py

Lines changed: 1 addition & 1 deletion
@@ -9,7 +9,7 @@

 from tests.quantization.utils import is_quant_method_supported

-MODELS = ["ai21labs/Jamba-tiny-random"]
+MODELS = ["ai21labs/Jamba-tiny-random", "pfnet/plamo-2-1b"]


 @pytest.mark.skipif(not is_quant_method_supported("experts_int8"),
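
The commit title says PLaMo2 now supports tensor parallelism (TP) and pipeline parallelism (PP). As a rough sketch of how that might be used from Python, the snippet below builds keyword arguments for `vllm.LLM`. The helper names (`plamo2_engine_kwargs`, `required_gpus`, `generate`) are hypothetical and not part of this commit. The `trust_remote_code=True` setting is an assumption about the `pfnet/plamo-2-1b` checkpoint, and whether the offline `LLM` class accepts `pipeline_parallel_size > 1` depends on the vLLM version and executor backend.

```python
from typing import Any


def plamo2_engine_kwargs(tensor_parallel_size: int = 1,
                         pipeline_parallel_size: int = 1) -> dict[str, Any]:
    """Hypothetical helper: collect keyword arguments for vllm.LLM."""
    if tensor_parallel_size < 1 or pipeline_parallel_size < 1:
        raise ValueError("parallel sizes must be >= 1")
    return {
        "model": "pfnet/plamo-2-1b",
        # Assumption: the checkpoint ships custom code on the Hugging Face Hub.
        "trust_remote_code": True,
        "tensor_parallel_size": tensor_parallel_size,
        "pipeline_parallel_size": pipeline_parallel_size,
    }


def required_gpus(kwargs: dict[str, Any]) -> int:
    """The distributed world size is the TP size times the PP size."""
    return kwargs["tensor_parallel_size"] * kwargs["pipeline_parallel_size"]


def generate(prompt: str, **parallel: int) -> str:
    """Run one prompt; needs vLLM installed and enough GPUs available."""
    # Imported lazily so the helpers above work without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(**plamo2_engine_kwargs(**parallel))
    outputs = llm.generate([prompt], SamplingParams(max_tokens=32))
    return outputs[0].outputs[0].text
```

For example, `generate("Hello", tensor_parallel_size=2)` would shard the model across two GPUs. Keeping the argument construction separate from engine start-up makes it possible to check the GPU count before loading any weights.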
