336 | 336 | | `DeepseekV2ForCausalLM` | DeepSeek-V2 | `deepseek-ai/DeepSeek-V2`, `deepseek-ai/DeepSeek-V2-Chat`, etc. | | ✅︎ | ✅︎ | |
337 | 337 | | `DeepseekV3ForCausalLM` | DeepSeek-V3 | `deepseek-ai/DeepSeek-V3-Base`, `deepseek-ai/DeepSeek-V3`, etc. | | ✅︎ | ✅︎ | |
338 | 338 | | `Dots1ForCausalLM` | dots.llm1 | `rednote-hilab/dots.llm1.base`, `rednote-hilab/dots.llm1.inst`, etc. | | ✅︎ | ✅︎ | |
339 | | -| `Ernie4_5_ForCausalLM` | Ernie4.5 | `baidu/ERNIE-4.5-0.3B-PT`, etc. | ✅︎ | ✅︎ | ✅︎ | |
| 339 | +| `Ernie4_5ForCausalLM` | Ernie4.5 | `baidu/ERNIE-4.5-0.3B-PT`, etc. | ✅︎ | ✅︎ | ✅︎ | |
340 | 340 | | `Ernie4_5_MoeForCausalLM` | Ernie4.5MoE | `baidu/ERNIE-4.5-21B-A3B-PT`, `baidu/ERNIE-4.5-300B-A47B-PT`, etc. |✅︎| ✅︎ | ✅︎ | |
341 | 341 | | `ExaoneForCausalLM` | EXAONE-3 | `LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct`, etc. | ✅︎ | ✅︎ | ✅︎ | |
342 | 342 | | `Exaone4ForCausalLM` | EXAONE-4 | `LGAI-EXAONE/EXAONE-4.0-32B`, etc. | ✅︎ | ✅︎ | ✅︎ | |
@@ -634,10 +634,10 @@ Some models are supported only via the [Transformers backend](#transformers). Th |
634 | 634 | |--------------|--------|--------|-------------------|-----------------------------|-----------------------------------------|---------------------| |
635 | 635 | | `Emu3ForConditionalGeneration` | Emu3 | T + I | `BAAI/Emu3-Chat-hf` | ✅︎ | ✅︎ | ✅︎ | |
636 | 636 |
637 | | -<sup>^</sup> You need to set the architecture name via `--hf-overrides` to match the one in vLLM. |
638 | | - • For example, to use DeepSeek-VL2 series models: |
639 | | - `--hf-overrides '{"architectures": ["DeepseekVLV2ForCausalLM"]}'` |
640 | | -<sup>E</sup> Pre-computed embeddings can be inputted for this modality. |
| 637 | +<sup>^</sup> You need to set the architecture name via `--hf-overrides` to match the one in vLLM. |
| 638 | + • For example, to use DeepSeek-VL2 series models: |
| 639 | + `--hf-overrides '{"architectures": ["DeepseekVLV2ForCausalLM"]}'` |
| 640 | +<sup>E</sup> Pre-computed embeddings can be inputted for this modality. |
641 | 641 | <sup>+</sup> Multiple items can be inputted per text prompt for this modality. |
642 | 642 |
643 | 643 | !!! warning |
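The `--hf-overrides` footnote in the hunk above passes a JSON object on the command line, which is easy to mis-quote in a shell. As a minimal sketch (the helper name `hf_overrides_flag` is hypothetical and not part of vLLM), the flag can be built with the Python standard library so that the JSON and the shell quoting stay consistent:

```python
import json
import shlex


def hf_overrides_flag(architectures):
    """Build the `--hf-overrides` argument shown in the footnote.

    Hypothetical helper for illustration only; it just formats a string
    and does not call vLLM.
    """
    # Serialize the override as JSON, matching the footnote's payload shape.
    payload = json.dumps({"architectures": list(architectures)})
    # shlex.quote wraps the JSON in single quotes so a POSIX shell passes it
    # through as a single argument.
    return f"--hf-overrides {shlex.quote(payload)}"


flag = hf_overrides_flag(["DeepseekVLV2ForCausalLM"])
print(flag)
```

The printed string can be appended to a launch command; splitting it again with `shlex.split` yields the flag name followed by the JSON payload as one argument.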