[WIP] Enable DeepSeek Models on Intel Gaudi device #3326

YangQun1 · 2025-02-06T01:34:12Z

Example

DeepSeek-V2-Lite

Run on a single Gaudi2

python examples/runtime/engine/offline_batch_inference.py \
    --device hpu \
    --model-path deepseek-ai/DeepSeek-V2-Lite \
    --trust-remote-code \
    --disable-mla

python3 -m sglang.bench_one_batch \
    --batch-size 1 \
    --input 1024 \
    --output 8 \
    --model deepseek-ai/DeepSeek-V2-Lite \
    --trust-remote-code \
    --device hpu \
    --disable-mla

DeepSeek-R1

Run on 8x Gaudi3

python3 -m sglang.bench_one_batch \
    --batch-size 1 \
    --input 1024 \
    --output 8 \
    --model /software/data/DeepSeek-R1 \
    --trust-remote-code \
    --device hpu \
    --tp 8 \
    --load-format dummy \
    --disable-mla
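
The two `sglang.bench_one_batch` invocations above differ only in the model path and in the extra `--tp` and `--load-format` flags, so they can be generated from one small helper. The following is a minimal sketch, not part of this PR or of sglang: `build_bench_command` is a hypothetical name, and the flags it emits are copied verbatim from the commands above.

```python
import shlex


def build_bench_command(model, device="hpu", batch_size=1, input_len=1024,
                        output_len=8, tp=None, load_format=None,
                        disable_mla=True):
    """Assemble an argv list mirroring the bench_one_batch examples above.

    Hypothetical helper: it only builds the command, it does not run it.
    """
    argv = [
        "python3", "-m", "sglang.bench_one_batch",
        "--batch-size", str(batch_size),
        "--input", str(input_len),
        "--output", str(output_len),
        "--model", model,
        "--trust-remote-code",
        "--device", device,
    ]
    # Tensor parallelism and load format appear only in the DeepSeek-R1 example.
    if tp is not None:
        argv += ["--tp", str(tp)]
    if load_format is not None:
        argv += ["--load-format", load_format]
    if disable_mla:
        argv.append("--disable-mla")
    return argv


# Single Gaudi2, DeepSeek-V2-Lite.
v2_lite = build_bench_command("deepseek-ai/DeepSeek-V2-Lite")
# 8x Gaudi3, DeepSeek-R1 with dummy weights.
r1 = build_bench_command("/software/data/DeepSeek-R1", tp=8, load_format="dummy")

print(shlex.join(v2_lite))
print(shlex.join(r1))
```

To actually launch a run, the returned list could be passed to `subprocess.run(argv, check=True)` on a machine with the HPU software stack installed.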

YangQun1 added 2 commits February 5, 2025 15:13

enable deepseek-v2-lite on hpu

c28df68

enable deepseek-r1 without mla on hpu

9cec9fb

YangQun1 changed the title ~~[Draft] Enable DeepSeek Models on Intel Gaudi device~~ [WIP] Enable DeepSeek Models on Intel Gaudi device Feb 6, 2025

YangQun1 closed this Feb 7, 2025
