Add DeepSeek V2.5 Example #171

dsikka · 2024-09-12T18:55:22Z

SUMMARY:

Add an example for quantizing DeepSeek V2.5 and running it on vLLM.

github-actions · 2024-09-12T18:55:33Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

mgoin

LGTM, only nits for future consideration.

mgoin · 2024-09-12T23:34:32Z

examples/quantizing_moe/deepseek_moe_w4a16.py

+    MODEL_ID,
+    reserve_for_hessians=True,
+    num_gpus=2,
+    torch_dtype=torch.bfloat16,


In the future we should rely on auto dtype, but DeepSeek V2.5 appears to be bfloat16, so this is okay.

mgoin · 2024-09-12T23:36:12Z

examples/quantizing_moe/deepseek_recipe_w4a16.yaml

+      ignore: [lm_head, "re:.*mlp.gate$"]
+      config_groups:
+        group_0:
+          weights: {num_bits: 4, type: int, symmetric: true, strategy: channel, dynamic: false}
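
For readers unfamiliar with the `ignore` entry in this hunk, here is a minimal sketch of how such a list might be applied to module names. It assumes that a `re:` prefix marks a regular expression matched from the start of the name and that any other entry is compared literally. The `is_ignored` helper and the module names are hypothetical and are not taken from llm-compressor or from the model.

```python
import re

# Entries copied from the recipe's ignore list.
IGNORE = ["lm_head", "re:.*mlp.gate$"]


def is_ignored(module_name, patterns=IGNORE):
    """Return True if module_name matches any ignore entry.

    Assumption: "re:" marks a regex matched from the start of the name;
    every other entry must equal the name exactly.
    """
    for pattern in patterns:
        if pattern.startswith("re:"):
            if re.match(pattern[3:], module_name):
                return True
        elif pattern == module_name:
            return True
    return False


# Hypothetical module names, for illustration only.
names = [
    "lm_head",
    "model.layers.1.mlp.gate",
    "model.layers.1.mlp.gate_proj",
    "model.layers.1.mlp.experts.0.gate_proj",
    "model.layers.1.self_attn.q_proj",
]
ignored = [name for name in names if is_ignored(name)]
```

Under these assumptions the trailing `$` is what keeps `gate_proj` layers out of the ignore list: only names that end in `mlp.gate` (the MoE router) and the literal `lm_head` are skipped.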


We may want to use grouped quantization and actorder for the final example, depending on how the accuracy comes out, but this is good for now.
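
To illustrate why a grouped strategy can affect accuracy, the sketch below compares one shared scale per output channel (as in `strategy: channel`) with one scale per group of weights, using symmetric int4 round-tripping on a toy row. This is a plain round-to-nearest illustration under stated assumptions, not the GPTQ procedure used by the example script, and it does not model actorder. The function names, the group size of 4 and the toy weights are all hypothetical.

```python
def quantize_symmetric(values, num_bits=4):
    """Round-trip values through symmetric integer quantization with one scale."""
    qmax = 2 ** (num_bits - 1) - 1      # 7 for int4
    qmin = -(2 ** (num_bits - 1))       # -8 for int4
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Quantize to the integer grid, clamp, then dequantize.
    return [max(qmin, min(qmax, round(v / scale))) * scale for v in values]


def quantize_row(row, group_size=None, num_bits=4):
    """Quantize one output channel; group_size=None uses a single scale."""
    if group_size is None:
        return quantize_symmetric(row, num_bits)
    restored = []
    for start in range(0, len(row), group_size):
        restored.extend(quantize_symmetric(row[start:start + group_size], num_bits))
    return restored


def mean_abs_error(original, restored):
    return sum(abs(a - b) for a, b in zip(original, restored)) / len(original)


# Toy row (hypothetical): small weights plus a single large outlier.
row = [0.01, -0.02, 0.03, -0.04, 0.05, -0.06, 0.07, 7.0]

channel_err = mean_abs_error(row, quantize_row(row))
group_err = mean_abs_error(row, quantize_row(row, group_size=4))
```

With a single per-channel scale the outlier forces every small weight to round to zero, while the grouped variant limits that damage to the outlier's own group, so `group_err` comes out lower than `channel_err` on this row. The cost is one extra scale to store per group.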

add deepseek example

77ca928

mgoin approved these changes Sep 12, 2024

dsikka added 2 commits September 12, 2024 20:52

Merge branch 'main' into add_deepseek_moe_example

a85454c

Merge branch 'main' into add_deepseek_moe_example

2e9da01

dsikka merged commit 4fe45de into main Sep 13, 2024
5 of 7 checks passed

dsikka deleted the add_deepseek_moe_example branch September 13, 2024 15:31

markmc pushed a commit to markmc/llm-compressor that referenced this pull request Nov 13, 2024

ignore list (vllm-project#171)

7351fdb
