llm: disable fast decoding path for mistral
Nuullll committed Dec 5, 2024
1 parent 76aa494 commit 62920f6
Showing 1 changed file with 9 additions and 0 deletions.
9 changes: 9 additions & 0 deletions service/llm_biz.py
@@ -27,6 +27,15 @@
import model_config


import ipex_llm.transformers.models.mistral

# Workaround for https://github.com/intel/AI-Playground/issues/94
# Disable the decoding fast path so that forward_qkv() is never called;
# forward_qkv() is not supported by bigdl-core-xe-*-23.
ipex_llm.transformers.models.mistral.use_decoding_fast_path = (
    lambda *args, **kwargs: False
)


class LLMParams:
    prompt: List[Dict[str, str]]
    device: int
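
The workaround in this commit replaces a module-level function with a stub that always returns False. A minimal sketch of why that works, using a hypothetical stand-in module (`fake_mistral` and `attention_forward` are invented for illustration and are not part of ipex_llm): functions defined in a module look up sibling names in the module's globals at call time, so rebinding the attribute changes what they call.

```python
import types

# Hypothetical stand-in for a library module; the real ipex_llm module is not imported here.
fake_mistral = types.ModuleType("fake_mistral")

# Define two functions whose globals are the stand-in module's namespace.
exec(
    '''
def use_decoding_fast_path(*args, **kwargs):
    return True

def attention_forward(x):
    # The name is resolved in the module globals each time this runs.
    if use_decoding_fast_path(x):
        return "fast"
    return "slow"
''',
    fake_mistral.__dict__,
)

before = fake_mistral.attention_forward(1)

# Same pattern as the commit: rebind the module attribute to a stub.
fake_mistral.use_decoding_fast_path = lambda *args, **kwargs: False

after = fake_mistral.attention_forward(1)
print(before, after)
```

One caveat, stated as a general Python fact rather than a claim about ipex_llm: code that ran `from module import use_decoding_fast_path` before the patch keeps its own reference to the original function, so the patch has to be applied on the module whose namespace the caller actually reads.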