[Bug]: AMD MultiStep Feature Issue. Missing argument: 'turn_prefills_into_decodes' in advance_step()
#9111
Closed
1 task done
Labels
bug
Something isn't working
Your current environment
The output of `python collect_env.py`
Model Input Dumps
No response
🐛 Describe the bug
Cause
There is a change in advance_step API in PR #8378 . The changed is made to
flash_attn.py
andflashinfer.py
but not other backend.Code Snippet to highlight the difference in
advance_step
APIflash_attn.py
rocm_flash_attn.py
Confusion whether this is a bug or because multistep feature is not supported on AMD
I saw there are other PR that is stating that multi steps feature are working on AMD e.g. #8474 .
Logs
The error logs and traceback is as follows:
Before submitting a new issue...
The text was updated successfully, but these errors were encountered: