3 files changed +0 -85 lines changed
115115# Future Plan:
116116# Revert it when the related pr is merged in vllm and vllm-ascend.
117117#
118- # ** File: worker/patch_common/patch_eagle.py **
119- # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
120- # 1. `vllm.v1.spec_decode.eagle.prepare_inputs`
121- # Why:
122- # We need to use the patched `prepare_input_kernel` in `eagle.prepare_inputs`.
123- # The mainly reason to overwrite `prepare_input_kernel` is this is a triton
124- # kernel, ascend is now not support triton kernel.
125- # How:
126- # Re-implementation the `prepare_input_kernel` triton kernel by pytorch
127- # Related PR (if no, explain why):
128- # - Ascend doesn't support triton
129- # Future Plan:
130- # Revert it when the ascend support triton kernel.
131- #
132118# ** File: worker/patch_common/patch_sampler.py **
133119# ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
134120# 1. `vllm.v1.sample.sampler.Sampler.apply_top_k_top_p`
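The removed comment block says the patch re-implemented the `prepare_input_kernel` Triton kernel in PyTorch, but the diff does not show what the kernel computes. The sketch below is an illustration only, written in plain Python rather than PyTorch so that it runs without extra dependencies. It assumes the kernel maps each request's kept (non-rejected) tokens to their positions in the flattened target batch. The function name `prepare_token_indices` and both parameter names are hypothetical, not taken from this diff.

```python
from itertools import accumulate


def prepare_token_indices(cu_target_query_lens, num_rejected_tokens):
    """Hypothetical, framework-free sketch of the index computation.

    cu_target_query_lens: cumulative query lengths, length batch_size + 1.
    num_rejected_tokens:  rejected draft tokens per request, length batch_size.
    Both names are assumptions made for this illustration.
    """
    # Per-request query length from the cumulative offsets.
    query_lens = [
        end - start
        for start, end in zip(cu_target_query_lens, cu_target_query_lens[1:])
    ]
    # Tokens kept per request after dropping the rejected ones.
    kept = [q - r for q, r in zip(query_lens, num_rejected_tokens)]
    # Cumulative offsets of the kept tokens, starting at 0.
    cu_num_tokens = [0] + list(accumulate(kept))
    # For each request, emit the flat indices of its kept tokens.
    token_indices = []
    for start, n in zip(cu_target_query_lens, kept):
        token_indices.extend(range(start, start + n))
    return cu_num_tokens, token_indices


if __name__ == "__main__":
    # Two requests with 3 and 4 target tokens; 1 and 2 tokens rejected.
    print(prepare_token_indices([0, 3, 7], [1, 2]))
```

A tensor-based version would replace the list comprehensions with a cumulative sum and the inner loop with a vectorized gather; that detail is omitted here because the deleted file's contents are not visible in this diff.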
1919# patch files.
2020import vllm_ascend.patch.worker.patch_common.patch_utils # noqa isort:skip
2121import vllm_ascend.patch.worker.patch_common.patch_distributed # noqa
22- import vllm_ascend.patch.worker.patch_common.patch_eagle # noqa
2322import vllm_ascend.patch.worker.patch_common.patch_minicpm # noqa
2423import vllm_ascend.patch.worker.patch_common.patch_multi_step_worker # noqa
2524import vllm_ascend.patch.worker.patch_common.patch_sampler # noqa
This file was deleted.