You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Using flashinfer in sglang with google/gemma-7b-it
File "/home/ubuntu/sglang-venv/lib/python3.11/site-packages/flashinfer/prefill.py", line 462, in forward
return self._wrapper.forward(
^^^^^^^^^^^^^^^^^^^^^^
RuntimeError: BatchPrefillWithPagedKVCache failed to dispatch with dtype Half
I don't know if this is caused by Gemma's bfloat16 dtype or my inappropriate usage.
The text was updated successfully, but these errors were encountered:
Using
flashinfer
insglang
withgoogle/gemma-7b-it
I don't know if this is caused by Gemma's
bfloat16
dtype or my inappropriate usage.The text was updated successfully, but these errors were encountered: