-
Notifications
You must be signed in to change notification settings - Fork 113
Wrong prediction from "bloom-deepspeed-inference-int8" #10
Comments
Also got error "probability tensor contains either |
@zomux can you try updating to the latest deepspeed (0.7.3)? |
@mayank31398 I'm running ds with the latest github checkout
Thanks for the pointer, I will check the discussion there. |
I'm also getting the "CUDA error: an illegal memory access was encountered" error with little bit longer prompt , same as microsoft/DeepSpeed#2217 (comment) Is it possible that the checkpoints in https://huggingface.co/microsoft/bloom-deepspeed-inference-int8/tree/main are produced before that fix was merged? |
The checkpoints dont have anything to do with this |
Try using this branch |
Awesome thanks, gonna check it. Let me know if you want more details for reproducing this problem. |
@mayank31398 Thanks for the pointers. I think my issue is solved after putting different things together. Thanks! |
Resolving this issue. |
I'm running bloom-deepspeed-inference-int8 using the following command on 8 x 40G A100 machine.
I got the generation result, but they have a lot of repetition which is not the case for accelerate-based bloom int8 implementation.
The text was updated successfully, but these errors were encountered: