When I use openvino==2024.2.0 to run qwen2-7b-int4 and llama2-7b-int4 on the iGPU, I get the following inference error:
"Due to the driver bug any subsequent OpenCL API call will cause application hang, so GPU plugin can't finish correctly. Please try to update the driver or reduce memory consumption (use smaller batch size, less streams, lower precision, etc)to avoid CL_OUT_OF_RESOURCES exception"
When I change the device to CPU, there is no error.
Is this a bug in OpenVINO, or is there something I did wrong? Please help.
Step-by-step reproduction
Llama 2 7B model quantized to INT4 with optimum-cli
Intel Core Ultra 7 155H PC
openvino==2024.2.0
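For reference, the GPU-versus-CPU comparison described above could be scripted roughly as in the sketch below. This is not the script from the original report. It assumes the model was exported with `optimum-cli` into a local directory and is loaded through `OVModelForCausalLM` from optimum-intel. The names `compare_devices`, `generate_with_openvino`, the directory `llama2-7b-int4-ov`, and the prompt are all hypothetical, and it assumes the GPU failure surfaces in Python as a `RuntimeError` rather than hanging the process.

```python
from typing import Callable, Dict, Sequence


def compare_devices(run: Callable[[str], str],
                    devices: Sequence[str] = ("GPU", "CPU")) -> Dict[str, str]:
    """Call run(device) for each device and record whether it succeeded."""
    results: Dict[str, str] = {}
    for device in devices:
        try:
            results[device] = f"ok: {run(device)}"
        except RuntimeError as err:
            # Assumption: the OpenVINO failure is raised as RuntimeError.
            results[device] = f"error: {err}"
    return results


def generate_with_openvino(device: str,
                           model_dir: str = "llama2-7b-int4-ov",  # hypothetical path
                           prompt: str = "Hello") -> str:
    """Load the exported INT4 model on `device` and generate a few tokens."""
    # Imported here so compare_devices() can be used without these packages.
    from optimum.intel import OVModelForCausalLM
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    model = OVModelForCausalLM.from_pretrained(model_dir, device=device)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=16)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `compare_devices(generate_with_openvino)` would then report one line per device, which makes it easy to attach the exact GPU error next to the working CPU result.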
Relevant log output
"Due to the driver bug any subsequent OpenCL API call will cause application hang, so GPU plugin can't finish correctly. Please try to update the driver or reduce memory consumption (use smaller batch size, less streams, lower precision, etc)to avoid CL_OUT_OF_RESOURCES exception"
Issue submission checklist
I'm reporting an issue. It's not a question.
I checked the problem with the documentation, FAQ, open issues, Stack Overflow, etc., and have not found a solution.
There is reproducer code and related data files such as images, videos, models, etc.
OpenVINO Version
2024.2.0
Operating System
Windows System
Device used for inference
GPU
Framework
PyTorch
Model used
llama2 7b