Irregular numbers on IPEX 2.3.0 and 2.3.100 for Bert-Large and Distil-bert #677
Comments
Could you please share more details? What perf numbers did you get, and what script did you use?
I am observing 5 qps at BS 60 with IPEX 2.3.100, whereas with IPEX 2.2.x I observe 41.78 qps. I have noticed multiple models behaving the same way on IPEX 2.3.x.
Hey @jgong5,
@ramaseshan1 Thanks for the detailed info. I guess @huiyan2021 is trying to repro the issue and see where the problem is. @huiyan2021, are there any findings?
Hey @huiyan2021,
Sincerely sorry that I missed this issue. Let me try to reproduce and come back asap.
Hi @ramaseshan1, I tried your benchmark code on an SPR 6438Y+ with the latest ipex 2.4.0. So could you try the latest ipex 2.4.0 and update your result here?
Describe the issue
On Bert-large and distil-bert, I am getting very low throughput numbers (BS 96) after I use ipex.optimize(model) followed by torch.compile(model, backend="ipex"). I see similarly poor numbers with ipex.optimize followed by the TorchScript path as well.
In PT 2.2.x and PT 2.1.x, the numbers are fine.
Can you let me know if this is a known bug?
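For anyone trying to reproduce this, here is a minimal sketch of the path described above together with a simple throughput timer. It is not the benchmark script used for the numbers in this thread. The helper names `measure_throughput` and `build_ipex_compiled` and the warmup and iteration counts are assumptions made for illustration; only `ipex.optimize(model)` and `torch.compile(model, backend="ipex")` come from the description.

```python
import time


def measure_throughput(run_batch, batch_size, warmup=10, iters=50):
    """Return samples per second (qps) for a zero-argument callable.

    run_batch is expected to execute one forward pass on a batch of
    batch_size samples. Warmup iterations are excluded from timing so
    that compilation and cache effects do not skew the result.
    """
    for _ in range(warmup):
        run_batch()
    start = time.perf_counter()
    for _ in range(iters):
        run_batch()
    elapsed = time.perf_counter() - start
    return batch_size * iters / elapsed


def build_ipex_compiled(model):
    """Hypothetical helper: ipex.optimize followed by torch.compile.

    Imports are local so the timer above can be used without torch or
    intel_extension_for_pytorch installed.
    """
    import torch
    import intel_extension_for_pytorch as ipex

    model = model.eval()
    model = ipex.optimize(model)
    return torch.compile(model, backend="ipex")
```

With a model and a prepared batch of inputs, the timer could be called as `measure_throughput(lambda: compiled(**inputs), batch_size=96)` inside a `torch.no_grad()` block, and the same call repeated under each IPEX version to compare qps.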