-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Accuracy regression updating triton pin in inductor #2456
Comments
could you try to comment out this line in latest code: If it solves your problem we can probably disable that for now as it was meant to help further vectorization work that I ended up putting on hold. |
Yup that fixes the reproducer for me. |
ok thanks, let me send a PR to disable this pass and I'll debug offline |
Sent: #2458 |
Running on a newer commit I still see some failures but the model the reproducer was from is fixed so those must be separate issues. |
I'm trying to debug that but I'm a bit confused as the reproducer just seem to print some performance number, how can I see the reproduce the functional bug? |
@ThomasRaoux I've updated the triton code in the issue so it runs and compares accuracy against eager pytorch. |
Thanks @peterbell10! We have to revert the revert to avoid some perf regressions in our internal workloads (#2498). I'll be debugging this next week, from a first look it seems like this exposes a NVTPX issue I'll try to find a solution, let me know if you have any concerns. |
@ThomasRaoux do you have any updates on this, or possibly some more details on the PTX issue you're seeing? |
Look like the underlying cause is the same as #2483, so closing this one. |
Thanks for figuring out this bug and finding a fix! |
* maually applied, rebased, cleanup, fix lint errors * support clang for windows * disable MSVC CXX warnings
* maually applied, rebased, fix lint errors * support clang for windows * use set_target_properties(), cleanup for windows * windows ninja does not support platform option '/A' * remove unknown option '/m' Signed-off-by: Won-Kyu Park <wkpark@gmail.com>
* maually applied, rebased, fix lint errors * support clang for windows * use set_target_properties(), cleanup for windows * windows ninja does not support platform option '/A' * remove unknown option '/m' Signed-off-by: Won-Kyu Park <wkpark@gmail.com>
* WIN32 fix using LoadLibrary Signed-off-by: Won-Kyu Park <wkpark@gmail.com>
* WIN32 fix using LoadLibrary Signed-off-by: Won-Kyu Park <wkpark@gmail.com>
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * DISPATCH_ARGS fix by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary * nvidia: nvcuda.dll, cupti*.dll, cublasLT*.dll * amd: fix dlfcn to support win32 * lint
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary * nvidia: nvcuda.dll, cupti*.dll, cublasLT*.dll * amd: fix dlfcn to support win32 * hsa: not supported but added for future use * lint
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary * nvidia: nvcuda.dll, cupti*.dll, cublasLT*.dll * amd: fix dlfcn to support win32 * hsa: not supported but added for future use * lint
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary * nvidia: nvcuda.dll, cupti*.dll, cublasLT*.dll * amd: fix dlfcn to support win32 * hsa: not supported but added for future use * lint
* based on Windows support PR triton-lang#2456 by @andreigh * WIN32 fix using LoadLibrary * nvidia: nvcuda.dll, cupti*.dll, cublasLT*.dll * amd: fix dlfcn to support win32 * hsa: not supported but added for future use * lint
I'm trying to update PyTorch's triton pin in pytorch/pytorch#109601 but am seeing accuracy regressions in several models.
I was able to get a minimized pytorch program which produces two kernels
PyTorch reproducer
and the corresponding triton code is here
Full reproducer
git bisect
suggests that #2285 is the root cause of these failures, and I see the PR changes the llvm IR generated for the second kernel with the for loop.The text was updated successfully, but these errors were encountered: