-
Notifications
You must be signed in to change notification settings - Fork 635
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[xformers/benchmarks/benchmark_encoder.py] fatal error: cuda.h: No such file or directory + AttributeError: module 'triton' has no attribute 'code_gen' #516
Comments
|
One step closer it seems!
⇒ find / -name cuda.h
/opt/conda/envs/dreambooth/lib/python3.10/site-packages/nvidia/cuda_runtime/include/cuda.h
/opt/conda/envs/dreambooth/lib/python3.10/site-packages/torch/include/torch/csrc/api/include/torch/cuda.h
+ /opt/conda/envs/dreambooth/include/cuda.h
/opt/conda/lib/python3.7/site-packages/torch/include/torch/csrc/api/include/torch/cuda.h
/opt/conda/lib/python3.7/site-packages/nvidia/cuda_runtime/include/cuda.h
/opt/conda/pkgs/pytorch-1.12.0-py3.7_cuda11.3_cudnn8.3.2_0/lib/python3.7/site-packages/torch/include/torch/csrc/api/include/torch/cuda.h
/opt/conda/pkgs/cuda-cudart-dev-11.8.89-0/include/cuda.h
find: '/proc/tty/driver': Permission denied
/usr/include/linux/cuda.h Yet still getting the error: I noticed that the call to
|
|
Setting
Which may be related to this: |
I wonder if the stuff I figured in the following will help here? (to explore when I get a chance): |
the "triton has no code_gen attritbute" is unrelated, tied to a recent triton update, sorry about that. Fixed in #528 |
* removing the fp16 blocksparse crutch make the softmax kernel bfloat16 compatible * partial fix for #516 * nit, adding blfoat16 to the layernorm benchmark * dead code removal, improve code coverage Co-authored-by: Benjamin Lefaudeux <benjamin@photoroom.com>
* Add fp16 support to bucketize * Add bf16 support to bucketize * clang-format
🐛 Bug
Trying to follow along with:
Command
To Reproduce
Steps to reproduce the behavior:
!conda run -n dreambooth --live-stream python3 xformers/benchmarks/benchmark_encoder.py --activations relu --plot -emb 256 -bs 32 -heads 16
Expected behavior
The benchmark would run successfully.
Environment
Additional context
The text was updated successfully, but these errors were encountered: