Flash Attention 2.0 doesn't work: undefined symbol: _ZNK3c1010TensorImpl27throw_data_ptr_access_errorEv #451

Open
FrancescoSaverioZuppichini opened this issue Aug 15, 2023 · 25 comments


@FrancescoSaverioZuppichini

Hi there,

Cloning the repo and running pytest tests/test_flash_attn.py gives

ImportError while importing test module '/root/flash-attention/tests/test_flash_attn.py'.
Hint: make sure your test modules/packages have valid Python names.
Traceback:
/usr/lib/python3.10/importlib/__init__.py:126: in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
tests/test_flash_attn.py:10: in <module>
    from flash_attn import flash_attn_func, flash_attn_kvpacked_func, flash_attn_qkvpacked_func
/usr/local/lib/python3.10/dist-packages/flash_attn/__init__.py:3: in <module>
    from flash_attn.flash_attn_interface import flash_attn_func
/usr/local/lib/python3.10/dist-packages/flash_attn/flash_attn_interface.py:4: in <module>
    import flash_attn_2_cuda as flash_attn_cuda
E   ImportError: /usr/local/lib/python3.10/dist-packages/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZNK3c1010TensorImpl27throw_data_ptr_access_errorEv

Maybe somebody else has encountered this.

Thanks a lot,

Fra

@FrancescoSaverioZuppichini
Author

The error also persists on a small, trivial test:

import torch
from torch import nn
from flash_attn import flash_attn_qkvpacked_func

# packed QKV tensor: (batch, seqlen, 3, nheads, headdim), fp16 on the GPU
qkv = torch.randn(1, 196, 3, 8, 64, dtype=torch.float16, device="cuda")

res = flash_attn_qkvpacked_func(qkv)
print(res.shape)
print(res)
Traceback (most recent call last):
  File "/root/main.py", line 3, in <module>
    from flash_attn import flash_attn_qkvpacked_func
  File "/usr/local/lib/python3.10/dist-packages/flash_attn/__init__.py", line 3, in <module>
    from flash_attn.flash_attn_interface import flash_attn_func
  File "/usr/local/lib/python3.10/dist-packages/flash_attn/flash_attn_interface.py", line 4, in <module>
    import flash_attn_2_cuda as flash_attn_cuda
ImportError: /usr/local/lib/python3.10/dist-packages/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZNK3c1010TensorImpl27throw_data_ptr_access_errorEv

@FrancescoSaverioZuppichini
Author

I think you compiled with 2.0.7 and I am using torch 2.1.0

@FrancescoSaverioZuppichini
Author

Using

FROM nvcr.io/nvidia/pytorch:23.06-py3

so Python 3.10, CUDA 12.1 and torch 2.1 (see the NVIDIA docs), with the matching wheel (https://github.com/Dao-AILab/flash-attention/releases/download/v2.0.7/flash_attn-2.0.7+cu121torch2.1cxx11abiTRUE-cp310-cp310-linux_x86_64.whl), still results in the same error:

import flash_attn_2_cuda as flash_attn_cuda

ImportError: /usr/local/lib/python3.10/dist-packages/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZNK3c1010TensorImpl27throw_data_ptr_access_errorEv
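
In case it helps with debugging: a quick way to double-check which wheel tags a given container actually needs is to print the build info of the installed torch. A minimal sketch using standard torch attributes, nothing flash-attn specific:

import sys
import torch

# The wheel filename encodes Python, CUDA, torch and C++ ABI, e.g.
# flash_attn-2.0.7+cu121torch2.1cxx11abiTRUE-cp310-cp310-linux_x86_64.whl
print("python :", sys.version_info[:2])             # cp310  -> (3, 10)
print("torch  :", torch.__version__)                # torch2.1 -> 2.1.x
print("cuda   :", torch.version.cuda)               # cu121  -> 12.1
print("cxx11  :", torch._C._GLIBCXX_USE_CXX11_ABI)  # cxx11abiTRUE / FALSE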

@FrancescoSaverioZuppichini changed the title from "Tests don't work" to "Flash Attention 2.0 doesn't work: undefined symbol: _ZNK3c1010TensorImpl27throw_data_ptr_access_errorEv" Aug 15, 2023
@tridao
Contributor

tridao commented Aug 15, 2023

Thanks for the report. I saw this exact error on nvcr 23.06 as well. nvcr 23.07 should work, can you try?
The error is due to the PyTorch interface changing between the versions used in 23.06 and 23.07.

@FrancescoSaverioZuppichini
Author


Thanks a lot, I will try as soon as I'm back at my PC. Out of curiosity, could you give me more details about the PyTorch interface change between the versions used in 23.06 and 23.07?

@tridao
Contributor

tridao commented Aug 15, 2023

Oh, it's a low-level change in error handling. PyTorch added this "throw_data_ptr_access_error" function on May 11. nvcr 23.06 uses a PyTorch build from May 2, while nvcr 23.07 uses a PyTorch build from June 7.
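
If you want to confirm this locally, a rough check (assuming a Linux install where the c10 symbols ship in libc10.so and binutils' nm is on PATH) is to look for the symbol in the torch libraries:

import os
import subprocess
import torch

# Look for the symbol the prebuilt flash-attn wheel expects inside torch's libc10.so.
# False means the installed torch predates the change and the prebuilt wheel won't load.
libc10 = os.path.join(os.path.dirname(torch.__file__), "lib", "libc10.so")
symbols = subprocess.run(["nm", "-D", libc10], capture_output=True, text=True).stdout
print("_ZNK3c1010TensorImpl27throw_data_ptr_access_errorEv" in symbols)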

@janelu9

janelu9 commented Aug 17, 2023

Try a cxx11abiFALSE version

@junjun3518

Hi @tridao! I have some trouble with Python 3.10, so I need to use nvcr 23.04.
Is there any suggestion for handling this import error with an older image version?

@tridao
Contributor

tridao commented Aug 25, 2023

You can compile from source with FLASH_ATTENTION_FORCE_BUILD=TRUE:
FLASH_ATTENTION_FORCE_BUILD=TRUE pip install flash-attn.
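
After the source build finishes, a minimal sanity check is simply to repeat the import that was failing above, for example:

# Re-do the failing import; if the extension was built against the local torch,
# this prints versions instead of raising the undefined-symbol ImportError.
import torch
import flash_attn
import flash_attn_2_cuda  # the compiled extension from the traceback

print("torch     :", torch.__version__)
print("flash-attn:", flash_attn.__version__)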

@junjun3518

Thank you for the fast response! I will try it.

@junjun3518

Thank you! It seems to be working now.

@nghiadt22

Thanks for the report. I saw this exact error on nvcr 23.06 as well. nvcr 23.07 should work, can you try? The error is due to the PyTorch interface changing between the versions used in 23.06 and 23.07.

Hi @tridao, what is nvcr and how do I change its version?

@Niyathi3011

import flash_attn_2_cuda as flash_attn_cuda
ImportError: /home/nallu002/anaconda3/envs/myenv/lib/python3.9/site-packages/flash_attn_2_cuda.cpython-39-x86_64-linux-gnu.so: undefined symbol: _ZNK3c106SymIntltEl

I am still facing this error. I changed the torch version from 2.1.2 to 2.1.0, and it is still not working.

@ghost

ghost commented Feb 1, 2024

Also facing this error, in Databricks.
Torch Version: 2.1.0+cu121
torch.cuda.get_device_capability(): (7,0)
Compiled from source but that did not work for me.

ImportError: /databricks/python/lib/python3.10/site-packages/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZNK3c106SymIntltEl

@sleeper1023

Try a cxx11abiFALSE version

You can solve the problem this way: use pip to install the wheel that matches your setup directly.
I use Python 3.10, CUDA 11.8, torch 2.0, cxx11abiFALSE.

https://github.com/Dao-AILab/flash-attention/releases/download/v2.5.2/flash_attn-2.5.2+cu118torch2.0cxx11abiFALSE-cp310-cp310-linux_x86_64.whl

@jaanli

jaanli commented Feb 15, 2024

Also facing this.

@LeonEricsson

Also facing this issue with nvcr 24.01-py3

@tridao
Contributor

tridao commented Feb 27, 2024

For nvcr 23.12 and 24.01 please use flash-attn 2.5.1.post1

@LeonEricsson

For nvcr 23.12 and 24.01 please use flash-attn 2.5.1.post1

'preciate the immediate response 🙌🏼

@caoyang-sufe

Try a cxx11abiFALSE version

Really helpful!

@kanseaveg

Try a cxx11abiFALSE version
You can solve the problem this way: use pip to install the wheel that matches your setup directly.
I use Python 3.10, CUDA 11.8, torch 2.0, cxx11abiFALSE.

https://github.com/Dao-AILab/flash-attention/releases/download/v2.5.2/flash_attn-2.5.2+cu118torch2.0cxx11abiFALSE-cp310-cp310-linux_x86_64.whl

Thank you, that works for me.

@ponyzym

ponyzym commented Mar 23, 2024

Thanks for the report. I saw this exact error on nvcr 23.06 as well. nvcr 23.07 should work, can you try? The error is due to the PyTorch interface changing between the versions used in 23.06 and 23.07.

Hello, how can I use the nvcr 23.07 version? Any details?

@yiyepiaoling0715

You can compile from source with FLASH_ATTENTION_FORCE_BUILD=TRUE: FLASH_ATTENTION_FORCE_BUILD=TRUE pip install flash-attn.

Is this generally useful across different nvcr versions?
I am using pytorch/pytorch:2.2.2-cuda12.1-cudnn8-devel (previously nvcr.io/nvidia/pytorch:22.07-py3 and pytorch/pytorch:2.0.1-cuda11.7-cudnn8-devel).

@modelsplaid

I got a similar error. I think it is caused by the CUDA version. I added:
export CUDA_HOME=/usr/local/cuda
export PATH="/usr/local/cuda/bin${PATH:+:${PATH}}"
export LD_LIBRARY_PATH="/usr/local/cuda/lib64${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}"

to the end of ~/.bashrc, ran source ~/.bashrc, then reinstalled PyTorch and flash-attention.
It works.
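
If it helps anyone following this approach, you can check which CUDA toolkit torch's extension builder will pick up after editing ~/.bashrc. A small sketch using standard torch utilities:

# Compare the CUDA toolkit used for source builds (CUDA_HOME) with the CUDA
# version the installed torch binary itself was built against.
import torch
from torch.utils import cpp_extension

print("torch built with CUDA:", torch.version.cuda)
print("CUDA_HOME for builds :", cpp_extension.CUDA_HOME)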

@Laz4rz

Laz4rz commented Jul 1, 2024

Doing this:

Try a cxx11abiFALSE version
You can solve the problem this way: use pip to install the wheel that matches your setup directly.
I use Python 3.10, CUDA 11.8, torch 2.0, cxx11abiFALSE.

https://github.com/Dao-AILab/flash-attention/releases/download/v2.5.2/flash_attn-2.5.2+cu118torch2.0cxx11abiFALSE-cp310-cp310-linux_x86_64.whl

and also forcing:

pip install xformers==v0.0.22

Fixed the issue for me.

UPDATE: it didn't; more problems came up down the line with missing torch operations.
