Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

error when run finetune.py #294

Closed
mazzzystar opened this issue Apr 8, 2023 · 3 comments
Closed

error when run finetune.py #294

mazzzystar opened this issue Apr 8, 2023 · 3 comments

Comments

@mazzzystar
Copy link

mazzzystar commented Apr 8, 2023

When I run python finetune.py, it will first shows error like this:

  File "/home/ubuntu/miniconda3/envs/lora/lib/python3.10/site-packages/bitsandbytes/functional.py", line 1505, in get_colrow_absmax
    lib.cget_col_row_stats(ptrA, ptrRowStats, ptrColStats, ptrNnzrows, ct.c_float(threshold), rows, cols)
  File "/home/ubuntu/miniconda3/envs/lora/lib/python3.10/ctypes/__init__.py", line 387, in __getattr__
    func = self.__getitem__(name)
  File "/home/ubuntu/miniconda3/envs/lora/lib/python3.10/ctypes/__init__.py", line 392, in __getitem__
    func = self._FuncPtr((name_or_ordinal, self))
AttributeError: /home/ubuntu/miniconda3/envs/lora/lib/python3.10/site-packages/bitsandbytes/libbitsandbytes_cpu.so: undefined symbol: cget_col_row_stats

Then I tried as this: bitsandbytes-foundation/bitsandbytes#156 (comment)

The error then become:

Loading checkpoint shards:   0%|                                                                                              | 0/33 [00:00<?, ?it/s]
Error named symbol not found at line 508 in file /mmfs1/gscratch/zlab/timdettmers/git/bitsandbytes/csrc/ops.cu

I'm using Python 3.10 + CUDA 11.7.

@ParaNoth
Copy link

ParaNoth commented Apr 10, 2023

Which architecture graphics card are you using? If your graphics card's architecture is not Turing or Ampere, you should use libbitsandbytes_cuda{your_cuda_version}_nocublaslt.so instead of libbitsandbytes_cuda{your_cuda_version}.so

It means you should use this command:

cp libbitsandbytes_cuda117_nocublaslt.so libbitsandbytes_cpu.so

@mazzzystar
Copy link
Author

@ParaNoth Thank you very much!

My graphics is NVIDIA V100, I figured out it was Volta architecture. I've changed to libbitsandbytes_cuda{cuda_version}_nocublaslt.so and it works!

image

@herobrine19
Copy link

It works same to me! Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants