
Illegal instruction (core dumped) when trying to load model #839


Open
R-Yordanov-AltScale opened this issue Oct 23, 2023 · 4 comments
Labels
bug Something isn't working

Comments

@R-Yordanov-AltScale

Prerequisites

Please answer the following questions for yourself before submitting an issue.

  • [x] I am running the latest code. Development is very rapid so there are no tagged versions as of now.
  • [x] I carefully followed the README.md.
  • [x] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • [x] I reviewed the Discussions, and have a new bug or useful enhancement to share.

Expected Behavior

For the model to load successfully.

Please provide a detailed written description of what you were trying to do, and what you expected llama-cpp-python to do.

Current Behavior

When I try to load the model with llm = Llama(model_path="./llama.cpp/models/llama-2-7b-chat.Q5_K_M.gguf")
it crashes with: Illegal instruction (core dumped)

This is from my syslog:
kernel: [1728595.660950] traps: python3[213941] trap invalid opcode ip:7f4aa44a4e94 sp:7ffceec92e60 error:0 in libllama.so[7f4aa448a000+9f000]
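
For completeness, here is a minimal script that reproduces the crash (a sketch; the model path is the same one as above and the prompt is only illustrative):

from llama_cpp import Llama

# The process dies with SIGILL inside this constructor, before any generation happens.
llm = Llama(model_path="./llama.cpp/models/llama-2-7b-chat.Q5_K_M.gguf", verbose=True)

# Never reached on this machine.
print(llm("Hello", max_tokens=8))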

Environment and Context

lscpu

Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 40 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 8
On-line CPU(s) list: 0-7
Vendor ID: AuthenticAMD
Model name: AMD Opteron 63xx class CPU
CPU family: 21
Model: 2
Thread(s) per core: 1
Core(s) per socket: 1
Socket(s): 8
Stepping: 0
BogoMIPS: 5200.00
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 syscall nx pdpe1gb rdtscp lm rep_good nopl cpuid extd_apicid tsc_known_freq pni pclmulqdq ssse3 fma cx16 sse4_1 sse4_2 x2apic popcnt aes xsave avx f16c hypervisor lahf_lm svm abm sse4a misalignsse 3dnowprefetch xop fma4 tbm vmmcall arat npt nrip_save
Virtualization features:
Virtualization: AMD-V
Hypervisor vendor: KVM
Virtualization type: full
Caches (sum of all):
L1d: 512 KiB (8 instances)
L1i: 512 KiB (8 instances)
L2: 4 MiB (8 instances)
L3: 128 MiB (8 instances)
NUMA:
NUMA node(s): 1
NUMA node0 CPU(s): 0-7

It is a virtual machine running Ubuntu 22.04.

$ uname -a
Linux trying-to-train-llama2 5.15.0-46-generic #49-Ubuntu SMP Thu Aug 4 18:03:25 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux

  • SDK version, e.g. for Linux:
$ python3 --version
Python 3.10.12

$ make --version
GNU Make 4.3

$ g++ --version
g++ (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0


m-from-space commented Oct 25, 2023

Your CPU is quite old and doesn't support certain instruction sets such as AVX2 (you can see it is missing from the "Flags" list in your lscpu output). Try reinstalling with:

CMAKE_ARGS="-DLLAMA_AVX2=OFF" FORCE_CMAKE=1 pip install llama-cpp-python --no-cache-dir --force-reinstall


fgeo23 commented Oct 26, 2023

Same issue here.

After doing some digging, it turns out that CMAKE_ARGS are not being passed to the pip install command. I'm still trying to figure out why.
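
One thing worth trying (a sketch, not a confirmed fix): export the variables in the same shell instead of prefixing the command, and run pip with --verbose so you can see whether the CMake flag actually shows up in the build log:

export CMAKE_ARGS="-DLLAMA_AVX2=OFF"
export FORCE_CMAKE=1
pip install llama-cpp-python --no-cache-dir --force-reinstall --verbose
# The CMake command printed in the verbose output should contain -DLLAMA_AVX2=OFF.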

antoine-lizee pushed a commit to antoine-lizee/llama-cpp-python that referenced this issue Oct 30, 2023
@abetlen added the bug label Nov 8, 2023
@dimaioksha

@fgeo23 have you found out the reason and a solution?

@dimaioksha

Hello everyone.
I've solved this issue by upgrading nvidia-cuda-toolkit from 11.6 to 11.8 (the latest llama-cpp-python==0.2.29 works well).
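
A reinstall along these lines should make the wheel pick up the new toolkit (a sketch; -DLLAMA_CUBLAS=on was the CUDA switch for llama-cpp-python 0.2.x builds, adjust if your version differs):

CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 pip install llama-cpp-python==0.2.29 --no-cache-dir --force-reinstall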
