
CUDA: fix LoRAs #3130

Merged 1 commit into ggerganov:master on Sep 12, 2023
Conversation

JohannesGaessler (Collaborator)

As pointed out in #3110 (comment), the CUDA code for LoRAs was broken by #3110. This PR fixes it.

Review thread on ggml-cuda.cu (resolved)
JohannesGaessler (Collaborator, Author)

Previously, ggml_cpy_tensor_2d was not called for LoRAs. As it turns out, the logic for src0_on_device was incorrect, so the tensor was being copied unnecessarily. I'm keeping the extended logic for ggml_cpy_tensor_2d; a sketch of the idea follows below.
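A minimal sketch of the copy-skipping idea described above, not the actual ggml-cuda.cu code: the source tensor should only be copied to the device when it is not already resident there, and the reported bug was that this on-device check was wrong, triggering an unnecessary copy. The names `backend_type`, `copy_tensor_to_device`, and `use_tensor` are hypothetical stand-ins, not ggml APIs.

```cpp
#include <cstdio>

// Hypothetical stand-in for where a tensor's data currently lives.
enum backend_type { BACKEND_CPU, BACKEND_GPU };

struct tensor {
    backend_type backend;
    const char * name;
};

// Hypothetical helper standing in for the host->device copy path
// (the role played by ggml_cpy_tensor_2d in the PR discussion).
static void copy_tensor_to_device(const tensor & t) {
    std::printf("copying %s host -> device\n", t.name);
}

static void use_tensor(const tensor & src0) {
    // The fix amounts to getting this condition right: if src0 already
    // lives on the GPU, skip the host->device copy instead of copying
    // it unnecessarily.
    const bool src0_on_device = src0.backend == BACKEND_GPU;
    if (!src0_on_device) {
        copy_tensor_to_device(src0);
    }
    // ... launch kernels that read the device copy of src0 ...
}

int main() {
    tensor lora_a = {BACKEND_GPU, "lora_a"}; // already resident on the GPU
    tensor weight = {BACKEND_CPU, "weight"}; // still on the host
    use_tensor(lora_a); // no copy expected
    use_tensor(weight); // copy expected
    return 0;
}
```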

JohannesGaessler merged commit 4f7cd6b into ggerganov:master on Sep 12, 2023
28 checks passed
pkrmf pushed a commit to morlockstudios-com/llama.cpp that referenced this pull request on Sep 26, 2023