CLBlast: Fix handling of on-device tensor data #3447

shibe2 · 2023-10-02T20:11:42Z

Fixes #3307 and related issues.

Uploading was tested by reading data back from VRAM (for which there is no API). It now works properly in all cases.

Matrix multiplication works as well when src0 is already in VRAM. It allowed me to test broadcasting (#3402) in more configurations.

Special code for matrix-vector multiplication remains broken. I intend to disable it in a separate request. But I can add it here too.

ggml_cl_mul (non-matrix multiplication) appears to have a problem with offsets too, but it is broken for other reasons as well, and I didn't touch it here.

I used nullptr instead of NULL in some cases. Is it okay?

Fix uploading tensor data to device, including 3D, 4D, and non-contiguous tensors. Use correct offsets into data that is already in VRAM. Correct handling of OpenCL events when multiple commands are queued.

CLBlast: Fix handling of on-device tensor data

7d9e3ca

Fix uploading tensor data to device, including 3D, 4D, and non-contiguous tensors. Use correct offsets into data that is already in VRAM. Correct handling of OpenCL events when multiple commands are queued.

shibe2 force-pushed the cl-h2d branch from f58ebcb to 7d9e3ca Compare October 5, 2023 12:07

shibe2 changed the title ~~CLBlast: Fix uploading tensor data to device~~ CLBlast: Fix handling of on-device tensor data Oct 5, 2023

shibe2 marked this pull request as ready for review October 5, 2023 12:18

ggerganov approved these changes Oct 5, 2023

View reviewed changes

shibe2 merged commit e2583cb into ggerganov:master Oct 5, 2023
27 checks passed

shibe2 mentioned this pull request Oct 8, 2023

CLBlast: Fix matrix-vector multiplication #3544

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLBlast: Fix handling of on-device tensor data #3447

CLBlast: Fix handling of on-device tensor data #3447

shibe2 commented Oct 2, 2023 •

edited

Loading

CLBlast: Fix handling of on-device tensor data #3447

CLBlast: Fix handling of on-device tensor data #3447

Conversation

shibe2 commented Oct 2, 2023 • edited Loading

shibe2 commented Oct 2, 2023 •

edited

Loading