metal : simplify kernel arguments using a struct #3229

ggerganov · 2023-09-17T17:10:35Z

Create a struct ggml_metal_locals and populate using GGML_TENSOR_LOCALS similar to what we do in ggml.c:

https://github.com/ggerganov/llama.cpp/blob/3b4bab6a38502d9e68587c2c19f26472480ec4dd/ggml.c#L244-L256

Refactor all kernels to accept a single struct of ggml_metal_locals in order to avoid long lists of arguments such as:

https://github.com/ggerganov/llama.cpp/blob/3b4bab6a38502d9e68587c2c19f26472480ec4dd/ggml-metal.m#L753-L782

https://github.com/ggerganov/llama.cpp/blob/3b4bab6a38502d9e68587c2c19f26472480ec4dd/ggml-metal.metal#L29-L61

The text was updated successfully, but these errors were encountered:

ZacharyDK · 2023-09-19T20:25:19Z

Hey @ggerganov . So I've done some digging. Objective C is a headache, but necessary because Apple makes it a requirement to use metal. Unreal Engine is C++, and they use a C++ wrapper library, avoiding all objective C. This is the library by naleksiev.

https://github.com/naleksiev/mtlpp/blob/master/LICENSE

Refactoring the ggml-metal.m file and relevant files to use this library would have the benefits of cutting out objective C, simplifying the code base, and squashing any bugs related to using objective C. Also would likely fix the numerous kernel loading bugs on Macs with AMD. This change should let Mac users utilize the GPU on whatever, shouldn't make a difference between M1, M2, and AMD

The mtlpp library has been tried and tested with Unreal Engine, so it probably will do the heaving lifting without too much pain.

ggerganov · 2024-05-12T06:52:18Z

✨✨ Here's an AI-assisted sketch of how you might approach this issue saved by @ggerganov using Copilot Workspace v0.17

Topic

Has the ggml_metal_locals struct been created and used to simplify kernel arguments in the metal kernels?

Before

No, the ggml_metal_locals struct has not been created in the code.
The metal kernels still use long lists of arguments as shown in ggml-metal.m and ggml-metal.metal.
There is no evidence in the provided code snippets of a struct being used to simplify kernel arguments for Metal kernels.

After

Yes, the ggml_metal_locals struct has been created and is now used in the Metal kernels.
The struct is defined in ggml-metal.h, simplifying kernel arguments by encapsulating them.
Metal kernels in ggml-metal.metal now accept a single ggml_metal_locals struct as their argument.
The long lists of arguments in Metal kernels are replaced with the ggml_metal_locals struct, enhancing code readability and maintainability.

Plan

ggml-metal.h (CHANGE)
- Define a new struct ggml_metal_locals to encapsulate kernel arguments.
- Include necessary headers and dependencies.
ggml-metal.m (CHANGE)
- Refactor all kernel function calls to use the ggml_metal_locals struct for arguments.
- Update the function signatures to accept ggml_metal_locals instead of individual arguments.
ggml-metal.metal (CHANGE)
- Update kernel function signatures to accept a single ggml_metal_locals struct as their argument.
- Adjust the kernel implementations to access arguments through the ggml_metal_locals struct.

Sketch of implementation

View the changes

Details

Code analyzed at b228aba

ggerganov · 2024-05-12T06:56:33Z

Playing with the tech preview of "Copilot Workspaces": https://copilot-workspace.githubnext.com/ggerganov/llama.cpp/issues/3229?shareId=9c38fc11-f7d8-45b7-b1bc-81678a27a9e0

It does not like big files 😢

BB-fat · 2025-03-05T07:42:53Z

@ggerganov Is help still needed with this issue? If so, I can try.

ggerganov · 2025-03-05T07:45:09Z

Yes. It's pretty straight-forward - just apply the same pattern as in #10238 for the rest of the operators.

BB-fat · 2025-03-05T07:50:47Z

@ggerganov Okay, I'm happy to help, please assign it to me.

ggerganov · 2025-03-05T07:54:32Z

please assign it to me

It's best if you open a draft PR so people can track your progress. Otherwise the experience is that an assigned issue might end up dead because people who want to work on it would think that someone else is already working on it, while they aren't.

BB-fat · 2025-03-05T07:59:47Z

please assign it to me

It's best if you open a draft PR so people can track your progress. Otherwise the experience is that an assigned issue might end up dead because people who want to work on it would think that someone else is already working on it, while they aren't.

Appreciate the suggestion! I'll open a draft PR.

BB-fat · 2025-03-05T10:33:25Z

Hi @ggerganov , I've implemented the struct-based parameter optimization for the im2col kernel. If you have time, could you please review my changes? Assuming everything looks good, I'll continue optimizing the remaining kernel functions in the near future. #12194

* metal : refactor im2col parameters into a struct * metal: Change im2col offset types from int32_t to uint64_t to support larger memory offsets * metal : refactor sum_rows parameters into a struct * metal : refactor soft_max parameters into a struct * metal : refactor diag_mask_inf parameters into a struct * metal : refactor ssm_conv parameters into a struct * metal : refactor ssm_scan parameters into a struct * metal : refactor get_rows parameters into a struct * metal : refactor group_norm parameters into a struct * metal : refactor conv_transpose_1d parameters into a struct * metal : refactor upscale parameters into a struct * metal : refactor pad parameters into a struct * metal : refactor pad_reflect_1d parameters into a struct * metal : refactor arange parameters into a struct * metal : refactor timestep_embedding parameters into a struct * metal : refactor argsort parameters into a struct * metal : refactor leaky_relu parameters into a struct * metal : refactor pool_2d parameters into a struct * metal : fix trailing whitespace --------- Co-authored-by: alexju <alexju@tencent.com>

ggerganov · 2025-03-07T07:40:54Z

Resolved via #12194

…l-org#12194) * metal : refactor im2col parameters into a struct * metal: Change im2col offset types from int32_t to uint64_t to support larger memory offsets * metal : refactor sum_rows parameters into a struct * metal : refactor soft_max parameters into a struct * metal : refactor diag_mask_inf parameters into a struct * metal : refactor ssm_conv parameters into a struct * metal : refactor ssm_scan parameters into a struct * metal : refactor get_rows parameters into a struct * metal : refactor group_norm parameters into a struct * metal : refactor conv_transpose_1d parameters into a struct * metal : refactor upscale parameters into a struct * metal : refactor pad parameters into a struct * metal : refactor pad_reflect_1d parameters into a struct * metal : refactor arange parameters into a struct * metal : refactor timestep_embedding parameters into a struct * metal : refactor argsort parameters into a struct * metal : refactor leaky_relu parameters into a struct * metal : refactor pool_2d parameters into a struct * metal : fix trailing whitespace --------- Co-authored-by: alexju <alexju@tencent.com>

ggerganov added good first issue Good for newcomers refactoring Refactoring labels Sep 17, 2023

ggerganov added this to ggml : roadmap Oct 18, 2023

ggerganov moved this to Todo in ggml : roadmap Oct 18, 2023

ggerganov mentioned this issue Nov 12, 2023

whisper : try to fix the parallel whisper_state functionality ggml-org/whisper.cpp#1479

Merged

3 tasks

jmousseau mentioned this issue Jan 10, 2024

metal : wrap each operation in debug group ggml-org/ggml#690

Merged

ggerganov added this to github : copilot workspace May 15, 2024

ggerganov mentioned this issue Nov 9, 2024

metal : refactor kernel args into structs #10238

Merged

12 tasks

ggerganov self-assigned this Nov 10, 2024

ggerganov moved this from Todo to In Progress in ggml : roadmap Nov 10, 2024

ggerganov added the roadmap Part of a roadmap project label Feb 4, 2025

BB-fat mentioned this issue Mar 5, 2025

metal : simplify kernel arguments using a struct (#3229) #12194

Merged

danbev mentioned this issue Mar 6, 2025

Misc. bug: llama.swiftui simulator error #12219

Closed

ggerganov closed this as completed Mar 7, 2025

github-project-automation bot moved this to Done in github : copilot workspace Mar 7, 2025

ggerganov moved this from In Progress to Done in ggml : roadmap Mar 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

metal : simplify kernel arguments using a struct #3229

metal : simplify kernel arguments using a struct #3229

ggerganov commented Sep 17, 2023 •

edited

Loading

ZacharyDK commented Sep 19, 2023

ggerganov commented May 12, 2024

ggerganov commented May 12, 2024

BB-fat commented Mar 5, 2025

ggerganov commented Mar 5, 2025

BB-fat commented Mar 5, 2025

ggerganov commented Mar 5, 2025

BB-fat commented Mar 5, 2025

BB-fat commented Mar 5, 2025

ggerganov commented Mar 7, 2025

metal : simplify kernel arguments using a struct #3229

metal : simplify kernel arguments using a struct #3229

Comments

ggerganov commented Sep 17, 2023 • edited Loading

ZacharyDK commented Sep 19, 2023

ggerganov commented May 12, 2024

ggerganov commented May 12, 2024

BB-fat commented Mar 5, 2025

ggerganov commented Mar 5, 2025

BB-fat commented Mar 5, 2025

ggerganov commented Mar 5, 2025

BB-fat commented Mar 5, 2025

BB-fat commented Mar 5, 2025

ggerganov commented Mar 7, 2025

ggerganov commented Sep 17, 2023 •

edited

Loading