
Conversation

@nicoboss (Contributor)
This fixes #9044.

Sets `ggml_sched_max_splits` equal to `graph_size`, as recommended by @slaren in #9044 (comment), since there is at most one split per node in the graph.

Thanks to this change I was able to run GPU-accelerated inference on BigLlama-3.1-681B-Instruct, which previously caused llama.cpp to crash.

@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Aug 15, 2024
@slaren slaren merged commit e3f6fd5 into ggml-org:master Aug 16, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
* ggml : Dynamic ggml_sched_max_splits based on graph_size

* Fixed and readded debug code for causes
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024

* ggml : Dynamic ggml_sched_max_splits based on graph_size

* Fixed and readded debug code for causes

Labels

ggml changes relating to the ggml tensor library for machine learning


Development

Successfully merging this pull request may close these issues.

Bug: GGML_SCHED_MAX_SPLITS must be increased to run BigLlama-3.1-681B-Instruct using GPU acceleration
