Skip to content

Comments

metal : adjust extra size for FA buffer to avoid reallocations#18545

Merged
ggerganov merged 1 commit intomasterfrom
gg/metal-adjust-fa-extra-size
Jan 2, 2026
Merged

metal : adjust extra size for FA buffer to avoid reallocations#18545
ggerganov merged 1 commit intomasterfrom
gg/metal-adjust-fa-extra-size

Conversation

@ggerganov
Copy link
Member

@ggerganov ggerganov commented Jan 2, 2026

cont #17143
rel #17617

Pad using the worst case size between the vec and non-vec buffers. Prevents unwanted graph reallocations in some cases.

@github-actions github-actions bot added ggml changes relating to the ggml tensor library for machine learning Apple Metal https://en.wikipedia.org/wiki/Metal_(API) labels Jan 2, 2026
@ggerganov ggerganov merged commit f38de16 into master Jan 2, 2026
70 of 71 checks passed
@ggerganov ggerganov deleted the gg/metal-adjust-fa-extra-size branch January 2, 2026 17:02
blime4 pushed a commit to blime4/llama.cpp that referenced this pull request Feb 5, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant