Skip to content

Conversation

gabe-l-hart
Copy link
Collaborator

@gabe-l-hart gabe-l-hart commented Sep 22, 2025

This is a configuration of the hparams in the GraniteHybrid architecture that devolves to the Granite (or GraniteMoe) architecture (ie Granite 3.x). It may be used for some models in the Granite 4 family with the GraniteHybrid architecture acting as a superset arch. Rather than support it directly in the c++ graph, we simply coerce the architecture flag back to the correct "granite" or "granitemoe" architecture.

@gabe-l-hart gabe-l-hart requested a review from CISC as a code owner September 22, 2025 17:27
@github-actions github-actions bot added the python python script changes label Sep 22, 2025
@gabe-l-hart gabe-l-hart force-pushed the GraniteNonHybridConversion branch from 3584895 to bb98b31 Compare September 22, 2025 17:29
@gabe-l-hart
Copy link
Collaborator Author

@CISC thanks for catching my slop. Too much multi-tasking!

This is a configuration of the hparams in the GraniteHybrid architecture
that devolves to the Granite (or GraniteMoe) architecture (ie Granite 3.x).
It may be used for some models in the Granite 4 family with the
GraniteHybrid architecture acting as a superset arch. Rather than support
it directly in the c++ graph, we simply coerce the architecture flag back
to the correct "granite" or "granitemoe" architecture.

Branch: gabe-l-hart/GraniteNonHybridConversion

Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
@gabe-l-hart gabe-l-hart force-pushed the GraniteNonHybridConversion branch from b7f9b8c to 2668925 Compare September 22, 2025 18:24
@gabe-l-hart
Copy link
Collaborator Author

Rebased, squashed, and added you as co-author for your troubles. Much appreciated.

@CISC
Copy link
Collaborator

CISC commented Sep 22, 2025

Will merge once you've verified conversion works as expected (unless you have access to merge yourself, not sure).

@gabe-l-hart
Copy link
Collaborator Author

Confirmed, conversion works. I seem to have lost the merge button some time over the weekend, so not sure the cause there.

@CISC CISC merged commit 1d0125b into ggml-org:master Sep 22, 2025
7 checks passed
@CISC
Copy link
Collaborator

CISC commented Sep 22, 2025

Confirmed, conversion works. I seem to have lost the merge button some time over the weekend, so not sure the cause there.

See #16113

@gabe-l-hart
Copy link
Collaborator Author

Ah, makes complete sense. Thanks for the pointer!

@gabe-l-hart gabe-l-hart deleted the GraniteNonHybridConversion branch September 22, 2025 19:31
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Sep 23, 2025
* origin/master: (39 commits)
ci : disable AMD workflows + update NVIDIA workflows (ggml-org#16200)
ci : enable Vulkan workflow on Mac (ggml-org#16194)
ggml-cpu: Respect cpumask settings (ggml-org#16164)
ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (ggml-org#15928)
zdnn: refactor codebase + add docs (ggml-org#16178)
codeowners : add @danbev to model-conversion example [no ci] (ggml-org#16190)
devops: add s390x containers (ggml-org#15915)
ggml-cpu : fix typo in gemm comments [no ci] (ggml-org#16189)
feat: Add conversion support in GraniteHybrid for non-hybrid (all attn) (ggml-org#16177)
clang-tidy : disable warning about performance enum size (ggml-org#16127)
ggml : implement set_rows with i32 index (ggml-org#16159)
codeowners : update + cleanup (ggml-org#16174)
common : enable `--offline` mode without curl support (ggml-org#16137)
webui : fix handling incomplete chunks (ggml-org#16107)
embedding : fix typos in README (ggml-org#16171)
common : remove unused local variables (ggml-org#16140)
ggml : extend ggml_can_fuse to work with non-sequential nodes (ggml-org#16123)
ggml : add ggml_op_is_empty (ggml-org#16122)
codeowners : update ownership for @ngxson and @allozuar (ggml-org#16128)
Vulkan: add conv_transpose_2d operation (ggml-org#16022)
...
struct pushed a commit to struct/llama.cpp that referenced this pull request Sep 26, 2025
…n) (ggml-org#16177)

This is a configuration of the hparams in the GraniteHybrid architecture
that devolves to the Granite (or GraniteMoe) architecture (ie Granite 3.x).
It may be used for some models in the Granite 4 family with the
GraniteHybrid architecture acting as a superset arch. Rather than support
it directly in the c++ graph, we simply coerce the architecture flag back
to the correct "granite" or "granitemoe" architecture.

Branch: gabe-l-hart/GraniteNonHybridConversion

Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

python python script changes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants