fix: Cherrypick fixed context length, and openmp dependency #1332

grahamking · 2025-06-02T20:44:12Z

- Add Granite to our tokenizer - Fix pre-processor to load context length correctly - Add strftime_now Jinja function for prompt templates - Update llama.cpp - Handle trtllm errors when not using trtllm Support depends on the engine: - `mistral.rs`, our default engine, doesn't support Granite yet. - `llama.cpp` does and works very well: ``` dynamo-run out=llamacpp ~/llms/granite-3.3-2b-instruct-Q4_K_M.gguf --context-length 16384 ``` - `vllm` also works very well: ``` dynamo-run in=http out=vllm ~/llms/granite-3.3-2b-instruct --context-length 16384 ``` - `sglang` mostly works, but it doesn't catch the stop token, so we do in the HTTP ingress, and log an error. The Text ingress doesn't catch it because I disabled it to make the raw echo engine work. A bit of work to do here. Closes: #1245

Do not include by default as it needs libgomp1 at runtime. Add a feature to enable it at build time.

grahamking added 2 commits June 2, 2025 16:42

feat: Make llama.cpp Gnu OpenMP dependency optional (#1331)

93ca7b0

Do not include by default as it needs libgomp1 at runtime. Add a feature to enable it at build time.

grahamking requested review from a team, GuanLuo, PeaBrane, alec-flowers, biswapanda, ishandhanani, jthomson04, kkranen, nnshah1, oandreeva-nv, paulhendricks, piotrm-nvidia, ptarasiewiczNV, rmccorm4, ryanolson, tanmayv25, tedzhouhk and tmonty12 as code owners June 2, 2025 20:44

pull-request-size bot added the size/M label Jun 2, 2025

copy-pr-bot bot temporarily deployed to GITLAB June 2, 2025 20:44 Inactive

grahamking changed the title ~~Cherrypick fixed context length, and openmp dependency~~ fix: Cherrypick fixed context length, and openmp dependency Jun 2, 2025

github-actions bot added the fix label Jun 2, 2025

grahamking enabled auto-merge (squash) June 2, 2025 20:52

Merge branch 'release/0.3.0' into gk-cp-1

26655c5

copy-pr-bot bot temporarily deployed to GITLAB June 3, 2025 04:37 Inactive

nv-anants approved these changes Jun 3, 2025

View reviewed changes

nv-anants disabled auto-merge June 3, 2025 13:06

nv-anants merged commit 63ef24f into release/0.3.0 Jun 3, 2025
15 checks passed

nv-anants deleted the gk-cp-1 branch June 3, 2025 13:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: Cherrypick fixed context length, and openmp dependency #1332

fix: Cherrypick fixed context length, and openmp dependency #1332

Uh oh!

grahamking commented Jun 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: Cherrypick fixed context length, and openmp dependency #1332

fix: Cherrypick fixed context length, and openmp dependency #1332

Uh oh!

Conversation

grahamking commented Jun 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants