docs: update function-calling.md w/ template override needed by functionary-small-v3.2 #12214

ochafik · 2025-03-06T01:05:11Z

Fixes misleading command in doc revealed by #12213 (cc/ @edmcman): bartowski/functionary-small-v3.2-GGUF needs a template override.

Also, the doc now uses the pre-downloaded templates under models/templates (used by test_tool_call.py) to simplify the commands, and adds a blurb about the Python script for fetching more templates.
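
For anyone launching the server from a wrapper, the override described above can be sketched in Python. This is a minimal illustration, not the command from the doc itself: it assumes `llama-server` is on `PATH`, that it runs from a llama.cpp checkout, and that the matching template file is `models/templates/meetkai-functionary-medium-v3.2.jinja` (the file name is an assumption; the PR text does not name it). `build_server_command` is a hypothetical helper.

```python
import shlex
from pathlib import PurePosixPath
from typing import List, Optional

# Assumption: template file name and location inside a llama.cpp checkout.
FUNCTIONARY_TEMPLATE = PurePosixPath(
    "models/templates/meetkai-functionary-medium-v3.2.jinja"
)


def build_server_command(
    hf_repo: str, template: Optional[PurePosixPath] = None
) -> List[str]:
    """Build a llama-server argv, adding a chat template override if given."""
    # --jinja enables Jinja chat templates; -hf pulls the model from Hugging Face.
    argv = ["llama-server", "--jinja", "-hf", hf_repo]
    if template is not None:
        # Use this template file instead of the one embedded in the GGUF.
        argv += ["--chat-template-file", str(template)]
    return argv


cmd = build_server_command(
    "bartowski/functionary-small-v3.2-GGUF", FUNCTIONARY_TEMPLATE
)
print(shlex.join(cmd))  # pass `cmd` to subprocess.run(...) to actually launch
```

Models whose GGUF already carries a good template can be launched with `template=None`, which leaves the built-in template in place.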

pepijndevos · 2025-03-14T20:37:34Z

Is there any plan to automatically provide the right templates if they are shipped with llama.cpp anyway? I might just apply them in my wrapper but just checking.

ochafik · 2025-03-16T17:58:42Z

> Is there any plan to automatically provide the right templates if they are shipped with llama.cpp anyway? I might just apply them in my wrapper but just checking.

I think there are multiple options here:

- (Ideally) Reach out to each maintainer of a GGUF with bad or out-of-date templates, and have them fix them.
- Detect and explode on "bad" templates (maybe advising on using the GGUF editor to fix the template?)
- Add a simple way to specify which HF repo to pull a template from (-hft / --hf-template flag? add the original HF repo as a key in the GGUF?)
- Bundle a collection of "good" templates and use a mix of predicates on the model metadata to pick the right template (ignoring the template built into the GGUF)

I reckon we could add a flag that switches between the behaviours of the last three bullets.

cc/ @ngxson @ggerganov WDYT?

ngxson · 2025-03-16T22:35:29Z

If known broken templates are rare (as with functionary, which was one of the first models to support function calling, so it was quite messy), then I don't want to spend too much effort fixing this.

The easiest solution is to simply bundle a collection of good templates, as you said in your last point. I believe we will end up with a list of around 10 of them. Newer models should not need this, because they should already have a good built-in template.
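
The bundling idea could look roughly like the sketch below, which is the kind of thing a wrapper could apply on its own today. Everything here is illustrative: `BUNDLED_TEMPLATES`, `pick_template`, the matching rule on the `general.name` metadata key, and the template path are assumptions, not an existing llama.cpp feature.

```python
from typing import Callable, List, Mapping, Optional, Tuple

Metadata = Mapping[str, str]
Predicate = Callable[[Metadata], bool]


def name_contains(*needles: str) -> Predicate:
    """Match when every needle appears in the model's general.name (case-insensitive)."""
    def predicate(meta: Metadata) -> bool:
        name = meta.get("general.name", "").lower()
        return all(needle.lower() in name for needle in needles)
    return predicate


# Hypothetical table of (predicate, bundled template path); first match wins.
BUNDLED_TEMPLATES: List[Tuple[Predicate, str]] = [
    (
        name_contains("functionary", "v3.2"),
        "models/templates/meetkai-functionary-medium-v3.2.jinja",
    ),
]


def pick_template(meta: Metadata) -> Optional[str]:
    """Return a bundled template path for known-broken models, else None."""
    for predicate, path in BUNDLED_TEMPLATES:
        if predicate(meta):
            return path
    return None  # fall back to the template built into the GGUF


print(pick_template({"general.name": "Functionary Small v3.2"}))
print(pick_template({"general.name": "Some Newer Model"}))
```

Keeping the table as an ordered list of predicates means a new entry is one line, and models with a good built-in template never match and are left alone.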

update function-calling.md w/ template override for functionary-small-v3.2

1b3c00c

github-actions bot added the documentation Improvements or additions to documentation label Mar 6, 2025

ochafik marked this pull request as ready for review March 6, 2025 01:06

ggerganov approved these changes Mar 6, 2025

ochafik merged commit 4299404 into ggml-org:master Mar 6, 2025
2 checks passed

ochafik mentioned this pull request Mar 6, 2025

Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars #9639

Merged

41 tasks

mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025

update function-calling.md w/ template override for functionary-small-v3.2 (ggml-org#12214)

97d3336

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Mar 19, 2025

update function-calling.md w/ template override for functionary-small-v3.2 (ggml-org#12214)

8b4cccb
