
feat: Support response_format and structured JSON responses. #3785

Open
1 task done
Tracked by #1151
actow opened this issue Jun 24, 2024 · 2 comments
Labels: category: model settings (Inference params, presets, templates) · category: providers (Local & remote inference providers) · move to Cortex · type: feature request (A new feature)

Comments


actow commented Jun 24, 2024

  • I have searched the existing issues

Is your feature request related to a problem? Please describe it

There is no way to force the model to return structured JSON at the API level.

Describe the solution

The response_format parameter is supported by several providers and models, such as Groq's Llama 3 (8B & 70B), Fireworks AI's Llama 3 70B, and OpenAI's GPT-3.5 and GPT-4.

https://console.groq.com/docs/api-reference#chat-create
https://platform.openai.com/docs/api-reference/audio/createSpeech#audio-createspeech-response_format
https://readme.fireworks.ai/docs/structured-response-formatting

It would be great if Jan provided a UI to set this; a minimal sketch of what such a setting would send upstream follows.
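
For illustration, here is a minimal Python sketch of the request a response_format toggle could emit, assuming an OpenAI-compatible chat completions endpoint (OpenAI, Groq, and Fireworks AI all accept the same field); BASE_URL, API_KEY, and the model name are placeholders:

```python
import requests

# Minimal sketch: the request a response_format toggle in Jan could send.
# Assumes an OpenAI-compatible /v1/chat/completions endpoint; BASE_URL,
# API_KEY, and the model name are placeholders.
BASE_URL = "https://api.openai.com/v1"
API_KEY = "sk-..."

resp = requests.post(
    f"{BASE_URL}/chat/completions",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "model": "gpt-3.5-turbo",
        "response_format": {"type": "json_object"},  # force valid JSON output
        "messages": [
            # JSON mode requires the word "JSON" to appear in the prompt.
            {"role": "system", "content": "Reply in JSON with keys city and country."},
            {"role": "user", "content": "Where is the Eiffel Tower?"},
        ],
    },
    timeout=60,
)
print(resp.json()["choices"][0]["message"]["content"])
```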

Teachability, documentation, adoption, migration strategy

(Same provider documentation links as above.)

What is the motivation / use case for changing the behavior?

Structured JSON responses are very useful for experimenting with how far a model's capabilities extend beyond a normal chatbot.

@actow actow added the type: feature request A new feature label Jun 24, 2024
@imtuyethan imtuyethan self-assigned this Jul 2, 2024
@imtuyethan imtuyethan removed their assignment Aug 28, 2024
@imtuyethan imtuyethan transferred this issue from janhq/jan Sep 2, 2024
@0xSage 0xSage added the category: app shell Installation, updater, global application issues label Sep 6, 2024
@dan-homebrew
Contributor

@nguyenhoangthuan99 I'm marking this for Sprint 20, but it's possible that this is not simply a matter of surfacing an upstream llama.cpp capability (see the sketch below).

  • Can you take a look and assess the scope?
  • If it's too big, we will postpone to Sprint 21.
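
For the local-model side, one plausible backing for a response_format setting is llama.cpp's grammar-constrained sampling: llama-server's /completion endpoint accepts a GBNF grammar string that restricts token sampling. A rough sketch, assuming a llama-server on the default port 8080; the tiny grammar below is illustrative only (llama.cpp bundles a fuller grammars/json.gbnf for arbitrary JSON):

```python
import requests

# Illustrative GBNF grammar constraining output to {"answer": "<text>"}.
GRAMMAR = r'''
root   ::= "{" ws "\"answer\"" ws ":" ws string ws "}"
string ::= "\"" [a-zA-Z0-9 .,]* "\""
ws     ::= [ \t\n]*
'''

resp = requests.post(
    "http://localhost:8080/completion",  # assumption: default llama-server address
    json={
        "prompt": "Answer in JSON: what is the capital of France?\n",
        "grammar": GRAMMAR,  # llama.cpp restricts sampling to this grammar
        "n_predict": 64,
    },
    timeout=60,
)
print(resp.json()["content"])
```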

@dan-homebrew
Contributor

Also: linked to janhq/cortex.cpp#295?

@gabrielle-ong gabrielle-ong transferred this issue from janhq/cortex.cpp Oct 13, 2024
@gabrielle-ong gabrielle-ong added the category: providers Local & remote inference providers label Oct 13, 2024
@gabrielle-ong gabrielle-ong moved this from Icebox to Investigating in Jan & Cortex Oct 13, 2024
@0xSage 0xSage added category: model settings Inference params, presets, templates and removed category: app shell Installation, updater, global application issues labels Oct 15, 2024
@0xSage 0xSage moved this from Investigating to Planning in Jan & Cortex Oct 15, 2024