feat: add env SUMMARIZATION_MODEL
to specify summarization model on the server side
#3288
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
The idea originated from the discussion with @H0llyW00dzZ in #3260.
This patch can be useful when using custom models. Similar to the idea of #2583, user can use
SUMMARIZATION_MODEL
to specify a cheap and fast model for summarization while using a powerful model for chat.SUMMARIZATION_MODEL
andCUSTOM_MODELS
are not set,gpt-3.5-turbo
will be sent to client and be used for summarization. This is the same as the current logic.SUMMARIZATION_MODEL
is set, the value will be used.SUMMARIZATION_MODEL
is not set butCUSTOM_MODELS
is set,""
will be sent to client and session model will be used on the client.