Support text completion and chat on Phi-3 model for Ollama and LlamaCpp #345
+254
−0
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Support text completion and chat on Phi-3 model for Ollama and LlamaCpp.
Embeddings and other options not yet implemented.
NOTE: The Phi-3 docs say that a system messages are not supported, however looking at various templates online it seems most projects are using them, so I left it there. Alternatively, an error could be thrown, or the system message could be automatically included in the first user message using some sort of scheme similar to what is being done with Mistral.