How to make llama.cpp return control to add additional context? #692

Closed
simplejackcoder opened this issue Apr 1, 2023 · 2 comments
Labels
enhancement (New feature or request), generation quality (Quality of model output), stale

Comments

@simplejackcoder

I want to be able to tell the model that, if it can't give a useful reply, it should return control to me so I can provide more information.

Similarly, how do I add more context so that it can reason about a full conversation or, say, a specific set of documents?

For example, I ask it something and it should say, "I don't know, can you provide me more information?" Then I give it a document. Then I can add another document to the prompt so it can draw on that as well, and so on.

I've heard this is some sort of chaining, but I don't understand how it works.
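
The loop described above (ask, get a "need more information" reply, add a document, ask again) can be sketched roughly as follows. This is only an illustration under assumptions: `generate` stands in for whatever function sends a prompt to the model and returns its text, the `NEED_MORE_INFO` sentinel is a made-up convention that the prompt asks the model to follow, and none of these names are part of llama.cpp. Whether a real model reliably emits the sentinel is not guaranteed.

```python
from typing import Callable, List

# Hypothetical sentinel the prompt asks the model to emit; not a llama.cpp feature.
NEED_MORE = "NEED_MORE_INFO"


def build_prompt(question: str, documents: List[str]) -> str:
    """Join the instruction, the documents supplied so far, and the question."""
    parts = [
        "Answer using only the documents below. "
        f"If they are not enough, reply exactly {NEED_MORE}."
    ]
    for i, doc in enumerate(documents, start=1):
        parts.append(f"Document {i}:\n{doc}")
    parts.append(f"Question: {question}\nAnswer:")
    return "\n\n".join(parts)


def ask_with_context(
    question: str,
    generate: Callable[[str], str],  # assumed wrapper around the model call
    pending_docs: List[str],
) -> str:
    """Re-ask the question, adding one more document each time the model asks for it."""
    documents: List[str] = []
    remaining = list(pending_docs)
    while True:
        reply = generate(build_prompt(question, documents)).strip()
        if NEED_MORE not in reply or not remaining:
            # Either the model answered, or there is nothing left to add.
            return reply
        documents.append(remaining.pop(0))


def fake_generate(prompt: str) -> str:
    """Stand-in for a real model so the sketch runs without one."""
    return "Paris" if "capital of France is Paris" in prompt else NEED_MORE


answer = ask_with_context(
    "What is the capital of France?",
    fake_generate,
    ["Bananas are yellow.", "The capital of France is Paris."],
)
print(answer)
```

Each round rebuilds the whole prompt from the accumulated documents, so the "conversation" exists only as text that the caller keeps and resends.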

@Carnildo
Carnildo commented Apr 2, 2023

It's not possible. Large language models are incapable of identifying things they don't know. They work by predicting the most likely continuation of the input text based on the training data they've seen. In many cases this is a remarkably good approximation of actual thinking, but it also means an LLM is unable to say "I don't know".

(If the training data says that something isn't known, an LLM is able to give the correct response, but this is different from being able to identify gaps in the training data.)
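
As a toy illustration of "predicting the most likely continuation", here is a word-level bigram counter. It is a deliberately tiny stand-in and not how llama.cpp or any real LLM is implemented; the function names and the fallback rule for unseen words are assumptions made for this sketch. The point it shows is that the predictor always emits some word, even for input it has never seen.

```python
from collections import Counter, defaultdict


def train_bigrams(text: str):
    """Count which word follows which in the training text."""
    words = text.split()
    following = defaultdict(Counter)
    for cur, nxt in zip(words, words[1:]):
        following[cur][nxt] += 1
    return following, Counter(words)


def predict_next(model, word: str) -> str:
    """Return the most frequent continuation seen in training."""
    following, overall = model
    if word in following:
        return following[word].most_common(1)[0][0]
    # Unseen context: the toy model still guesses (the most common word overall)
    # instead of reporting that it has no information.
    return overall.most_common(1)[0][0]


model = train_bigrams("the cat sat on the mat and the cat slept")
print(predict_next(model, "the"))    # seen context
print(predict_next(model, "zebra"))  # never seen in training
```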

@gjmulder added the enhancement (New feature or request) and generation quality (Quality of model output) labels Apr 6, 2023
Deadsg pushed a commit to Deadsg/llama.cpp that referenced this issue Dec 19, 2023
@github-actions bot added the stale label Mar 25, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.
