Setting temp=0 does not work as expected #684
Comments
Hi, I did a PR trying to fix this, can you test it? Thanks
Why not just handle the case explicitly?
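A minimal C++ sketch of what such explicit handling could look like (illustrative only; sample_token and its signature are assumptions, not the actual llama.cpp sampler): when temp == 0, skip the 1.0f/temp scaling entirely and pick the highest-logit token greedily.

```cpp
// Illustrative sketch, not the real llama.cpp sampling code.
#include <algorithm>
#include <cmath>
#include <iterator>
#include <random>
#include <vector>

// `logits` holds one raw score per vocabulary token.
static int sample_token(const std::vector<float> & logits, float temp, std::mt19937 & rng) {
    if (temp <= 0.0f) {
        // Greedy, deterministic path: highest logit wins, no 1.0f/temp scaling needed.
        return (int) std::distance(logits.begin(),
                                   std::max_element(logits.begin(), logits.end()));
    }

    // Stochastic path: scale logits by 1/temp, softmax, then sample.
    float max_logit = *std::max_element(logits.begin(), logits.end());
    std::vector<float> probs(logits.size());
    float sum = 0.0f;
    for (size_t i = 0; i < logits.size(); ++i) {
        probs[i] = std::exp((logits[i] - max_logit) / temp);
        sum += probs[i];
    }
    for (float & p : probs) {
        p /= sum;
    }
    std::discrete_distribution<int> dist(probs.begin(), probs.end());
    return dist(rng);
}
```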
Deadsg pushed a commit to Deadsg/llama.cpp that referenced this issue on Dec 19, 2023: PR: [issue: 672] add __version__
Prerequisites
Please answer the following questions for yourself before submitting an issue.
Expected Behavior
Setting sampling temperature to 0 should produce valid and "predictable" tokens.
Current Behavior
Setting temperature to 0 causes sampling to fail completely. This is due to plogits being scaled by 1.0f/temp before sampling here. I believe a workaround for this would be to make sampling deterministic when temp==0 by setting top_p=0.0 and top_k=1, and setting temp>0.
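A caller-side sketch of that workaround, under stated assumptions (sampling_params and its fields are hypothetical names for illustration, not a real llama.cpp struct): when the user requests temp == 0, force top_k = 1 and top_p = 0.0 so only the single best token can survive, and substitute any positive temp so the 1.0f/temp scaling stays finite.

```cpp
// Hypothetical parameter struct; field names mirror the knobs discussed above.
struct sampling_params {
    int   top_k = 40;
    float top_p = 0.95f;
    float temp  = 0.80f;
};

// If the caller asked for temp == 0, rewrite the parameters so sampling
// becomes deterministic without ever dividing logits by zero.
static sampling_params make_deterministic_if_needed(sampling_params p) {
    if (p.temp <= 0.0f) {
        p.top_k = 1;      // keep only the single highest-probability token
        p.top_p = 0.0f;   // nucleus sampling cannot widen the candidate pool
        p.temp  = 1.0f;   // any positive value works once top_k == 1
    }
    return p;
}
```

With top_k fixed at 1 the argmax token is always chosen, so the exact positive temperature substituted no longer affects the output.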