-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Core][Bugfix] Support prompt_logprobs returned with speculative decoding #8047
[Core][Bugfix] Support prompt_logprobs returned with speculative decoding #8047
Commits on Sep 18, 2024
-
fix: use INVALID_TOKEN_ID instead of uninitialized tensor data
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 488240d - Browse repository at this point
Copy the full SHA 488240dView commit details -
refactor: use INVALID_TOKEN_ID instead of magic number
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for fb58cd4 - Browse repository at this point
Copy the full SHA fb58cd4View commit details -
fix: skip entries for prompt tokens in spec decoding
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 9bcf56b - Browse repository at this point
Copy the full SHA 9bcf56bView commit details -
refactor: move INVALID_TOKEN_ID to sequence.py and rename
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 526f14c - Browse repository at this point
Copy the full SHA 526f14cView commit details -
fix: detokenize negative id to empty string
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 041e5fa - Browse repository at this point
Copy the full SHA 041e5faView commit details -
feat: support prompt_logprobs output with spec decoding
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for ad367ff - Browse repository at this point
Copy the full SHA ad367ffView commit details -
test: allow generate_w_logprobs even if no logprobs requested
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 6f420db - Browse repository at this point
Copy the full SHA 6f420dbView commit details -
test: update spec_decode e2e tests
Include logprobs cases and where disable_logprobs is True. Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for a15fea6 - Browse repository at this point
Copy the full SHA a15fea6View commit details
Commits on Sep 23, 2024
-
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 6e8e54d - Browse repository at this point
Copy the full SHA 6e8e54dView commit details