[BugFix] Fix MinPLogitsProcessor.update_states()
#23401
min_p_count was not updated correctly for added requests.

Signed-off-by: Nick Hill <nhill@redhat.com>
Code Review
This pull request addresses a bug in MinPLogitsProcessor.update_states() where min_p_count was incorrectly updated for added requests. The fix correctly adjusts the count based on whether a request with a non-zero min_p is added to a slot that previously had a zero min_p, and vice-versa. Additionally, the code for handling moved requests has been simplified for better readability. The changes are correct and improve the robustness of the logits processor.
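The counting rule described in this review can be illustrated with a minimal sketch. This is not the vLLM implementation: the class `MinPSlots` and its `add` and `remove` methods are hypothetical names made up for illustration, and the sketch only assumes that each batch slot holds one min_p value and that min_p_count should equal the number of slots with a non-zero min_p.

```python
class MinPSlots:
    """Hypothetical per-slot min_p bookkeeping (illustrative, not vLLM's class)."""

    def __init__(self, max_slots: int) -> None:
        # 0.0 means "min_p disabled" for that slot.
        self.min_p = [0.0] * max_slots
        # Number of slots whose min_p is currently non-zero.
        self.min_p_count = 0

    def add(self, index: int, min_p: float) -> None:
        """Place a request in a slot, possibly replacing an existing one."""
        prev = self.min_p[index]
        if min_p and not prev:
            # Non-zero min_p lands in a slot that had none: count goes up.
            self.min_p_count += 1
        elif not min_p and prev:
            # Zero min_p overwrites a non-zero one: count goes down.
            self.min_p_count -= 1
        # Non-zero replacing non-zero (or zero replacing zero) leaves the count alone.
        self.min_p[index] = min_p

    def remove(self, index: int) -> None:
        """Clear a slot, decrementing the count only if it was active."""
        if self.min_p[index]:
            self.min_p_count -= 1
        self.min_p[index] = 0.0


slots = MinPSlots(max_slots=4)
slots.add(0, 0.1)  # new request with min_p
slots.add(1, 0.0)  # new request without min_p
slots.add(0, 0.0)  # slot 0 replaced by a request without min_p
slots.add(1, 0.2)  # slot 1 replaced by a request with min_p
```

Comparing the incoming value against the slot's previous value, rather than incrementing on every non-zero add, is what keeps the count consistent when a slot is reused.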
Signed-off-by: Nick Hill <nhill@redhat.com>
Signed-off-by: Xiao Yu <xiao.yu@amd.com>
Signed-off-by: Nick Hill <nhill@redhat.com>
Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
min_p_count was not updated correctly for replaced requests.

cc @afeldman-nm