-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[pull] master from kserve:master #361
Conversation
Signed-off-by: Vincent Hou <shou73@bloomberg.net>
Hi @pull[bot]. Thanks for your PR. I'm waiting for a opendatahub-io member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>
fix conflict Signed-off-by: jooho <jlee@redhat.com>
* Add precaution again running v1 endpoints on openai models Signed-off-by: grandbora <grandbora@fb.com> * Remove the check from explain Signed-off-by: grandbora <grandbora@fb.com> * Add a warning log for explain Signed-off-by: grandbora <grandbora@fb.com> --------- Signed-off-by: grandbora <grandbora@fb.com>
* Add FP16 datatype support for OIP grpc Add grpc server tests Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com> * Add grpcio-testing as test dependency Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com> * Fix model repository initialization default value Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com> * Remove fp16 global map Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com> * Resolve comments Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com> --------- Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>
* added flag to return raw prediction results Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com> * black fix Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com> * unit test bug fix Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com> * unittest for token classification Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com> * verify codegen Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com> * bug fix Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com> --------- Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>
Signed-off-by: Alex Peters <alpe@users.noreply.github.com> Signed-off-by: Dan Sun <dsun20@bloomberg.net> Co-authored-by: Dan Sun <dsun20@bloomberg.net>
Fix model_id and model_dir precendence Signed-off-by: Dan Sun <dsun20@bloomberg.net>
Signed-off-by: Dan Sun <dsun20@bloomberg.net>
* Fixup max_length for HF and model info for vLLM Signed-off-by: Dattu Sharma <venkatadattasainimmaturi@gmail.com> * Use vLLM's implementation for max_length Also fixup error in calculating input sequence lenngth Signed-off-by: Dattu Sharma <venkatadattasainimmaturi@gmail.com> * Fixup linter Signed-off-by: Dattu Sharma <venkatadattasainimmaturi@gmail.com> * Add license to new file Signed-off-by: Dattu Sharma <venkatadattasainimmaturi@gmail.com> * Add bloom test case for max_tokens Revert input length fix Signed-off-by: Dattu Sharma <venkatadattasainimmaturi@gmail.com> * Set limit on opt chat competion e2e test Signed-off-by: Dattu Sharma <venkatadattasainimmaturi@gmail.com> --------- Signed-off-by: Dattu Sharma <venkatadattasainimmaturi@gmail.com>
#3712) * Fix input token count and add completion usage Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com> * Add max_length for test models Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com> --------- Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>
update version to 0.13.0 Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>
/ok-to-test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: israel-hdez, pull[bot] The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
97c83cf
into
opendatahub-io:master
updating ksi file
See Commits and Changes for more details.
Created by
pull[bot]
Can you help keep this open source service alive? 💖 Please sponsor : )