-
Notifications
You must be signed in to change notification settings - Fork 15
sync release with main #205
sync release with main #205
Conversation
dtrifiro
commented
Oct 16, 2024
- gha: remove sync with upstream workflow
- bump vllm-tgis-adapter to 0.5.3 (bump adapter to 0.5.3 #199)
- Dockerfile*.ubi: install vllm and vllm-tgis-adapter in the same step to make sure the correct version is installed
- Sync with upstream @ v0.6.3
…oject#8872) Signed-off-by: kevin <kevin@anyscale.com>
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Co-authored-by: mgoin <michael@neuralmagic.com>
vllm-project#8378) Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
Signed-off-by: tylertitsworth <tyler.titsworth@intel.com> Co-authored-by: youkaichao <youkaichao@126.com>
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
…9209) Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
…-project#9309) Co-authored-by: youkaichao <youkaichao@126.com>
Has changes for loading catikit style *.pt adapters
Co-authored-by: sanghol <sanghol@allenai.org> Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com> Co-authored-by: Roger Wang <ywang@roblox.com>
Sync with upstream @ v0.6.3
…to make sure the correct version is installed
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: dtrifiro The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
@dtrifiro: The following tests failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
- remove build steps/dependencies - allow for installing pre-built flash-attention/vllm wheels - default ROCM_VERSION to 6.3.4, allowing ovverride with env vars - cleanup rocm docker bake, defaults - amdsmi: use setup.py to build - add amdsmi bind mount - remove flashinfer from rocm target - bump vllm-tgis-adapter to 0.7.0 - Dockerfile*.ubi: bump ubi base