[New feature] Add inference load balance controller for fastdeploy llm #1699
Workflow file for this run
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
name: Build | |
on: [pull_request] | |
jobs: | |
macOS-latest-py: | |
runs-on: macos-latest | |
steps: | |
- name: Clone | |
uses: actions/checkout@v1 | |
- name: Get CMake | |
uses: lukka/get-cmake@latest | |
- name: Get Python | |
uses: actions/setup-python@v4 | |
with: | |
python-version: '3.10' | |
- name: Build FastDeploy | |
working-directory: ./python | |
run: | | |
export ENABLE_ORT_BACKEND=ON | |
export ENABLE_PADDLE_BACKEND=OFF | |
export ENABLE_OPENVINO_BACKEND=OFF | |
export ENABLE_VISION=ON | |
export ENABLE_TEXT=ON | |
python -m pip install wheel | |
python setup.py build | |
python setup.py bdist_wheel | |
ls -l |