Skip to content

[ROCm] Fix fp8 unrolledx4 matmul kernel. #120

[ROCm] Fix fp8 unrolledx4 matmul kernel.

[ROCm] Fix fp8 unrolledx4 matmul kernel. #120

Workflow file for this run

name: PR Test (AMD)
on:
push:
branches: [ main ]
paths:
- "python/sglang/**"
- "test/**"
pull_request:
branches: [ main ]
paths:
- "python/sglang/**"
- "test/**"
workflow_dispatch:
concurrency:
group: pr-test-amd-${{ github.ref }}
cancel-in-progress: true
jobs:
accuracy-test-1-gpu:
if: github.repository == 'sgl-project/sglang' || github.event_name == 'pull_request'
runs-on: linux-mi300-gpu-1
container:
image: lmsysorg/sglang:v0.4.2.post2-rocm630
options: --user root --device=/dev/kfd --device=/dev/dri --ipc=host --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined
env:
HF_TOKEN: ${{ secrets.HF_TOKEN }}
steps:
- name: Checkout code
uses: actions/checkout@v3
- name: Install dependencies
run: |
pip install --upgrade pip
cd sgl-kernel
python3 setup_rocm.py install
cd ..
pip install -e "python[dev_hip]"
git clone https://github.com/merrymercy/human-eval.git
cd human-eval
pip install -e .
- name: Evaluate Accuracy
timeout-minutes: 20
run: |
cd test/srt
python3 test_eval_accuracy_large.py
finish:
needs: [
accuracy-test-1-gpu
]
runs-on: ubuntu-latest
steps:
- name: Finish
run: echo "This is an empty step to ensure that all jobs are completed."