Skip to content

Commit f6e9adb

Browse files
loadamsumchand
authored andcommitted
Update workflows that use cu116 to cu117 (deepspeedai#5361)
The following workflows were specifying runners with cu116, we are updating these to cu117. Workflows impacted: - [x] nv-accelerate-v100 - [new build](https://github.com/microsoft/DeepSpeed/actions/runs/8557768042/job/23450811816?pr=5361): 22 passed, 5 skipped, 11 warnings in 129.04s (0:02:09) - [old build](https://github.com/microsoft/DeepSpeed/actions/runs/8547131990/job/23418750315): 22 passed, 5 skipped, 11 warnings in 318.84s (0:05:18) - [x] nv-ds-chat - [new build](https://github.com/microsoft/DeepSpeed/actions/runs/8546543733/job/23417119129): 15 passed, 1 skipped in 2729.91s (0:45:29) - [old build](https://github.com/microsoft/DeepSpeed/actions/runs/8531148226/job/23370268262): 15 passed, 1 skipped in 3511.82s (0:58:31) - [x] nv-inference - recently failing and disabled, needs fixes. - [new build](https://github.com/microsoft/DeepSpeed/actions/runs/8558749560): 36 failed, 74 passed, 95 skipped, 4 warnings in 877.45s (0:14:37) - [old build](https://github.com/microsoft/DeepSpeed/actions/runs/8546382497/job/23416626521): 36 failed, 74 passed, 95 skipped, 4 warnings in 3633.34s (1:00:33) - [x] nv-mii - [new build](https://github.com/microsoft/DeepSpeed/actions/runs/8557768075/job/23450812054?pr=5361): 4 passed, 23 deselected, 3 warnings in 116.28s (0:01:56) - [old build](https://github.com/microsoft/DeepSpeed/actions/runs/8547246351/job/23419064526): 4 passed, 23 deselected, 3 warnings in 196.79s (0:03:16) - [x] nv-nightly - [new build](https://github.com/microsoft/DeepSpeed/actions/runs/8557763671/job/23450792634): 3 passed, 3 skipped, 4713 deselected, 1 warning in 1831.83s (0:30:31) - [old build](https://github.com/microsoft/DeepSpeed/actions/runs/8547230983/job/23419020962): 3 passed, 3 skipped, 4713 deselected, 1 warning in 2459.06s (0:40:59) - [x] nv-torch-latest-v100 - [new build](https://github.com/microsoft/DeepSpeed/actions/runs/8557768039/job/23450811779): 947 passed, 169 skipped, 4 warnings in 2550.25s (0:42:30) and 61 passed, 4 skipped, 4643 deselected, 1 warning in 563.34s (0:09:23) - [old build](https://github.com/microsoft/DeepSpeed/actions/runs/8547232496/job/23419024966): 947 passed, 169 skipped, 4 warnings in 3216.47s (0:53:36) and 61 passed, 4 skipped, 4643 deselected, 1 warning in 611.17s (0:10:11) - [x] nv-torch-nightly-v100 - [new build](https://github.com/microsoft/DeepSpeed/actions/runs/8558930744): 13 failed, 982 passed, 121 skipped, 4 warnings in 2691.26s (0:44:51) - [old build](https://github.com/microsoft/DeepSpeed/actions/runs/8558895638): 13 failed, 982 passed, 121 skipped, 4 warnings in 3117.03s (0:51:57) - [x] nv-transformers-v100 - disabled for 4 months, needs work regardless.
1 parent 7b211b6 commit f6e9adb

File tree

8 files changed

+10
-10
lines changed

8 files changed

+10
-10
lines changed

.github/workflows/nv-accelerate-v100.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ concurrency:
1919

2020
jobs:
2121
unit-tests:
22-
runs-on: [self-hosted, nvidia, cu116, v100]
22+
runs-on: [self-hosted, nvidia, cu117, v100]
2323

2424
steps:
2525
- uses: actions/checkout@v3

.github/workflows/nv-ds-chat.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ permissions:
2121

2222
jobs:
2323
unit-tests:
24-
runs-on: [self-hosted, nvidia, cu116, v100]
24+
runs-on: [self-hosted, nvidia, cu117, v100]
2525

2626
steps:
2727
- uses: actions/checkout@v3

.github/workflows/nv-inference.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ concurrency:
2222

2323
jobs:
2424
unit-tests:
25-
runs-on: [self-hosted, nvidia, cu116, v100]
25+
runs-on: [self-hosted, nvidia, cu117, v100]
2626

2727
steps:
2828
- uses: actions/checkout@v3

.github/workflows/nv-mii.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ concurrency:
2727

2828
jobs:
2929
unit-tests:
30-
runs-on: [self-hosted, nvidia, cu116, v100]
30+
runs-on: [self-hosted, nvidia, cu117, v100]
3131

3232
steps:
3333
- uses: actions/checkout@v3

.github/workflows/nv-nightly.yml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ permissions:
1515

1616
jobs:
1717
unit-tests:
18-
runs-on: [self-hosted, nvidia, cu116, v100]
18+
runs-on: [self-hosted, nvidia, cu117, v100]
1919

2020
steps:
2121
- uses: actions/checkout@v3
@@ -25,7 +25,7 @@ jobs:
2525

2626
- name: Install pytorch
2727
run: |
28-
pip install -U --cache-dir $TORCH_CACHE torch==1.13.1 torchvision --index-url https://download.pytorch.org/whl/cu116
28+
pip install -U --cache-dir $TORCH_CACHE torch==1.13.1 torchvision --index-url https://download.pytorch.org/whl/cu117
2929
python -c "import torch; print('torch:', torch.__version__, torch)"
3030
python -c "import torch; print('CUDA available:', torch.cuda.is_available())"
3131
@@ -55,7 +55,7 @@ jobs:
5555
run: |
5656
unset TORCH_CUDA_ARCH_LIST # only jit compile for current arch
5757
cd tests
58-
pytest $PYTEST_OPTS --forked -m 'nightly' unit/ --torch_ver="1.13" --cuda_ver="11.6"
58+
pytest $PYTEST_OPTS --forked -m 'nightly' unit/ --torch_ver="1.13" --cuda_ver="11.7"
5959
6060
- name: Open GitHub issue if nightly CI fails
6161
if: ${{ failure() && (github.event_name == 'schedule') }}

.github/workflows/nv-torch-latest-v100.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ concurrency:
1919

2020
jobs:
2121
unit-tests:
22-
runs-on: [self-hosted, nvidia, cu116, v100]
22+
runs-on: [self-hosted, nvidia, cu117, v100]
2323

2424
steps:
2525
- uses: actions/checkout@v3

.github/workflows/nv-torch-nightly-v100.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ permissions:
1515

1616
jobs:
1717
unit-tests:
18-
runs-on: [self-hosted, nvidia, cu116, v100]
18+
runs-on: [self-hosted, nvidia, cu117, v100]
1919

2020
steps:
2121
- uses: actions/checkout@v3

.github/workflows/nv-transformers-v100.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ concurrency:
1818

1919
jobs:
2020
unit-tests:
21-
runs-on: [self-hosted, nvidia, cu116, v100]
21+
runs-on: [self-hosted, nvidia, cu117, v100]
2222

2323
steps:
2424
- uses: actions/checkout@v3

0 commit comments

Comments
 (0)