Add support for CI testing #124

sandeepd-nv · 2024-09-23T12:34:23Z

No description provided.

sandeepd-nv · 2024-09-26T03:42:44Z

Blocked on #128.

leofang · 2024-10-09T03:20:40Z

Blocked on #128.

See #128 (comment), thx!

.github/workflows/gh-build-and-test.yml

copy-pr-bot · 2024-12-07T19:23:58Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

leofang · 2024-12-07T19:24:46Z

/ok to test

leofang · 2024-12-08T22:00:37Z

/ok to test

leofang · 2024-12-08T22:04:04Z

/ok to test

leofang · 2024-12-08T22:42:55Z

/ok to test

leofang · 2024-12-08T22:55:41Z

/ok to test

leofang · 2024-12-08T23:07:21Z

/ok to test

leofang · 2024-12-08T23:08:15Z

/ok to test

leofang · 2024-12-08T23:22:20Z

Here are some updates since Friday:

it makes the CI logs hard to browse (everything from the same workflow is collapsed inside the same title)

It turns out that this is a misunderstanding (of mine), sorry! It's the other way around: It's the composite actions that do this, not reusable workflows, see, e.g. https://docs.github.com/en/actions/sharing-automations/avoiding-duplication#comparison-of-reusable-workflows-and-composite-actions. So I refactored in the opposite (and wrong) direction.

Let us do this (change all composite actions to reusable workflows) in a follow-up PR since the CI is now working and there's no reason to delay.

just discovered: environment variables cannot be passed across workflows (https://github.com/orgs/community/discussions/26671), which therefore requires both build and test workflows to do the same setup, and it requires complex logic to manage

This is another misunderstanding (of mine), sorry (again)! For both cases (distinct jobs vs distinct workflows), it require some handling of input/output. There's no way for sharing the env vars in either case.

We are overcomplicating the CI for a simple project and bending backward

As part of this I removed all CI scripts in commit 7b074f0. It is the best that we focus on testing pip-based workflows for now, and add conda next (which would be treated differently). Mixing-and-matching is not ideal.

leofang · 2024-12-08T23:24:49Z

cuda_core/tests/conftest.py

+@pytest.fixture(scope="session", autouse=True)
+def always_init_cuda():
+    handle_return(driver.cuInit(0))


FYI, @ksimpson-work the CI was able to catch this issue: Depending on how the tests are run, it could be possible that a test ends without CUDA even initialized. So we must ensure it ourselves by the test start time.

leofang · 2024-12-08T23:25:48Z

cuda_core/tests/conftest.py

+    ctx = handle_return(driver.cuCtxGetCurrent())
+    if int(ctx) == 0:
+        # no active context, do nothing
+        return


FYI, @ksimpson-work another issue caught by the CI (and also back in #261): A test could end early without a CUDA context set current, so we need to detect this at the test teardown time.

leofang · 2024-12-08T23:26:33Z

cuda_core/tests/example_tests/utils.py

@@ -10,7 +10,6 @@
 import os
 import sys

-import cupy as cp


For now I treat CuPy as an optional test dependency, so any reference to CuPy in this file should be removed. (We're not using too much memory during tests anyway.)

leofang · 2024-12-08T23:28:52Z

cuda_core/tests/test_program.py

+def can_load_generated_ptx():
+    _, driver_ver = cuda.cuDriverGetVersion()
+    _, nvrtc_major, nvrtc_minor = nvrtc.nvrtcVersion()
+    if nvrtc_major * 1000 + nvrtc_minor * 10 > driver_ver:
+        return False
+    return True


FYI @ksimpson-work this is akin to this snippet that I added to CuPy in the past (PTX might not be loadable/JIT'able if it's newer than the driver):
https://github.com/cupy/cupy/blob/8eb16ac910e85c119a20f68a69de9a2e6034069c/tests/cupy_tests/core_tests/test_raw.py#L557-L568

leofang · 2024-12-08T23:30:30Z

Thanks for help, @sandeepd-nv!

sandeepd-nv added the CI/CD CI/CD infrastructure label Sep 23, 2024

sandeepd-nv self-assigned this Sep 23, 2024

sandeepd-nv force-pushed the add_ci_test branch from 7763fc9 to 8e11153 Compare November 15, 2024 14:45

sandeepd-nv force-pushed the add_ci_test branch from 19b1525 to 5ffa6b7 Compare November 26, 2024 22:31

sandeepd-nv added 15 commits December 3, 2024 03:08

Adding support for CI testing.

c626b95

Supply python-version.

5467b52

Update test driver to test bindings and core separately.

c78ebfd

Adding support for CI testing.

e5bf104

Adding support for CI testing. Attempt 2.

360e1b2

Adding support for CI testing. Attempt 3.

67b7aed

Use container for tests on the GPU runner.

6fab977

Use container for tests on the GPU runner. Attempt 2.

f2a0939

Remove build caching.

508a83c

Hard select Build (without container).

72062aa

Use container with preinstalled conda for build.

32ca908

Use container with preinstalled conda for build. Attempt 2.

970a8e5

Use container with preinstalled conda for build. Attempt 3.

be969e5

Updated paths.

3382d68

Removed duplicate tests section.

a9ed0c6

sandeepd-nv force-pushed the add_ci_test branch from 868d01b to a9ed0c6 Compare December 2, 2024 21:39

leofang mentioned this pull request Dec 3, 2024

Add docs, tests, and samples for StridedMemoryView/@args_viewable_as_strided_memory #247

Merged

leofang reviewed Dec 4, 2024

View reviewed changes

.github/workflows/gh-build-and-test.yml Outdated Show resolved Hide resolved

leofang added the P0 High priority - Must do! label Dec 4, 2024

sandeepd-nv and others added 2 commits December 4, 2024 09:49

Run cuda_core tests before cuda_binding.

84124b5

Merge branch 'main' into add_ci_test

47a7235

skip testing on win; remove mac

aeebaf7

leofang force-pushed the add_ci_test branch from 00a63ca to 8692ec1 Compare December 8, 2024 22:03

switch to use github cache to improve reuse

ed0386a

leofang force-pushed the add_ci_test branch from 8692ec1 to ed0386a Compare December 8, 2024 22:42

clean up legacy CI scripts

7b074f0

leofang closed this Dec 8, 2024

leofang reopened this Dec 8, 2024

leofang marked this pull request as ready for review December 8, 2024 23:09

leofang approved these changes Dec 8, 2024

View reviewed changes

leofang added this to the cuda.core beta 2 milestone Dec 8, 2024

leofang added test Improvements or additions to tests cuda.bindings Everything related to the cuda.bindings module cuda.core Everything related to the cuda.core module labels Dec 8, 2024

leofang reviewed Dec 8, 2024

View reviewed changes

leofang merged commit 0723d62 into NVIDIA:main Dec 8, 2024
46 of 48 checks passed

This was referenced Dec 8, 2024

Set up a public, GitHub-Action-based CI infrastructure #81

Closed

Add py313 builds #272

Merged

CI: Convert setup/build/test composite actions to reusable workflows #278

Closed

leofang linked an issue Dec 9, 2024 that may be closed by this pull request

Add tests to github actions #253

Closed

leofang mentioned this pull request Dec 9, 2024

Add tests to github actions #253

Closed

Add support for CI testing #124

Add support for CI testing #124

Uh oh!

Conversation

sandeepd-nv commented Sep 23, 2024

Uh oh!

sandeepd-nv commented Sep 26, 2024

Uh oh!

leofang commented Oct 9, 2024

Uh oh!

Uh oh!

copy-pr-bot bot commented Dec 7, 2024

Uh oh!

leofang commented Dec 7, 2024

Uh oh!

leofang commented Dec 8, 2024

Uh oh!

leofang commented Dec 8, 2024

Uh oh!

leofang commented Dec 8, 2024

Uh oh!

leofang commented Dec 8, 2024

Uh oh!

leofang commented Dec 8, 2024

Uh oh!

leofang commented Dec 8, 2024

Uh oh!

leofang commented Dec 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

leofang Dec 8, 2024

Choose a reason for hiding this comment

Uh oh!

leofang Dec 8, 2024

Choose a reason for hiding this comment

Uh oh!

leofang Dec 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

leofang Dec 8, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

leofang commented Dec 8, 2024

Uh oh!

Uh oh!

leofang commented Dec 8, 2024 •

edited

Loading

leofang Dec 8, 2024 •

edited

Loading