Add docs, tests, and samples for `StridedMemoryView`/`@args_viewable_as_strided_memory` #247

leofang · 2024-11-16T04:20:48Z

Close #143. Close #236.

- Rename `viewable` to `args_viewable_as_strided_memory`.
- Rename the `device_accessible` data class member to `is_device_accessible` for better consistency.
- Set the `device_id` data class member to -1 if the pointer is only accessible on CPU.
- Add docs, tests, and a code sample for `StridedMemoryView` and `@args_viewable_as_strided_memory`.
- Modify `.gitattributes` to enforce Linux (LF) line endings.
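
To illustrate the renamed member and the CPU convention listed above, here is a minimal sketch. `ViewInfo` and `describe` are hypothetical stand-ins invented for this example and are not the actual `StridedMemoryView` class; only the member names `device_id` and `is_device_accessible` and the -1 convention come from the description.

```python
from dataclasses import dataclass


@dataclass
class ViewInfo:
    """Hypothetical stand-in for illustration; NOT the real StridedMemoryView."""

    ptr: int
    device_id: int  # -1 when the pointer is only accessible on CPU
    is_device_accessible: bool


def describe(view: ViewInfo) -> str:
    # Branch on the boolean flag rather than on the -1 sentinel.
    if view.is_device_accessible:
        return f"device memory on GPU {view.device_id}"
    return "host-only memory"


cpu_view = ViewInfo(ptr=0x1000, device_id=-1, is_device_accessible=False)
gpu_view = ViewInfo(ptr=0x2000, device_id=0, is_device_accessible=True)
print(describe(cpu_view))
print(describe(gpu_view))
```

A consumer written this way never has to interpret -1 as a device ordinal; the flag tells it whether the ID is meaningful.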

copy-pr-bot · 2024-11-16T04:20:52Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

cuda_core/cuda/core/experimental/_memoryview.pyx

rwgk

Looks good to me!

cuda_core/cuda/core/experimental/_memoryview.pyx

cuda_core/tests/test_utils.py

leofang · 2024-12-02T02:48:52Z

cuda_core/tests/conftest.py

Note: The diff renders weirdly because I changed the line endings in commit b5cfdce. The only true change is the code added starting at line 44 (related to cffi).

leofang · 2024-12-02T02:51:49Z

cuda_core/cuda/core/experimental/utils.py

+from cuda.core.experimental._memoryview import args_viewable_as_strided_memory  # noqa: F401
+from cuda.core.experimental._memoryview import StridedMemoryView  # noqa: F401


Note: these were removed by Ruff by mistake (#201 (comment)). Fixing them here with noqa.

leofang · 2024-12-02T03:09:43Z

@rwgk @vzhurba01 This is ready for a final review. I've updated the PR description. I also took the liberty to rename the device_accessible data class member to is_device_accessible, because it is not yet a public API until after this PR. Same reasoning applies to args_viewable_as_strided_memory. (We did not document any of this back in v0.1.0.)

leofang · 2024-12-02T03:25:24Z

/ok to test

rwgk · 2024-12-02T06:47:38Z

cuda_core/cuda/core/experimental/_memoryview.pyx

@@ -284,7 +334,34 @@ cdef StridedMemoryView view_as_cai(obj, stream_ptr, view=None):
    return buf


-def viewable(tuple arg_indices):
+def args_viewable_as_strided_memory(tuple arg_indices):


Could this also be made to work with

def args_viewable_as_strided_memory(*arg_indices):

and then

@args_viewable_as_strided_memory(1)
def my_func(arg0, arg1, arg2, stream: Stream):

I did think of this and I thought it was discussed somewhere (internally) but I can't find it. One challenge I see is extensibility: What if we also want to support keyword arguments? Extending the current signature is straightforward:

def args_viewable_as_strided_memory(tuple arg_indices, tuple kwarg_names): ...

then on the call site

@args_viewable_as_strided_memory((1,), ("argB",))
def my_func(arg0, arg1, arg2, *, argA=None, argB=None): ...

But if we change the signature to a naive *args, **kwargs, can we still support this extension?
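
The trade-off in this exchange can be sketched in plain Python. Everything below is hypothetical: `mark_args`, `mark_args_variadic`, and `Wrapped` are names invented for this illustration and are not the `cuda.core` implementation. The keyword-only `kwarg_names` parameter in the variadic form is just one option for keeping the extension open; the thread does not settle on it.

```python
import functools


class Wrapped:
    """Hypothetical marker standing in for a converted argument."""

    def __init__(self, obj):
        self.obj = obj


def mark_args(arg_indices: tuple, kwarg_names: tuple = ()):
    """Tuple-based signature: positional indices and keyword names stay separate."""

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            args = list(args)
            for idx in arg_indices:
                args[idx] = Wrapped(args[idx])
            for name in kwarg_names:
                if name in kwargs:
                    kwargs[name] = Wrapped(kwargs[name])
            return func(*args, **kwargs)

        return wrapper

    return decorator


def mark_args_variadic(*arg_indices, kwarg_names=()):
    """Variadic spelling; keyword names go through a keyword-only parameter."""
    return mark_args(arg_indices, tuple(kwarg_names))


@mark_args((1,), ("argB",))
def f(arg0, arg1, arg2, *, argA=None, argB=None):
    return [type(x).__name__ for x in (arg0, arg1, arg2, argA, argB)]


@mark_args_variadic(1, kwarg_names=("argB",))
def g(arg0, arg1, arg2, *, argA=None, argB=None):
    return [type(x).__name__ for x in (arg0, arg1, arg2, argA, argB)]


print(f(1, 2, 3, argA=4, argB=5))
print(g(1, 2, 3, argA=4, argB=5))
```

Both spellings wrap the same arguments here. The difference is only at the call site: the tuple form makes the two groups explicit, while the variadic form is shorter when only positional indices are needed.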

cuda_core/examples/strided_memory_view.py

leofang · 2024-12-02T20:36:36Z

/ok to test

rwgk · 2024-12-02T23:19:48Z

cuda_core/tests/conftest.py

+    for f in files:
+        try:  # noqa: SIM105
+            os.remove(f)
+        except FileNotFoundError:


When could this occur? I'd expect that the glob above produces only files that exist.

General experience: Explicitly cleaning up right before running a test is more helpful. That's the most certain way to ensure that the artifacts do not exist when the test starts, and in case the test fails, retaining the artifacts can be very useful for debugging.

If you think that idea could be useful here: git status will show the artifacts. I'd generate them in a subdir that we can .gitignore.

I would like to do the opposite: ensure the artifacts do not exist after testing, regardless of whether the tests succeed. I don't want any artifact to remain after the tests finish, and I find it troublesome to have to update .gitignore to skip the artifacts (depending on where we run the tests, the artifact location could change). Without an RAII-like cleanup, they would remain on the file system after the tests, and a subsequent local run (outside of pytest) could accidentally reuse them, which is a side effect I want to avoid.

Sounds good.

I'm still wondering when/why the except FileNotFoundError is needed, but it most likely will not do any harm to have it here.

It could happen if CFFI compilation fails for whatever reason (so that there's no artifact to remove).

The list of files is the result of glob.glob(os.path.join(os.getcwd(), "_cpu_obj*")), which will only produce what actually exists in the filesystem (it could be empty for example).

>>> import glob
>>> glob.glob("_cpu_obj*")
[]

This could only go wrong if something concurrently deletes the files. (I wouldn't want to mask that.)

> This could only go wrong if something concurrently deletes the files. (I wouldn't want to mask that.)

You might be onto something. I wrote this snippet a while back and perhaps back then the tests there were run under pytest-xdist which parallelized the tests.
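
The "clean up after the tests, whatever happens" idea from this thread can be sketched with the standard library alone. This is an illustration under assumptions, not the code in conftest.py: `remove_artifacts_afterwards` is a hypothetical helper, and the files are created in a temporary directory as a stand-in for the sample's build artifacts. `contextlib.suppress` is the form that the SIM105 lint rule suggests in place of try/except/pass.

```python
import contextlib
import glob
import os
import tempfile


@contextlib.contextmanager
def remove_artifacts_afterwards(pattern):
    """Delete files matching `pattern` on exit, even if the body raises."""
    try:
        yield
    finally:
        for path in glob.glob(pattern):
            # glob only lists files that exist; suppress covers the case where
            # something else removes one between the glob and the remove.
            with contextlib.suppress(FileNotFoundError):
                os.remove(path)


workdir = tempfile.mkdtemp()
pattern = os.path.join(workdir, "_cpu_obj*")

with remove_artifacts_afterwards(pattern):
    # Stand-in for the sample that generates build artifacts.
    for suffix in (".c", ".o"):
        with open(os.path.join(workdir, "_cpu_obj" + suffix), "w") as fh:
            fh.write("artifact")
    created = sorted(os.path.basename(p) for p in glob.glob(pattern))

remaining = glob.glob(pattern)
os.rmdir(workdir)  # succeeds only because the directory is empty again
print(created, remaining)
```

Whether to suppress `FileNotFoundError` at all is the open question in the thread: suppressing it tolerates concurrent deletion, while letting it propagate would surface such a race.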

FWIW, I asked ChatGPT for help with temp files. The answer suggests that it's really easy to avoid filesystem races with pytest:

https://docs.google.com/document/d/1IuhmAgtITcnvrEJY5BTkCb-taWw2Soj2YBmU535Ttoc/edit?usp=sharing

Use tmp_path Fixture:

tmp_path provides a Path object from Python's pathlib, which is modern and more feature-rich.
This is the recommended approach for most use cases as Path is more intuitive and robust.

Yes, we also use it in test_nvjitlink.py. It's great when it works, but it does not work here: we have no control over where the artifacts go because the sample generates them, not the test (think of it as a subprocess, but worse, because we exec() the samples).

rwgk · 2024-12-03T00:07:33Z

cuda_core/examples/strided_memory_view.py

+    from cffi import FFI
+except ImportError:
+    print("cffi is not installed, the CPU example would be skipped", file=sys.stderr)
+    cffi = None


FFI = None

I discovered that because I didn't have cffi when running the tests.

Which brings me to another question: Would it make sense to add cffi to the test dependencies (e.g. extras_require in setup.py)?

Good catch! Fixed in commit 6af4da3.

> Would it make sense to add cffi to the test dependencies (e.g. extras_require in setup.py)?

Our test has not yet been enabled in the CI (#124). I would like to revisit this later, as the decision could go either way and I want to see how the CI test infra is set up before deciding. (The story with CFFI is a bit complicated, because we're using its "API mode" here, which also needs a C/C++ compiler to be present at run time.)
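
The import fallback fix discussed above comes down to binding the same name that later code checks. A minimal sketch, assuming a hypothetical `run_cpu_example` entry point that is not part of the actual sample:

```python
import sys

try:
    from cffi import FFI
except ImportError:
    print("cffi is not installed, the CPU example would be skipped", file=sys.stderr)
    # Bind the name the rest of the module tests. Assigning `cffi = None`
    # instead would leave `FFI` undefined and raise NameError later.
    FFI = None


def run_cpu_example():
    """Hypothetical entry point; returns a status string for illustration."""
    if FFI is None:
        return "skipped"
    FFI()  # only reached when cffi is importable
    return "ran"


print(run_cpu_example())
```

With this shape the module imports cleanly whether or not cffi is available, and the skip decision is made in one place.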

rwgk · 2024-12-03T00:55:59Z

cuda_core/tests/conftest.py

+    for f in files:
+        try:  # noqa: SIM105
+            os.remove(f)
+        except FileNotFoundError:


Sounds good.

I'm still wondering when/why the except FileNotFoundError is needed, but it most likely will not do any harm to have it here.

leofang · 2024-12-03T01:05:08Z

Thanks, @rwgk (and Satya, who reviewed offline and suggested some comment additions in commit 6af4da3)! Let's give @vzhurba01 some time to review the docs before merging.

cuda_core/tests/test_utils.py

vzhurba01

I'm done with my review. This was my first proper look into both DLPack and CAI, so the tests and samples helped a lot in trying to understand what's going on.

leofang · 2024-12-04T02:38:57Z

Thanks, Vlad! Since Ralf has approved, and the last commit is minor, let me merge ahead.

add tests for viewable

15ebef3

leofang added the documentation, P0, test, and cuda.core labels Nov 16, 2024

leofang added this to the cuda.core beta 2 milestone Nov 16, 2024

leofang self-assigned this Nov 16, 2024

leofang added 3 commits November 16, 2024 04:50

use numba to test CAI & make test a bit cleaner

80b2556

add tests for creating views directly

8ce0aa6

add docs

b55b8ba

leofang requested review from rwgk and vzhurba01 November 26, 2024 19:42

rwgk previously approved these changes Nov 27, 2024

leofang added 2 commits November 30, 2024 02:44

Merge branch 'main' into strided_memory_view

8295d56

fix formatting

f1239a2

leofang dismissed rwgk’s stale review via f1239a2 November 30, 2024 02:45

leofang added 3 commits November 30, 2024 03:06

address comments on the docstring

bcf3add

fix import accidentally removed by ruff

b11e1ae

fix device_id convention for CPU

ede5076

leofang mentioned this pull request Nov 30, 2024

Add ruff linter #201

Merged

leofang added 2 commits December 2, 2024 00:04

rename viewable to args_viewable_as_strided_memory

8027c78

rename device_accessible to is_device_accessible for consistency

66377d8

leofang changed the title ~~Add docs, tests, and samples for StridedMemoryView/@viewable~~ Add docs, tests, and samples for StridedMemoryView/@args_viewable_as_strided_memory_view Dec 2, 2024

leofang changed the title ~~Add docs, tests, and samples for StridedMemoryView/@args_viewable_as_strided_memory_view~~ Add docs, tests, and samples for StridedMemoryView/@args_viewable_as_strided_memory Dec 2, 2024

leofang added 2 commits December 2, 2024 02:42

enforce line ending in the whole codebase

b5cfdce

add a code sample for strided memory view

9572f8a

leofang force-pushed the strided_memory_view branch from 73f34ad to 9572f8a Compare December 2, 2024 02:43

leofang marked this pull request as ready for review December 2, 2024 02:47

fix formatting again

069f057

leofang mentioned this pull request Dec 2, 2024

StridedMemoryView should expose access to the producer stream if possible #259

Closed

rwgk reviewed Dec 2, 2024

leofang added 2 commits December 2, 2024 15:34

address review comments

5e393f8

programmatically load cffi functions

8e209aa

leofang requested a review from rwgk December 2, 2024 21:11

Merge branch 'main' into strided_memory_view

bd1d944

rwgk reviewed Dec 3, 2024

fix import fallback; address review comments

6af4da3

rwgk previously approved these changes Dec 3, 2024

leofang mentioned this pull request Dec 3, 2024

Add cluster to LaunchConfig to support thread block clusters on Hopper #261

Merged

vzhurba01 reviewed Dec 3, 2024

vzhurba01 reviewed Dec 3, 2024

test readonly with numpy; rename use_stream parameter

16fc9f6

leofang dismissed rwgk’s stale review via 16fc9f6 December 4, 2024 02:35

leofang merged commit a725723 into NVIDIA:main Dec 4, 2024

leofang deleted the strided_memory_view branch December 4, 2024 02:39
