Build for new archs #8

Merged (10 commits into conda-forge:main from new-archs, Oct 6, 2023)
Conversation

@carterbox carterbox commented Sep 28, 2023

Refactor the build scripts to target consistent CUDA archs across toolkit versions and enable PowerPC builds.

CUDA compatibility is increased. The binaries will now run on all architectures supported by each toolkit. Before, the minimum targeted arch for ppc64le and aarch64 was 50; now it is 35.

Compile-time optimization has decreased. I no longer target 61; those devices can run the 60 binaries. I tried to target the lowest arch from each named generation. 35 now only gets PTX, so it has to be compiled by the end users' CUDA driver for those devices; this will increase first startup times on those devices. Actually, for Windows, all archs have been getting PTX only, because it takes too long to generate the machine code here on the feedstock.
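
As a rough sketch, this targeting scheme could be expressed in build.sh roughly like so (assuming CMake >= 3.18 and a project that honors CMAKE_CUDA_ARCHITECTURES; the exact arch list and flags on the feedstock may differ):

    # "-virtual" ships PTX only (JIT-compiled by the end user's driver on
    # first run); "-real" ships ahead-of-time SASS for the lowest arch of
    # each generation. Later generations are elided from this sketch.
    export CMAKE_ARGS="${CMAKE_ARGS} -DCMAKE_CUDA_ARCHITECTURES=35-virtual;50-real;60-real;70-real"

A 60 binary also covers 61 devices because SASS is forward-compatible across minor revisions within the same major compute capability.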

Checklist

  • Used a personal fork of the feedstock to propose changes
  • Bumped the build number (if the version is unchanged)
  • Reset the build number to 0 (if the version changed)
  • Re-rendered with the latest conda-smithy (Use the phrase @conda-forge-admin, please rerender in a comment in this PR for automated rerendering)
  • Ensured the license file is being packaged.

@carterbox carterbox requested a review from a team as a code owner September 28, 2023 15:25
@conda-forge-webservices
Contributor

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe) and found it was in an excellent condition.

@carterbox
Member Author

I have tried appending a new CUDAToolkit_ROOT to CMAKE_ARGS. If that doesn't work, I'll try replacing the existing entry instead, so it isn't duplicated.
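
In build.sh, the two approaches would look something like the following sketch (CUDA_HOME stands in for wherever the toolkit root actually lives on the CI image):

    # Approach 1: append a hint for CMake's FindCUDAToolkit. This may leave
    # a duplicate if CMAKE_ARGS already carries a CUDAToolkit_ROOT entry.
    export CMAKE_ARGS="${CMAKE_ARGS} -DCUDAToolkit_ROOT=${CUDA_HOME}"

    # Approach 2: strip any existing entry first, then append, so the flag
    # appears exactly once.
    CMAKE_ARGS="$(printf '%s' "${CMAKE_ARGS}" | sed 's/-DCUDAToolkit_ROOT=[^ ]*//')"
    export CMAKE_ARGS="${CMAKE_ARGS} -DCUDAToolkit_ROOT=${CUDA_HOME}"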

recipe/meta.yaml Outdated
skip: true # [cuda_compiler_version == "None"]
skip: true # [cuda_compiler_version == "10.2"]
skip: true # [cuda_compiler_version == "11.0"]
skip: true # [cuda_compiler_version == "11.1"]
skip: true # [ppc64le]
skip: true # [cuda_compiler_version == "11.2"]
Contributor

Sorry, why is this necessary? Can we not be compatible with both with little effort?

Member Author

@carterbox carterbox Oct 2, 2023

Until what date do you want me to continue to release builds for 11.2? The 11.2 build takes too long for the CI, so I will need to reduce the number of target archs.

Contributor

Can you install the newer CUDA stuff alongside pytorch or tensorflow built with 11.2? If so, then that is OK with me.

We just don't have cuda != 11.2 for either of those.

Member Author

For clarity, the addition of ppc64le is unrelated to the removal of 11.2.

I think you would need an 11.2 build of magma to build pytorch with 11.2. I don't think you can use an 11.8 build of magma to build pytorch with 11.2, so I'll try to get the 11.2 build passing again.

Contributor

We could go the other way too, and try to build pytorch and tensorflow with 11.8.

Member

In terms of migrating to CUDA 11.8, both PyTorch ( conda-forge/pytorch-cpu-feedstock#195 ) and TensorFlow ( conda-forge/tensorflow-feedstock#344 ) have updated.

Regarding CUDA 11.2, PyTorch dropped it ( conda-forge/pytorch-cpu-feedstock#195 ) and TensorFlow is planning to ( conda-forge/tensorflow-feedstock#347 (comment) )

So maybe we can simplify the builds here

Member Author

Thanks! I'll keep that in mind for the next release!

Member

Note: Tracking in issue ( #12 )

@carterbox
Member Author

The variability in build times is frustrating; it varies by something like 15%!

@carterbox carterbox added the automerge Merge the PR when CI passes label Oct 4, 2023
@github-actions
Contributor

github-actions bot commented Oct 5, 2023

Hi! This is the friendly conda-forge automerge bot!

I considered the following status checks when analyzing this PR:

  • linter: passed
  • azure: failed

Thus the PR was not passing and not merged.

@github-actions
Contributor

github-actions bot commented Oct 5, 2023

Hi! This is the friendly conda-forge automerge bot!

Commits were made to this PR after the automerge label was added. For security reasons, I have disabled automerge by removing the automerge label. Please add the automerge label again (or ask a maintainer to do so) if you'd like to enable automerge again!

@github-actions github-actions bot removed the automerge Merge the PR when CI passes label Oct 5, 2023
@hmaarrfk
Contributor

hmaarrfk commented Oct 5, 2023

So did you end up "reducing" anything? Who will be affected if you did?

@carterbox
Member Author

Compatibility is actually increased. The binaries will run on all architectures supported by the toolkit. Before, the minimum arch for ppc64le or aarch64 was 50; now it is 35.

Optimization has decreased. I no longer target 61 (these devices can run the 60 binaries), and 35 only gets PTX, so it has to be compiled by the end users' CUDA driver for those devices. Actually, for Windows, all archs have been getting PTX only, because it takes too long to generate the machine code.

@carterbox carterbox merged commit 1765fda into conda-forge:main Oct 6, 2023
@carterbox carterbox deleted the new-archs branch October 6, 2023 16:28
@hmaarrfk
Contributor

hmaarrfk commented Oct 6, 2023

Does Nvidia provide a table for what "61" means in terms of the products that you buy?

GTX????
RTX????
