
Remove NumPy <2 pin #6031

Merged: 12 commits merged into rapidsai:branch-24.10 on Aug 28, 2024

Conversation

@seberg (Contributor) commented Aug 19, 2024

This PR removes the NumPy<2 pin. The removal is expected to work for RAPIDS projects once CuPy 13.3.0 is released (CuPy 13.2.0 had some issues preventing its use with NumPy 2).
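
As a rough illustration of the compatibility expectation described above, here is a minimal sketch (not part of this PR; it assumes numpy, cupy, and packaging are importable in the environment) that checks an environment against the CuPy 13.3.0 expectation when NumPy 2 is installed:

# Hypothetical environment check, not taken from the cuML codebase.
from packaging.version import Version

import cupy
import numpy

if Version(numpy.__version__).major >= 2:
    # Per the note above, CuPy 13.2.0 had issues with NumPy 2; 13.3.0 is
    # expected to fix them.
    assert Version(cupy.__version__) >= Version("13.3.0"), (
        f"CuPy {cupy.__version__} predates the NumPy 2 fixes; "
        "upgrade to CuPy >= 13.3.0"
    )
print(f"NumPy {numpy.__version__} with CuPy {cupy.__version__}")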

@seberg seberg added non-breaking Non-breaking change improvement Improvement / enhancement to an existing function labels Aug 19, 2024
@github-actions github-actions bot added conda conda issue Cython / Python Cython or Python issue labels Aug 19, 2024
@jakirkham (Member) commented:

Updating branch to pull in the latest upstream changes and restart CI now that cuDF is done: rapidsai/cudf#16300

@jakirkham jakirkham marked this pull request as ready for review August 24, 2024 05:50
@jakirkham jakirkham requested a review from a team as a code owner August 24, 2024 05:50
@jakirkham (Member) commented Aug 24, 2024

One GHA job failed with an unrelated CUDA initialization error

Unfortunately this seems to be showing up more in CI:

Will raise this offline for discussion

E   UserWarning: Error getting driver and runtime versions:
E
E   stdout:
E
E
E
E   stderr:
E
E   Traceback (most recent call last):
E     File "/opt/conda/envs/test/lib/python3.11/site-packages/numba/cuda/cudadrv/driver.py", line 254, in ensure_initialized
E       self.cuInit(0)
E     File "/opt/conda/envs/test/lib/python3.11/site-packages/numba/cuda/cudadrv/driver.py", line 327, in safe_cuda_api_call
E       self._check_ctypes_error(fname, retcode)
E     File "/opt/conda/envs/test/lib/python3.11/site-packages/numba/cuda/cudadrv/driver.py", line 395, in _check_ctypes_error
E       raise CudaAPIError(retcode, msg)
E   numba.cuda.cudadrv.driver.CudaAPIError: [999] Call to cuInit results in CUDA_ERROR_UNKNOWN
E
E   During handling of the above exception, another exception occurred:
E
E   Traceback (most recent call last):
E     File "<string>", line 4, in <module>
E     File "/opt/conda/envs/test/lib/python3.11/site-packages/numba/cuda/cudadrv/driver.py", line 292, in __getattr__
E       self.ensure_initialized()
E     File "/opt/conda/envs/test/lib/python3.11/site-packages/numba/cuda/cudadrv/driver.py", line 258, in ensure_initialized
E       raise CudaSupportError(f"Error at driver init: {description}")
E   numba.cuda.cudadrv.error.CudaSupportError: Error at driver init: Call to cuInit results in CUDA_ERROR_UNKNOWN (999)
E
E
E   Not patching Numba

For now will just restart the failed jobs after the others complete

Edit: Another job had the same sort of error

Edit 2: And again in this job after restarting
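
For reference, a minimal sketch of how one could probe CUDA driver initialization on a node using Numba's public API (this is an illustration on my part, not taken from the CI scripts; CudaSupportError is the same exception type raised in the traceback above):

# Hypothetical standalone probe for flaky CUDA driver initialization.
from numba import cuda
from numba.cuda.cudadrv.error import CudaSupportError

try:
    if cuda.is_available():   # driver init (cuInit) happens lazily under the hood
        cuda.detect()         # prints the devices Numba can see
    else:
        print("No usable CUDA driver/device detected on this node")
except CudaSupportError as exc:   # e.g. cuInit -> CUDA_ERROR_UNKNOWN (999)
    print(f"Driver init failed: {exc}")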

@@ -509,7 +509,7 @@ dependencies:
   - *scikit_learn
   - statsmodels
   - umap-learn==0.5.3
-  - pynndescent==0.5.8
+  - pynndescent
Review comment (Member):

@dantegd, is it alright if we relax this pin?

Review comment (Member):

Given passing tests, it should be fine to relax now

@jakirkham (Member) commented:

/merge

@jakirkham (Member) commented:

Am seeing the wheel-tests-cuml CUDA 12.5 job getting stuck in the Dask tests (though don't see this with the CUDA 11.8 job). Not sure why that is given both would be using the same NumPy versions. Going to try merging in the upstream branch in case there is some fix we are missing. If there is an issue with this CI node, maybe that will give us a new one as well

@jakirkham (Member) commented:

Am seeing the wheel-tests-cuml CUDA 12.5 job getting stuck in the Dask tests (though don't see this with the CUDA 11.8 job).

Still seeing this issue. Going to test CI separately from this change in PR: #6047

@jakirkham (Member) commented:

So that PR's CI builds fail because pynndescent is pinned to the old version (and thus doesn't have this fix: lmcinnes/pynndescent#242)

@jakirkham (Member) commented:

Was searching around in the code for clues. Just came across this, which was unexpected

foreach(target IN LISTS targets_using_numpy)
  target_include_directories(${target} PRIVATE "${Python_NumPy_INCLUDE_DIRS}")
endforeach()

Does cuML need NumPy at build time?

If so, would have expected to see cimport numpy or similar in those Cython files, but am not seeing that
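
To double-check that, here is a small sketch one could run from the repository root (a hypothetical helper, not part of this PR; the python/cuml path is an assumption about the repo layout):

# Hypothetical helper: list Cython sources that actually cimport numpy.
from pathlib import Path

cython_sources = list(Path("python/cuml").rglob("*.pyx")) + list(
    Path("python/cuml").rglob("*.pxd")
)
hits = [
    path
    for path in cython_sources
    if "cimport numpy" in path.read_text(encoding="utf-8", errors="ignore")
]
print(hits or "no Cython source cimports numpy")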

@jakirkham jakirkham requested a review from a team as a code owner August 28, 2024 01:13
@github-actions github-actions bot added the CMake label Aug 28, 2024
Comment on lines -37 to -40

foreach(target IN LISTS targets_using_numpy)
  target_include_directories(${target} PRIVATE "${Python_NumPy_INCLUDE_DIRS}")
endforeach()
Review comment (Member):

Trying to drop this. AFAICT the Cython modules above don't cimport numpy, so they wouldn't need this.

Not sure whether it would cause the tests to hang. At a minimum, it is unused, so it is worth cleaning up.

Review comment (Member):

Yeah not sure what happened there. I traced back in the blame to where this was added. Looks like @vyasr recommended removing it back at the source: #4818 (comment)

But there wasn't any additional discussion on that PR (maybe it happened somewhere else), and the change was merged in.

I agree with you that it seems to be unused.

Review comment (Member):

Originally I was looking for clues on fixing the hanging test (#6031 (comment)). Tried this just in case it helped, but it didn't matter.

Read through the history here yesterday. It seems like NumPy was a build dependency a while back (though even then it wasn't clear whether it was being used). Since then, every update seems to have assumed NumPy was a build dependency. However, as we don't require it during the build, that assumption isn't actually satisfied.

Further, we would have needed find_package(Python REQUIRED COMPONENTS Development NumPy) to find NumPy and set Python_NumPy_INCLUDE_DIRS, but we don't do that either.

Think this hasn't presented much of an issue as we don't actually set targets_using_numpy.

In any event, this seems as good a time as any to clean this up.

@@ -229,6 +229,7 @@ dependencies:
   - dask-cuda==24.10.*,>=0.0.0a0
   - joblib>=0.11
   - numba>=0.57
+  - numpy>=1.23,<3.0a0
Review comment (Member):

It seems we have a NumPy dependency.

However, it isn't getting declared as one, so I've explicitly added NumPy as a dependency.

@jakirkham (Member) commented Aug 28, 2024

Divye documented the CI hang occurring with the pytest cuml-dask CUDA 12.5 wheel job in issue: #6050

He also added a skip for that test in PR: #6051

Unfortunately, other CI jobs still fail due to NumPy 2 being unconstrained and an incompatible pynndescent being installed, as observed in a no-change PR: #6047 (comment)

Fortunately the latter fix is already here

In the hopes of getting CI to pass, have merged Divye's PR into this one. That way all the fixes and skips for CI are in one place

@jameslamb (Member) left a comment:

Left one suggestion to fix the failing wheel tests; otherwise, the packaging changes here look good to me.

I searched around a bit and looked through the CI logs, and I don't see any other issues.

ci/test_wheel.sh (suggestion outdated, resolved)
@jakirkham (Member) commented:

Looks like this CI job had one test failure

______________________ test_weighted_kmeans[10-10-25-100] ______________________
[gw0] linux -- Python 3.11.9 /pyenv/versions/3.11.9/bin/python

nrows = 100, ncols = 25, nclusters = 10, max_weight = 10, random_state = 428096

    @pytest.mark.parametrize("nrows", [100, 500])
    @pytest.mark.parametrize("ncols", [25])
    @pytest.mark.parametrize("nclusters", [5, 10])
    @pytest.mark.parametrize("max_weight", [10])
    def test_weighted_kmeans(nrows, ncols, nclusters, max_weight, random_state):
    
        # Using fairly high variance between points in clusters
        cluster_std = 1.0
        np.random.seed(random_state)
    
        # set weight per sample to be from 1 to max_weight
        wt = np.random.randint(1, high=max_weight, size=nrows)
    
        X, y = make_blobs(
            nrows,
            ncols,
            nclusters,
            cluster_std=cluster_std,
            shuffle=False,
            random_state=0,
        )
    
        cuml_kmeans = cuml.KMeans(
            init="k-means++",
            n_clusters=nclusters,
            n_init=10,
            random_state=random_state,
            output_type="numpy",
        )
    
        cuml_kmeans.fit(X, sample_weight=wt)
        cu_score = cuml_kmeans.score(X)
    
        sk_kmeans = cluster.KMeans(random_state=random_state, n_clusters=nclusters)
        sk_kmeans.fit(cp.asnumpy(X), sample_weight=wt)
        sk_score = sk_kmeans.score(cp.asnumpy(X))
    
>       assert abs(cu_score - sk_score) <= cluster_std * 1.5
E       assert 6151.191162109375 <= (1.0 * 1.5)
E        +  where 6151.191162109375 = abs((-2365.749267578125 - -8516.9404296875))

test_kmeans.py:174: AssertionError
---------------------------- Captured stdout setup -----------------------------
[D] [20:29:31.325625] /__w/cuml/cuml/python/cuml/build/cp311-cp311-linux_aarch64/cuml/internals/logger.cxx:5269 Random seed: 428096

Not entirely sure why that happened (or why this only happens now)

Given we don't see this test failure anywhere else, am going to assume this was a flaky test and try restarting

Though documenting it here in case it comes up again (in the event it needs follow-up)

@rapids-bot rapids-bot bot merged commit e371e53 into rapidsai:branch-24.10 Aug 28, 2024
55 checks passed
@jakirkham (Member) commented Aug 28, 2024

Thanks all for your help here! 🙏

Looks like it passed and the old merge comment (#6031 (comment)) took effect

Let's follow up on the hanging test in issue: #6050

Happy to discuss anything else in new issues 🙂

Labels
ci, CMake, conda (conda issue), Cython / Python (Cython or Python issue), improvement (Improvement / enhancement to an existing function), non-breaking (Non-breaking change)
6 participants