TST enable non-CPU device testing via array-api-strict #30090

ogrisel · 2024-10-17T16:11:17Z

This is an early draft PR that attempts to leverage the multi-device support recently merged in array-api-strict: data-apis/array-api-strict#59

We need to wait for a release of array-api-strict + a lock file update to actually get this to run on our CI.

However, I think we should investigate failures early in scikit-learn because I suspect that some (most?) of them are not necessarily a problem in scikit-learn but might be bugs in array-api-strict's device support itself.

/cc @betatim

github-actions · 2024-10-17T16:12:40Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 5e9ff07.

ogrisel · 2024-10-17T16:14:06Z

Here is the output of

$ pytest -v -k array_api_strict  -l -x

on my machine with the main branch of array-api-strict:

==================================================================== test session starts ====================================================================
platform darwin -- Python 3.12.5, pytest-8.3.2, pluggy-1.5.0 -- /Users/ogrisel/miniforge3/envs/dev/bin/python3.12
cachedir: .pytest_cache
hypothesis profile 'default' -> database=DirectoryBasedExampleDatabase(PosixPath('/Users/ogrisel/code/scikit-learn/.hypothesis/examples'))
rootdir: /Users/ogrisel/code/scikit-learn
configfile: setup.cfg
testpaths: sklearn
plugins: repeat-0.9.2, hypothesis-6.112.2, anyio-4.4.0, run-parallel-0.1.0, xdist-3.6.1
collected 37763 items / 37368 deselected / 2 skipped / 395 selected                                                                                         

sklearn/decomposition/tests/test_pca.py::test_pca_array_api_compliance[PCA(n_components=2,svd_solver='full')-check_array_api_input_and_values-array_api_strict-device1-float64] PASSED [  0%]
sklearn/decomposition/tests/test_pca.py::test_pca_array_api_compliance[PCA(n_components=2,svd_solver='full')-check_array_api_input_and_values-array_api_strict-device2-float32] FAILED [  0%]

========================================================================= FAILURES ==========================================================================
__________ test_pca_array_api_compliance[PCA(n_components=2,svd_solver='full')-check_array_api_input_and_values-array_api_strict-device2-float32] ___________

estimator = PCA(n_components=2, svd_solver='full'), check = <function check_array_api_input_and_values at 0x1337b54e0>, array_namespace = 'array_api_strict'
device = array_api_strict.Device('device1'), dtype_name = 'float32'

    @pytest.mark.parametrize(
        "array_namespace, device, dtype_name", yield_namespace_device_dtype_combinations()
    )
    @pytest.mark.parametrize(
        "check",
        [check_array_api_input_and_values, check_array_api_get_precision],
        ids=_get_check_estimator_ids,
    )
    @pytest.mark.parametrize(
        "estimator",
        [
            PCA(n_components=2, svd_solver="full"),
            PCA(n_components=2, svd_solver="full", whiten=True),
            PCA(n_components=0.1, svd_solver="full", whiten=True),
            PCA(n_components=2, svd_solver="covariance_eigh"),
            PCA(n_components=2, svd_solver="covariance_eigh", whiten=True),
            PCA(
                n_components=2,
                svd_solver="randomized",
                power_iteration_normalizer="QR",
                random_state=0,  # how to use global_random_seed here?
            ),
        ],
        ids=_get_check_estimator_ids,
    )
    def test_pca_array_api_compliance(
        estimator, check, array_namespace, device, dtype_name
    ):
        name = estimator.__class__.__name__
>       check(name, estimator, array_namespace, device=device, dtype_name=dtype_name)

array_namespace = 'array_api_strict'
check      = <function check_array_api_input_and_values at 0x1337b54e0>
device     = array_api_strict.Device('device1')
dtype_name = 'float32'
estimator  = PCA(n_components=2, svd_solver='full')
name       = 'PCA'

sklearn/decomposition/tests/test_pca.py:1036: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
sklearn/utils/estimator_checks.py:861: in check_array_api_input_and_values
    return check_array_api_input(
        array_namespace = 'array_api_strict'
        device     = array_api_strict.Device('device1')
        dtype_name = 'float32'
        estimator_orig = PCA(n_components=2, svd_solver='full')
        name       = 'PCA'
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

name = 'PCA', estimator_orig = PCA(n_components=2, svd_solver='full'), array_namespace = 'array_api_strict', device = array_api_strict.Device('device1')
dtype_name = 'float32', check_values = True

    def check_array_api_input(
        name,
        estimator_orig,
        array_namespace,
        device=None,
        dtype_name="float64",
        check_values=False,
    ):
        """Check that the estimator can work consistently with the Array API
    
        By default, this just checks that the types and shapes of the arrays are
        consistent with calling the same estimator with numpy arrays.
    
        When check_values is True, it also checks that calling the estimator on the
        array_api Array gives the same results as ndarrays.
        """
        xp = _array_api_for_tests(array_namespace, device)
    
        X, y = make_classification(random_state=42)
        X = X.astype(dtype_name, copy=False)
    
        X = _enforce_estimator_tags_X(estimator_orig, X)
        y = _enforce_estimator_tags_y(estimator_orig, y)
    
        est = clone(estimator_orig)
    
        X_xp = xp.asarray(X, device=device)
        y_xp = xp.asarray(y, device=device)
    
        est.fit(X, y)
    
        array_attributes = {
            key: value for key, value in vars(est).items() if isinstance(value, np.ndarray)
        }
    
        est_xp = clone(est)
        with config_context(array_api_dispatch=True):
            est_xp.fit(X_xp, y_xp)
            input_ns = get_namespace(X_xp)[0].__name__
    
        # Fitted attributes which are arrays must have the same
        # namespace as the one of the training data.
        for key, attribute in array_attributes.items():
            est_xp_param = getattr(est_xp, key)
            with config_context(array_api_dispatch=True):
                attribute_ns = get_namespace(est_xp_param)[0].__name__
            assert attribute_ns == input_ns, (
                f"'{key}' attribute is in wrong namespace, expected {input_ns} "
                f"got {attribute_ns}"
            )
    
>           assert array_device(est_xp_param) == array_device(X_xp)
E           AssertionError

X          = array([[-2.0251427 ,  0.0291022 , -0.4749453 , ..., -0.33450124,
         0.8657552 , -1.2002964 ],
       [ 1.6137112... ],
       [-0.00607091,  1.3085763 , -0.17495976, ...,  0.99204236,
         0.3216978 , -0.66809046]], dtype=float32)
X_xp       = Array([[-2.0251427 ,
         0.0291022 ,
        -0.4749453 ,
        ...,
        -0.33450124,
         0.8657552 ,
...
         0.3216978 ,
        -0.66809046]], dtype=array_api_strict.float32, device=array_api_strict.Device('device1'))
array_attributes = {'components_': array([[ 0.03484652,  0.6045526 , -0.09228071, -0.09317975,  0.02118714,
        -0.46225083,  0.03672...52413,  0.13682878,
       -0.03120608,  0.05840071,  0.055825  ,  0.12556158, -0.03976958],
      dtype=float32), ...}
array_namespace = 'array_api_strict'
attribute  = array([ 0.18988031,  0.03833218,  0.07648806,  0.08370368,  0.02213484,
       -0.04884844,  0.02524958, -0.11081639, ... 0.00852413,  0.13682878,
       -0.03120608,  0.05840071,  0.055825  ,  0.12556158, -0.03976958],
      dtype=float32)
attribute_ns = 'array_api_strict'
check_values = True
device     = array_api_strict.Device('device1')
dtype_name = 'float32'
est        = PCA(n_components=2, svd_solver='full')
est_xp     = PCA(n_components=2, svd_solver='full')
est_xp_param = Array([ 0.18988031,  0.03833218,
        0.07648806,  0.08370368,
        0.02213484, -0.04884844,
        0.02524958,...682878, -0.03120608,
        0.05840071,  0.055825  ,
        0.12556158, -0.03976958], dtype=array_api_strict.float32)
estimator_orig = PCA(n_components=2, svd_solver='full')
input_ns   = 'array_api_strict'
key        = 'mean_'
name       = 'PCA'
xp         = <module 'array_api_strict' from '/Users/ogrisel/code/array-api-strict/array_api_strict/__init__.py'>
y          = array([0, 0, 1, 1, 0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 1, 1, 1, 0, 0, 1, 1, 0,
       0, 0, 0, 1, 1, 0, 1, 0, 0, 0, 0, 0, 0,...1,
       0, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 0, 1, 0, 1, 0, 1, 1, 1, 1,
       1, 0, 0, 1, 0, 1, 1, 0, 1, 1, 0, 0])
y_xp       = Array([0,
       0,
       1,
       1,
       0,
       0,
       0,
       1,
       0,
       1,
       1,
       0...   0,
       1,
       1,
       0,
       0], dtype=array_api_strict.int64, device=array_api_strict.Device('device1'))

sklearn/utils/estimator_checks.py:762: AssertionError
================================================================== short test summary info ==================================================================
FAILED sklearn/decomposition/tests/test_pca.py::test_pca_array_api_compliance[PCA(n_components=2,svd_solver='full')-check_array_api_input_and_values-array_api_strict-device2-float32] - AssertionError
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
=========================================== 1 failed, 1 passed, 2 skipped, 37368 deselected, 23 warnings in 8.24s ===========================================

This same test passes with PyTorch and CUDA or MPS devices, so I suspect that the lack of device propagation in the computation of the mean_ attribute might reveal a bug in array-api-strict itself. I have not yet investigated in detail.
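
To make the suspected failure mode concrete, here is a minimal pure-Python sketch. `ToyArray`, `DEFAULT_DEVICE`, `mean_dropping_device` and `mean_keeping_device` are hypothetical stand-ins, not array-api-strict or scikit-learn code. They only illustrate how a reduction that allocates its result without forwarding the device ends up on the default device, which is the kind of mismatch the failing `array_device(...) == array_device(...)` assertion detects.

```python
from dataclasses import dataclass

DEFAULT_DEVICE = "CPU_DEVICE"  # hypothetical default device name


@dataclass
class ToyArray:
    """Hypothetical stand-in for an array that remembers its device."""

    values: list
    device: str = DEFAULT_DEVICE


def mean_dropping_device(x: ToyArray) -> ToyArray:
    # Bug pattern: the result is allocated without forwarding x.device,
    # so it silently lands on the default device.
    return ToyArray([sum(x.values) / len(x.values)])


def mean_keeping_device(x: ToyArray) -> ToyArray:
    # Fix pattern: every derived array inherits the device of its input.
    return ToyArray([sum(x.values) / len(x.values)], device=x.device)


X = ToyArray([1.0, 2.0, 3.0], device="device1")
print(mean_dropping_device(X).device == X.device)  # False
print(mean_keeping_device(X).device == X.device)   # True
```

Libraries with a single device cannot hit the first case, which may be why a strict multi-device implementation is useful for surfacing it.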

betatim · 2024-10-21T14:17:00Z

I think we will need data-apis/array-api-strict#73 and data-apis/array-api-strict#72 for this PR to work

betatim · 2024-10-21T15:16:20Z

Another issue that needs resolving: scipy/scipy#21736

ogrisel · 2025-02-06T14:14:58Z

We should update the lock file to try to run the tests in this PR with the new version of array-api-strict.

EDIT: the lock files have probably already been updated in main, so let's just sync this branch and see what happens.

ogrisel · 2025-02-07T17:31:17Z

@betatim I started to update this PR: it uncovered several device-handling issues and possibly some dtype-related ones. I have not yet fixed them all, feel free to take over at any point :)

sklearn/decomposition/_pca.py

sklearn/metrics/_regression.py

To compute batch sizes and memory sizes we don't need the array API: that math can be done with plain Python types. This change also fixes a slicing error that only appears with array-api-strict; that fix is unrelated to the switch to Python types.
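
As a rough illustration of doing this bookkeeping with plain Python numbers, the sketch below computes a batch size from a memory budget and yields the matching slices. The helpers `rows_per_batch` and `batch_slices` and the budget formula are assumptions made for this example; they are not the code changed in this PR.

```python
import math


def rows_per_batch(n_features: int, itemsize: int, max_bytes: int) -> int:
    """Hypothetical helper: how many rows fit in a memory budget."""
    bytes_per_row = n_features * itemsize
    # Integer division keeps the result a plain Python int (at least 1).
    return max(max_bytes // bytes_per_row, 1)


def batch_slices(n_samples: int, batch_size: int):
    """Yield slice objects covering range(n_samples) in batches."""
    n_batches = math.ceil(n_samples / batch_size)
    for i in range(n_batches):
        start = i * batch_size
        yield slice(start, min(start + batch_size, n_samples))


batch_size = rows_per_batch(n_features=20, itemsize=4, max_bytes=1024)
print(batch_size)  # 12
print(list(batch_slices(30, batch_size)))
```

Because no array is created for these intermediate quantities, there is no dtype or device to keep consistent with the input data.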

The scipy implementation contains a bug with respect to setting the device of all the arrays it creates. This adds xlogy() to our group of functions we implement ourselves.

get_namespace_and_device is used in functions that support the xp short circuit, so I think it makes sense to make this function look similar to get_namespace.

betatim · 2025-03-03T12:47:38Z

I pinged Omar and Guillaume for reviews. You don't have to review this, but I thought it might be interesting for you two (and it would solve the problem that neither Olivier nor I can approve this).

OmarManzoor

Thank you @betatim and @ogrisel.
Generally looks good, just a few comments.

sklearn/decomposition/_pca.py

sklearn/metrics/_regression.py

sklearn/utils/_array_api.py

This reduces the amount of `xp.asarray` that we need to convert scalars to arrays for the array API

Co-authored-by: Omar Salman <omar.salman2007@gmail.com>

sklearn/metrics/_regression.py

…e_deviance" This reverts commit 920932f.

ogrisel · 2025-03-06T15:22:13Z

@OmarManzoor @betatim after #30090 (comment), the code is simpler, and all tests pass everywhere.

+1 for merge on my side.

OmarManzoor · 2025-03-06T15:24:06Z

> @OmarManzoor @betatim after #30090 (comment), the code is simpler, and all tests pass everywhere.
>
> +1 for merge on my side.

👍 Let's wait for the CI to complete and I'll review and merge

sklearn/metrics/pairwise.py

sklearn/utils/_array_api.py

OmarManzoor

LGTM. Thank you @ogrisel and @betatim

TST enable non-CPU device testing via array-api-strict

5358eff

github-actions bot added the module:utils label Oct 17, 2024

ogrisel added the module:test-suite (everything related to our tests) and Array API labels Oct 17, 2024

ogrisel mentioned this pull request Oct 21, 2024

Improvements to device support data-apis/array-api-strict#70

Open

betatim mentioned this pull request Oct 22, 2024

Python scalars in elementwise functions data-apis/array-api#807

Closed

Use correct device when creating arrays

6f4976b

ogrisel added 4 commits February 7, 2025 14:34

Merge branch 'main' into multi-device-array-api-strict

dc2c37a

Make check_array_api_metric array-api-strict aware

6edbb37

Similar fix in check_array_api_input

b2fc401

Forward device info in temp array in LDA

873e9ef

Merge branch 'main' into multi-device-array-api-strict

3424d82

ogrisel commented Feb 20, 2025

sklearn/decomposition/_pca.py (outdated, resolved)

ogrisel commented Feb 20, 2025

sklearn/metrics/_regression.py (resolved)

betatim added 7 commits February 26, 2025 15:57

Propagate device and dtype when we create new, derived arrays

5adbde4

Add custom xlogy implementation

f21d280

The scipy implementation contains a bug with respect to setting the device of all the arrays it creates. This adds xlogy() to our group of functions we implement ourselves.
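
For readers unfamiliar with `xlogy`, the sketch below shows only the element-wise rule such a helper has to implement: `x * log(y)`, with the result defined as 0 wherever `x == 0`. It is a list-based illustration with a hypothetical name, `xlogy_elementwise`, and is not the implementation added in this commit. An array-based version would additionally have to create any new arrays on the same device as its inputs.

```python
import math


def xlogy_elementwise(xs, ys):
    """Hypothetical list-based sketch of x * log(y), with 0 where x == 0."""
    out = []
    for x, y in zip(xs, ys):
        # The x == 0 branch avoids evaluating log(0) and returns 0 instead.
        out.append(0.0 if x == 0 else x * math.log(y))
    return out


print(xlogy_elementwise([0.0, 2.0], [0.0, 1.0]))  # [0.0, 0.0]
```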

Make sure to propagate device information when creating adhoc arrays

426ef6c

Add xp= keyword argument to get_namespace_and_device

3216ebc

get_namespace_and_device is used in functions that support the xp short circuit, so I think it makes sense to make this function look similar to get_namespace.
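
The short-circuit pattern can be sketched as follows. This is a simplified, hypothetical illustration: `namespace_and_device` and `ToyDeviceArray` are made-up names, and the real helper's signature and return value may differ. The point is only that a caller who already knows the namespace can pass `xp=` to skip the inspection step while the device is still read from the input.

```python
def namespace_and_device(*arrays, xp=None):
    """Hypothetical sketch: return (namespace, device) for the inputs.

    If the caller already knows the namespace, passing xp= skips the
    inspection step; the device is still read from the first array.
    """
    device = getattr(arrays[0], "device", None)
    if xp is not None:
        return xp, device  # short circuit: trust the caller's namespace
    return arrays[0].__array_namespace__(), device


class ToyDeviceArray:
    """Hypothetical array exposing the two attributes used above."""

    def __init__(self, device):
        self.device = device

    def __array_namespace__(self):
        return "toy_namespace"


a = ToyDeviceArray(device="device1")
print(namespace_and_device(a))                 # ('toy_namespace', 'device1')
print(namespace_and_device(a, xp="known_xp"))  # ('known_xp', 'device1')
```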

Fix

cebf42d

Fix

5e4356a

betatim added the No Changelog Needed and CUDA CI labels Feb 28, 2025

github-actions bot removed the CUDA CI label Feb 28, 2025

betatim requested review from glemaitre and OmarManzoor March 3, 2025 12:46

OmarManzoor reviewed Mar 4, 2025

sklearn/decomposition/_pca.py (outdated, resolved)

sklearn/metrics/_regression.py (resolved)

sklearn/utils/_array_api.py (outdated, resolved)

betatim and others added 2 commits March 4, 2025 09:19

Switch to using math.lgamma

70c2f46

This reduces the amount of `xp.asarray` that we need to convert scalars to arrays for the array API
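
A small sketch of why this helps, assuming the argument is a plain Python number (the variable `n_features` is illustrative only): `math.lgamma` from the standard library operates directly on scalars, so no array has to be created and no dtype or device has to be forwarded.

```python
import math

n_features = 5  # plain Python int, e.g. taken from a shape tuple

# math.lgamma works directly on Python scalars: no array creation,
# hence no dtype or device to forward.
log_gamma = math.lgamma(n_features)

# Sanity check: Gamma(5) = 4! = 24.
print(math.isclose(log_gamma, math.log(24.0)))  # True
```

The resulting Python float can then be combined with arrays from any namespace through ordinary arithmetic.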

Update sklearn/utils/_array_api.py

9b563b2

Co-authored-by: Omar Salman <omar.salman2007@gmail.com>

ogrisel commented Mar 4, 2025

sklearn/metrics/_regression.py (outdated, resolved)

Remove xp.asarray(..., device=device_) idioms in _mean_tweedie_deviance

920932f

ogrisel added the CUDA CI label Mar 6, 2025