-
Notifications
You must be signed in to change notification settings - Fork 174
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KMeans is slow on gpu #1444
Comments
For reference, see the benchmark in the https://github.com/soda-inria/sklearn-numba-dpex repo. Also note that we already notified @samir-nasibli about this problem but we decided to open a dedicated issue to track its resolution transparently. |
Hi @ogrisel thank you for this report as well! |
Let me try with:
If I run the above reproducer with the cpu device I get: AttributeError Traceback (most recent call last)
File <timed eval>:1
File ~/mambaforge/envs/intel/lib/python3.10/site-packages/sklearnex/preview/cluster/k_means.py:192, in KMeans.fit(self, X, y, sample_weight)
189 if sklearn_check_version("1.2"):
190 self._validate_params()
--> 192 dispatch(self, 'fit', {
193 'onedal': self.__class__._onedal_fit,
194 'sklearn': sklearn_KMeans.fit,
195 }, X, y, sample_weight)
197 return self
File ~/mambaforge/envs/intel/lib/python3.10/site-packages/sklearnex/_device_offload.py:161, in dispatch(obj, method_name, branches, *args, **kwargs)
158 backend, q, cpu_fallback = _get_backend(obj, q, method_name, *hostargs)
160 if backend == 'onedal':
--> 161 return branches[backend](obj, *hostargs, **hostkwargs, queue=q)
162 if backend == 'sklearn':
163 return branches[backend](obj, *hostargs, **hostkwargs)
File ~/mambaforge/envs/intel/lib/python3.10/site-packages/sklearnex/preview/cluster/k_means.py:216, in KMeans._onedal_fit(self, X, _, sample_weight, queue)
213 self._initialize_onedal_estimator()
214 self._onedal_estimator.fit(X, queue=queue)
--> 216 self._save_attributes()
File ~/mambaforge/envs/intel/lib/python3.10/site-packages/sklearnex/preview/cluster/_common.py:70, in BaseKMeans._save_attributes(self)
68 self._labels_ = self._onedal_estimator.labels_
69 self._inertia_ = self._onedal_estimator.inertia_
---> 70 self._algorithm = self._onedal_estimator._algorithm
71 self._cluster_centers_ = self._onedal_estimator.cluster_centers_
72 self._sparse = False
AttributeError: 'KMeans' object has no attribute '_algorithm' then with gpu:0 device, the code still breaks as follows but after waiting for approximately one minute (or more): AttributeError Traceback (most recent call last)
File <timed eval>:1
File ~/mambaforge/envs/intel/lib/python3.10/site-packages/sklearnex/preview/cluster/k_means.py:192, in KMeans.fit(self, X, y, sample_weight)
189 if sklearn_check_version("1.2"):
190 self._validate_params()
--> 192 dispatch(self, 'fit', {
193 'onedal': self.__class__._onedal_fit,
194 'sklearn': sklearn_KMeans.fit,
195 }, X, y, sample_weight)
197 return self
File ~/mambaforge/envs/intel/lib/python3.10/site-packages/sklearnex/_device_offload.py:161, in dispatch(obj, method_name, branches, *args, **kwargs)
158 backend, q, cpu_fallback = _get_backend(obj, q, method_name, *hostargs)
160 if backend == 'onedal':
--> 161 return branches[backend](obj, *hostargs, **hostkwargs, queue=q)
162 if backend == 'sklearn':
163 return branches[backend](obj, *hostargs, **hostkwargs)
File ~/mambaforge/envs/intel/lib/python3.10/site-packages/sklearnex/preview/cluster/k_means.py:216, in KMeans._onedal_fit(self, X, _, sample_weight, queue)
213 self._initialize_onedal_estimator()
214 self._onedal_estimator.fit(X, queue=queue)
--> 216 self._save_attributes()
File ~/mambaforge/envs/intel/lib/python3.10/site-packages/sklearnex/preview/cluster/_common.py:70, in BaseKMeans._save_attributes(self)
68 self._labels_ = self._onedal_estimator.labels_
69 self._inertia_ = self._onedal_estimator.inertia_
---> 70 self._algorithm = self._onedal_estimator._algorithm
71 self._cluster_centers_ = self._onedal_estimator.cluster_centers_
72 self._sparse = False
AttributeError: 'KMeans' object has no attribute '_algorithm' Versions:
|
Already fixed and available with 2024.0. |
but it will not be available for quite a while - and currently we are going through the integration and bunch of changes so even building from sources currently would be painfully. We can share internal build in 2-3 weeks |
Am a bit confused by performance in latest releases, should we look at |
Re-ran the scikit-learn-intelex kmeans benchmark while carefully installing scikit-learn-intelex==2023.2.1 from pip, it looks much better now 🤔 , the |
@fcharras Based on this result, can the issue be closed now? Also FYI that once #1634 is merged, KMeans will be out of preview and the SKLEARNEX_PREVIEW env variable will no longer be necessary. |
Was this about the perf regression observed on CPU or the new run on GPU? It does not seem that you ran it on GPU based on the results. |
The following snippet
show for
device=cpu
:(CPU with 224 cores)
and when
device="gpu:0"
(running with a max series gpu) it's very slow (I have it running for several minuts now, it's not over yet). On 100x less data it completes in about 4.5sc, extrapolating from that, the walltime would be almost an hour.We show with the implementation provided in the sklearn-numba-dpex project that this amount of data can run in less than 10sc on max series too.
Environment:
dpcpp-cpp-rt
with-c conda
channelThe text was updated successfully, but these errors were encountered: