Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed support for distributed datasets in create_auto_config #2508

Merged
merged 5 commits into from
Sep 16, 2022
Merged

Conversation

tgaddair
Copy link
Collaborator

@tgaddair tgaddair commented Sep 16, 2022

Fixes #2499.
Related #2497.

Also changes exclusion logic to be less restrictive for index-looking fields.

@github-actions
Copy link

github-actions bot commented Sep 16, 2022

Unit Test Results

         6 files  ±  0         6 suites  ±0   2h 48m 27s ⏱️ + 15m 41s
  3 390 tests +  4  3 312 ✔️ +  4    78 💤 ±0  0 ±0 
10 170 runs  +12  9 913 ✔️ +12  257 💤 ±0  0 ±0 

Results for commit a34cfc3. ± Comparison against base commit 8947e19.

♻️ This comment has been updated with latest results.

@tgaddair tgaddair added bug Something isn't working release-0.6 labels Sep 16, 2022
@tgaddair tgaddair marked this pull request as draft September 16, 2022 04:50
@tgaddair tgaddair changed the title Fixed sampling with dask in create_auto_config Fixed support for distributed backends in create_auto_config Sep 16, 2022
@tgaddair tgaddair changed the title Fixed support for distributed backends in create_auto_config Fixed support for distributed datasets in create_auto_config Sep 16, 2022
@tgaddair tgaddair marked this pull request as ready for review September 16, 2022 05:21
Copy link
Contributor

@justinxzhao justinxzhao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for this change!

@@ -1,9 +1,13 @@
from abc import ABC, abstractmethod
from typing import List, Tuple

import dask.dataframe as dd
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This dask import should be ok since users using Ludwig automl are expected to pip install ludwig[distributed]. Is that right?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, the automl library requires Ray and Dask to work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

TypeError: '>' not supported between instance of 'str' and 'int' when tune_for_memory is True
2 participants