[MRG] fix prevewing public dataset errors #190

HuaizhengZhang · 2024-09-07T18:32:32Z

User description

Closes #188

PR Type

enhancement, bug fix

Description

Clarified the conditions under which the preview_csv_data function should be used for public datasets in the advisor documentation.
Changed the default mode in the CLI from 'general' to 'baseline' to better align with the intended functionality.
Updated the error message in the CLI to reflect the new mode options, ensuring users are informed of valid modes.
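To make the CLI change concrete, here is a minimal sketch of the dispatch logic, modeled on the `mle/cli.py` snippet quoted in the bot suggestions on this PR. It is not the project's actual code: `run_baseline` and `run_report` are hypothetical stand-ins for `workflow.baseline(...)` and the `report` command, and the click decorators are left out so the snippet runs with the standard library alone.

```python
def run_baseline() -> str:
    # Hypothetical stand-in for workflow.baseline(os.getcwd(), model).
    return "baseline workflow"


def run_report() -> str:
    # Hypothetical stand-in for ctx.invoke(report, ...).
    return "report workflow"


def start(mode: str = "baseline") -> str:
    """Dispatch on `mode`; 'baseline' is the default after this PR."""
    if mode == "baseline":
        return run_baseline()
    elif mode == "report":
        return run_report()
    else:
        # The message names exactly the modes handled above.
        raise ValueError("Invalid mode. Supported modes: 'baseline', 'report'.")
```

In this sketch, calling `start()` with no argument runs the baseline branch, while the old default value `'general'` falls through to the `ValueError` branch.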

Changes walkthrough 📝

Relevant files

Documentation

advisor.py (mle/agents/advisor.py, +1/-1)
`Clarify dataset preview conditions in advisor documentation`
Clarified the usage of `preview_csv_data` for public datasets. Updated documentation to specify conditions for previewing datasets.

Enhancement

cli.py (mle/cli.py, +3/-3)
`Update CLI default mode and error message`
Changed default mode from 'general' to 'baseline'. Updated error message to reflect new mode options.

💡 PR-Agent usage:
Comment /help on the PR to get a list of all available PR-Agent tools and their descriptions

github-actions · 2024-09-07T18:33:33Z

PR Reviewer Guide 🔍

⏱️ Estimated effort to review: 2 🔵🔵⚪⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Key issues to review

Ambiguity in Documentation: The added documentation in line 58 is ambiguous. It's unclear whether the function `preview_csv_data` should or should not be used for public datasets. Consider rephrasing for clarity.

Error Handling: The error message in line 59 only specifies 'baseline' and 'report' as valid modes. Ensure this aligns with all supported modes in the CLI.

github-actions · 2024-09-07T18:33:35Z

PR Code Suggestions ✨

Category: Possible bug (Score: 8)
Suggestion: Add conditional logic to ensure `preview_csv_data` is only called for local datasets

The conditional logic for determining whether to preview the CSV file based on the dataset type (public or local) is unclear and potentially missing. It's important to implement a clear conditional check to ensure that `preview_csv_data` is called only for local datasets and not for public datasets.

mle/agents/advisor.py [58]

    -the CSV file or not if the dataset is a public dataset.
    +if not is_public_dataset:
    +    preview_csv_data(dataset_path)

Suggestion importance[1-10]: 8
Why: The suggestion addresses a potential bug by clarifying when to call `preview_csv_data`, which is crucial for correct functionality when handling different dataset types.

Category: Maintainability (Score: 7)
Suggestion: Refactor mode handling to use a dictionary for scalability and maintainability

Consider handling additional modes in the `start` function or providing a more generic error handling mechanism. This would make the function more flexible and maintainable, especially if more modes are to be added in the future.

mle/cli.py [52-59]

    -if mode == 'baseline':
    -    # Baseline mode
    -    return workflow.baseline(os.getcwd(), model)
    -elif mode == 'report':
    -    # Report mode
    -    return ctx.invoke(report, model=model, visualize=False)
    -else:
    -    raise ValueError("Invalid mode. Supported modes: 'baseline', 'report'.")
    +mode_functions = {
    +    'baseline': lambda: workflow.baseline(os.getcwd(), model),
    +    'report': lambda: ctx.invoke(report, model=model, visualize=False)
    +}
    +try:
    +    return mode_functions[mode]()
    +except KeyError:
    +    raise ValueError(f"Invalid mode. Supported modes: {', '.join(mode_functions.keys())}.")

Suggestion importance[1-10]: 7
Why: The suggestion improves maintainability by using a dictionary for mode handling, making the code more scalable and easier to extend with additional modes.

Category: Possible issue (Score: 5)
Suggestion: Verify and potentially revert the default mode to 'general' if the change was unintended

The default value for the `mode` argument in the `start` command has been changed from 'general' to 'baseline'. Ensure that this change is intentional and correctly documented, as it alters the default behavior of the command.

mle/cli.py [43]

    -@click.argument('mode', default='baseline')
    +@click.argument('mode', default='general') # if reverting to previous default

Suggestion importance[1-10]: 5
Why: The suggestion is valid as it prompts verification of an intentional change in default behavior, which is important for maintaining expected functionality.
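The first suggestion above (guarding the preview call) can be sketched as runnable code. This is only an illustration under stated assumptions: the real `preview_csv_data` in `mle` is not shown in this PR, so the version below is a hypothetical stand-in built on the standard `csv` module, and `maybe_preview`, `is_public_dataset`, and `limit_rows` are names introduced here for the example. It also assumes a public dataset is identified by name and has no local CSV file to open.

```python
import csv
from itertools import islice
from typing import List, Optional


def preview_csv_data(path: str, limit_rows: int = 3) -> List[List[str]]:
    # Hypothetical stand-in: return the first `limit_rows` rows of a local CSV.
    with open(path, newline="") as f:
        return list(islice(csv.reader(f), limit_rows))


def maybe_preview(
    dataset: str, is_public_dataset: bool, limit_rows: int = 3
) -> Optional[List[List[str]]]:
    # Assumption: a public dataset has no local file, so skip the preview.
    if is_public_dataset:
        return None
    return preview_csv_data(dataset, limit_rows)
```

Returning `None` for public datasets keeps the caller's control flow explicit; whether the project enforces this in code or only in the advisor's instructions is not stated in this PR.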

fix prevewing public dataset errors

1b52f13

HuaizhengZhang requested review from huangyz0918 and leeeizhang September 7, 2024 18:32

dosubot bot added the size:XS (This PR changes 0-9 lines, ignoring generated files.) and bug (Something isn't working) labels Sep 7, 2024

github-actions bot added the enhancement (New feature or request), Bug fix, and Review effort [1-5]: 2 labels Sep 7, 2024

HuaizhengZhang requested review from zlfben and huangyz0918 and removed request for huangyz0918 and zlfben September 7, 2024 18:34

zlfben approved these changes Sep 7, 2024

dosubot bot added the lgtm (This PR has been approved by a maintainer) label Sep 7, 2024

HuaizhengZhang merged commit c0c82f3 into main Sep 7, 2024
4 checks passed

HuaizhengZhang deleted the hz/fix-public-data branch September 7, 2024 18:49
