Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HyperParameterOptimizer runs only as single task at a time #613

Closed
navotoz opened this issue Mar 22, 2022 · 6 comments
Closed

HyperParameterOptimizer runs only as single task at a time #613

navotoz opened this issue Mar 22, 2022 · 6 comments

Comments

@navotoz
Copy link

navotoz commented Mar 22, 2022

Hello

I started the optimizer using the code from the example (with minor changes):

an_optimizer = HyperParameterOptimizer(
    base_task_id=args['template_task_id'],
    hyper_parameters=[
        DiscreteParameterRange('generator/A', values=[0, 1, 2]),
        DiscreteParameterRange('generator/B', values=[0, 1, 2]),
        DiscreteParameterRange('losses/l1', values=[1, 10, 0]),
    ],
    objective_metric_title='mse',
    objective_metric_series='Average',
    objective_metric_sign='min',
    max_number_of_concurrent_tasks=5,
    optimizer_class=aSearchStrategy,
    execution_queue=execution_queue,
    spawn_project=None,  # 'HPO spawn project',
    save_top_k_tasks_only=None,
    time_limit_per_job=None,
    pool_period_min=5,
    total_max_jobs=None,
    min_iteration_per_job=8,
    max_iteration_per_job=None,
    job_complete_callback=job_complete_callback
)

This setup worked fine, until I upgraded the the agent and the clearml pkg to lateset itself yesterdey.

Since yesterday, the optimizer spawns a single new task, only when the old task is done. Thus, only a single worker (out of 3) is working, while the rest are idle.

The version of clearml==1.3.1

Thanks for your help.

@erezalg
Copy link
Contributor

erezalg commented Mar 22, 2022

Hi @navotoz, What optimizer are you using?

@navotoz
Copy link
Author

navotoz commented Mar 22, 2022

Hi @erezalg, aSearchStrategy=OptimizerOptuna.
Everything else in the code is taken straight from the example on your site.

@erezalg
Copy link
Contributor

erezalg commented Mar 22, 2022

Hi @navotoz
I might know what it is, we changed the mechanism in which we do job parallelization in optuna per optuna's changes. Let me test it

@navotoz
Copy link
Author

navotoz commented Mar 22, 2022

Hi @erezalg,
Thanks for that.

@erezalg
Copy link
Contributor

erezalg commented Mar 24, 2022

Hi @navotoz, we released a new RC, please check if it solves the issue.
To install, run pip install clearml==1.3.2rc1

@navotoz
Copy link
Author

navotoz commented Mar 24, 2022

Working as expected!
Thank you.

@navotoz navotoz closed this as completed Mar 24, 2022
clearml-bot pushed a commit that referenced this issue Mar 24, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants