Add job_timeout parameter for upgrade_cluster #18230

Merged 1 commit on Nov 29, 2018

borod108

The motivation here is that the job needs one timeout and the Ansible role
needs another: the job ends once its timeout, measured from the time the job
was created, has passed, while the Ansible playbook times out only if it is
stuck for too long in one phase. So we want to set the job timeout to
something huge (1 year) and set the real timeout through the playbook
extra_vars.

This is part of implementing: https://bugzilla.redhat.com/show_bug.cgi?id=1644605
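
To make the difference between the two timeouts concrete, here is a minimal sketch in plain Ruby. It is not ManageIQ code: `check_timeouts`, `JOB_TIMEOUT`, and `PHASE_TIMEOUT` are hypothetical names used only for illustration, and the one-hour phase limit is an assumed value. The job timeout is checked against the total time since the job started, while the phase timeout is checked against each phase on its own.

```ruby
# Hypothetical illustration only; none of these names come from ManageIQ.
JOB_TIMEOUT   = 365 * 24 * 60 * 60 # job-level limit in seconds (about 1 year)
PHASE_TIMEOUT = 60 * 60            # assumed per-phase limit in seconds

# Takes the number of seconds each phase ran and returns :phase_timeout,
# :job_timeout, or :ok, depending on which limit (if any) is exceeded first.
def check_timeouts(phase_durations, job_timeout: JOB_TIMEOUT, phase_timeout: PHASE_TIMEOUT)
  elapsed = 0
  phase_durations.each do |duration|
    # The playbook-style timeout looks only at the current phase.
    return :phase_timeout if duration > phase_timeout

    # The job-style timeout looks at the total time since the job started.
    elapsed += duration
    return :job_timeout if elapsed > job_timeout
  end
  :ok
end
```

With a small job timeout, a run of many short phases can fail even though no phase is stuck, which is why the job timeout is set very high and the per-phase limit is left to do the real work.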

@borod108
Author

@agrare I think this is the last one for this phase of the cluster upgrade.

@miq-bot
Member

miq-bot commented Nov 22, 2018

Checked commit borod108@e1965d6 with ruby 2.3.3, rubocop 0.52.1, haml-lint 0.20.0, and yamllint 1.10.0
2 files checked, 0 offenses detected
Everything looks fine. 🍰

@agrare agrare self-assigned this Nov 26, 2018
@agrare
Member

agrare commented Nov 26, 2018

@borod108 is there a corresponding PR to manageiq-providers-ovirt to accept and set job_timeout in ManageIQ::Providers::Redhat::AnsibleRoleWorkflow ?

@borod108
Author

@agrare I do not think that is required. In https://github.com/ManageIQ/manageiq-providers-ovirt/blob/master/app/models/manageiq/providers/redhat/ansible_role_workflow.rb
we do not override anything that uses job_timeout.
We just pass the timeout_job as time to the create_job:

def self.job_options(env_vars, extra_vars, role_options, timeout, poll_interval)

@agrare
Member

agrare commented Nov 29, 2018

@borod108 but won't that fail to affect the job timeout, since it is a class method?

Forgot that you already did that here https://github.com/ManageIQ/manageiq/pull/18108/files#diff-a8d27c2744e902e4796e24bb1560aec7R6 👍

@agrare agrare merged commit 0e1006b into ManageIQ:master Nov 29, 2018
@agrare
Member

agrare commented Nov 29, 2018

@borod108 if this is the last part needed for backporting this feature, let's get on the migration for STI for the other infra providers so we can get this moved to ovirt.

@borod108
Author

borod108 commented Dec 3, 2018

@miq-bot add-label hammer/yes

simaishi pushed a commit that referenced this pull request Dec 3, 2018
@simaishi
Contributor

simaishi commented Dec 3, 2018

Hammer backport details:

$ git log -1
commit aa3b07422c46d5e11a7ee4b7236230b4b2dfbbea
Author: Adam Grare <agrare@redhat.com>
Date:   Thu Nov 29 08:50:41 2018 -0500

    Merge pull request #18230 from borod108/rfe/upgrade_cluster_role_options
    
    Add job_timeout parameter for upgrade_cluster
    
    (cherry picked from commit 0e1006bce4c40edaa7059247886ad08524f97b8c)
    
    https://bugzilla.redhat.com/show_bug.cgi?id=1644605

@agrare agrare added this to the Sprint 100 Ending Dec 3, 2018 milestone Feb 14, 2019