[V2V] lookup active tasks in a single query #18876

kbrock · 2019-06-14T13:56:48Z

Overview

infrastructure conversion distributes work among nodes.
It does this by assigning tasks (the work) to the nodes with the least number of tasks.

refs:

Before

We looked up the number of tasks (aka amount of work) being performed on each conversion host up front for each ems.

This means when we were assigning tasks, we never looked up the task count again, and the numbers never increased.
So the hosts with the least amount of work, still looked like they had the least amount of work.
A single host would get all the tasks for each timer event / round. Besides not distributing the work well, we also sometimes went over the max thresholds for hosts.

After

We are looking up the eligible servers up front for each ems. This work is expensive and it is good to cache.

For each task we are assigning, we are re-calculating the number of tasks running for each conversion host in a single query.
Now we are able to see the increased work loads and we no longer assign all current work to the same conversion host.

instead of calculating the size in an n+1 manner, use the virtual attributes Ems#conversion_hosts is not a real collection, so we have to sum across each collection and add those together

trivial refactor storing a single conversion_host rather than all of them

remove totals lookup for eligible? cache eligible_conversion_hosts for each ems lookup all totals at a time, and select the best host for the job (the one with the fewest number of active tasks)

miq-bot · 2019-06-14T14:08:00Z

Checked commits kbrock/manageiq@6cdfa6c~...484f37b with ruby 2.3.3, rubocop 0.69.0, haml-lint 0.20.0, and yamllint 1.10.0
4 files checked, 3 offenses detected

lib/infra_conversion_throttler.rb

⚠️ - Line 13, Col 28 - Rails/ActiveRecordAliases - Use update! instead of update_attributes!.
❗ - Line 10, Col 47 - Style/SymbolProc - Pass &:check_concurrent_tasks as an argument to select instead of a block.
❗ - Line 12, Col 9 - Layout/EmptyLineAfterGuardClause - Add empty line after guard clause.

djberg96 · 2019-06-14T14:22:17Z

app/models/ext_management_system.rb

+  def total_active_tasks
+    host_conversion_hosts.sum(:total_active_tasks) + vm_conversion_hosts.sum(:total_active_tasks)
+  end
+


Since this is on the EMS model, and since other models could potentially need their own list of active tasks, perhaps it would be best to rename this to total_active_conversion_host_tasks.

jerryk55 · 2019-06-14T14:22:55Z

app/models/ext_management_system.rb

+  def total_active_tasks
+    host_conversion_hosts.sum(:total_active_tasks) + vm_conversion_hosts.sum(:total_active_tasks)
+  end
+


This just points out the difficulty I'm going to have trying to move the V2V model code - specifically the conversion_hosts model - out of the core repo and into the V2V repo. Still open to suggestions on how this can be done...

kbrock · 2019-06-14T18:26:00Z

this is requiring too much work with the specs. not a fan of all the stubbing.

Something odd was happening. some caching was getting introduced and the counts were not updating.

This can be used for references, but we're avoiding this change for now

kbrock added the bug label Jun 14, 2019

kbrock requested a review from djberg96 June 14, 2019 13:58

kbrock added 3 commits June 14, 2019 09:59

introduce total_active_tasks

6cdfa6c

instead of calculating the size in an n+1 manner, use the virtual attributes Ems#conversion_hosts is not a real collection, so we have to sum across each collection and add those together

v2v throttle eligible_host

70eeeeb

trivial refactor storing a single conversion_host rather than all of them

v2v throttle eligible

484f37b

remove totals lookup for eligible? cache eligible_conversion_hosts for each ems lookup all totals at a time, and select the best host for the job (the one with the fewest number of active tasks)

kbrock force-pushed the total_active_task branch from 421cabd to 484f37b Compare June 14, 2019 13:59

kbrock changed the title ~~[v2v] lookup active tasks in a single query~~ [V2V] lookup active tasks in a single query Jun 14, 2019

djberg96 reviewed Jun 14, 2019

View reviewed changes

jerryk55 reviewed Jun 14, 2019

View reviewed changes

kbrock closed this Jun 14, 2019

kbrock deleted the total_active_task branch June 14, 2019 18:26

kbrock mentioned this pull request Jun 21, 2019

Fix "query in loop" in InfraConversionThrottler #18865

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[V2V] lookup active tasks in a single query #18876

[V2V] lookup active tasks in a single query #18876

kbrock commented Jun 14, 2019 •

edited

Loading

miq-bot commented Jun 14, 2019

djberg96 Jun 14, 2019

jerryk55 Jun 14, 2019

kbrock commented Jun 14, 2019

[V2V] lookup active tasks in a single query #18876

[V2V] lookup active tasks in a single query #18876

Conversation

kbrock commented Jun 14, 2019 • edited Loading

Overview

Before

After

miq-bot commented Jun 14, 2019

djberg96 Jun 14, 2019

Choose a reason for hiding this comment

jerryk55 Jun 14, 2019

Choose a reason for hiding this comment

kbrock commented Jun 14, 2019

kbrock commented Jun 14, 2019 •

edited

Loading