
[WIP] Reconnect container images when seen again #14808

Closed
cben wants to merge 3 commits into ManageIQ/manageiq from cben/manageiq:reconnect-container-images

Conversation

@cben (Contributor) commented Apr 19, 2017

Bug: if we disconnect a container image and then encounter it again, we create a duplicate record.
There are scenarios where we may disconnect an image that is not currently used in any container.

If we then see the image again, we'll create a duplicate record, which can confuse reports.

This PR makes refresh reconnect (re-set ems_id on) an existing image if it has a matching digest.
However, if duplicates have already been created, it won't merge them; it will just reuse an arbitrary one of the copies.

EDIT: TODO: this retains old_ems_id and deleted_on. This is not OK.
We need a more explicit API for reconnecting than association.push. Maybe add a reconnect_inv method (see the sketch below)?
EDIT: as Ladas points out, loading the records back to RAM is heavy :-(
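
A hedged sketch of what such an explicit reconnect API could look like; the reconnect_inv name comes from the TODO above, but the body and the column handling are assumptions for illustration, not code from this PR:

```ruby
# Hypothetical sketch only -- not part of this PR. An explicit reconnect
# method could restore the archival columns that association.push leaves
# untouched (old_ems_id, deleted_on), instead of only re-setting ems_id.
class ContainerImage < ApplicationRecord
  def reconnect_inv(ems)
    update_attributes!(
      :ems_id     => ems.id,
      :old_ems_id => nil,
      :deleted_on => nil
    )
  end
end
```
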
cc @Ladas @agrare @simon3z @enoodle

Steps for Testing/QA [Optional]

Refresh. Turn off get_container_images in Advanced Settings. Refresh. Turn on. Refresh.
TODO: a way to actually see the dups in reports?

@miq-bot add-label bug, providers/containers

https://bugzilla.redhat.com/show_bug.cgi?id=1488072

found = reconnect_index.fetch(hash) if reconnect_index
if found
  # Reconnect the previously disconnected record instead of creating a duplicate
  found.update_attributes!(hash.except(:id, :type))
  deletes.delete(found) unless deletes.blank?
Contributor:

This one should not be in deletes, so we can remove the line.

@cben (author):

I think so too, but wasn't sure if deletes may ever be larger than association...

Contributor:

The deletes should equal the association at the start; everything left in deletes after processing is then disconnected. So already-disconnected records should not be present, since the association and reconnect_from should be disjoint sets.

@@ -50,10 +50,13 @@ def save_inventory_multi(association, hashes, deletes, find_key, child_keys = []
   remove_keys = Array.wrap(extra_keys) + child_keys
 
   record_index = TypedIndex.new(association, find_key)
+  reconnect_index = TypedIndex.new(reconnect_from, find_key)
Contributor:

Hm, this line can add another huge memory peak if we collect enough disconnected records.

@cben (author):

Right, I forgot to mention it. This will be heavy, negating some or all of the win of not saving them :-(
The only other way I could think of was doing one-by-one queries for new records, which is also not good.
It seems refresh fundamentally requires an efficient DB upsert...
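
For reference, a rough sketch of what such an upsert could look like with PostgreSQL's INSERT ... ON CONFLICT. The table and column names, and especially the unique index on digest that ON CONFLICT requires, are assumptions for illustration, not something the current schema provides:

```ruby
# Illustrative only -- not proposed in this PR. One statement either inserts
# a new image or reconnects an archived duplicate, without loading the
# disconnected records into RAM. Assumes PostgreSQL >= 9.5 and a unique
# index on container_images.digest.
ContainerImage.connection.execute(<<-SQL)
  INSERT INTO container_images (ems_id, digest, name, created_on, updated_on)
  VALUES (42, 'sha256:abc123', 'registry.example.com/app/image', now(), now())
  ON CONFLICT (digest) DO UPDATE
    SET ems_id     = EXCLUDED.ems_id,
        old_ems_id = NULL,
        deleted_on = NULL,
        updated_on = now()
SQL
```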

Contributor:

Hm, I rewrote the saving code in the graph refresh to not do Model.all, but Model.find_each. We could also limit the lookup with reconnect_from.where(:ems_ref => [:ems_refs_to_be_created]).

But we can't do any of this unless we rewrite the old refresh saving code entirely, like in the graph refresh. :-)

Well, theoretically we could use new_records and their ems_refs to limit reconnect_from further; that should help a lot, since I expect we will be loading just dozens of 'to be reconnected' records, compared to the possible 100k+ disconnected records in total.

The initial refresh could also build a huge query, so it might be good to do it in batches.
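
A rough sketch of that narrowing idea, assuming reconnect_from is an ActiveRecord relation and find_key is a single column; the variable names are illustrative and this is not the code in this PR:

```ruby
# Sketch only -- restrict reconnect_from to the refs of records we are about
# to create, and stream the matches in batches instead of indexing every
# disconnected record at once.
new_refs = hashes.map { |h| h[find_key] }.compact

reconnect_candidates = {}
reconnect_from.where(find_key => new_refs).find_each(:batch_size => 1_000) do |record|
  # Only a handful of records should match, even with 100k+ disconnected rows.
  reconnect_candidates[record.public_send(find_key)] = record
end
```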

@miq-bot (Member) commented Apr 19, 2017

This pull request is not mergeable. Please rebase and repush.

All potential `reconnect_from` records are loaded up front, may be heavy?
Found matches are added back to the `association`.
@cben force-pushed the reconnect-container-images branch from f2fd447 to 272e5f1 on April 19, 2017 at 15:34
Note that if duplicate images have already been created,
this will not merge them; it will just reuse an arbitrary one.
@cben force-pushed the reconnect-container-images branch from 272e5f1 to 2f6fee0 on April 19, 2017 at 15:56
@miq-bot (Member) commented Apr 19, 2017

Checked commits cben/manageiq@c58a2e1~...2f6fee0 with ruby 2.2.6, rubocop 0.47.1, and haml-lint 0.20.0
3 files checked, 0 offenses detected
Everything looks good. 🏆

@miq-bot (Member) commented Apr 21, 2017

This pull request is not mergeable. Please rebase and repush.

@enoodle commented Apr 23, 2017

cc @zeari

@zeari commented Apr 23, 2017

@simon3z If the only way we identify containers is by name, then they might also re-appear this way and need the same change.

@cben (author) commented Apr 23, 2017 via email

@zeari commented Apr 23, 2017

> I think containers belong to a pod, which has a GUID that will never
> reappear, no?
> Can a pod's containers change during its lifetime?

I thought they can. If they're tightly coupled, then there's no need.

@cben (author) commented Sep 4, 2017

@cben closed this Sep 5, 2017