
Enhancement request: forced-multithread operation to support "snapshot"-like collection #2527

Closed
athompson-merlin opened this issue Apr 27, 2022 · 1 comment · Fixed by #2528

Comments

@athompson-merlin

(See #457 for some background detail. I may as well just quote myself, more or less:)

I, like others, have a need to collect as much of a "snapshot" as possible, so I would prefer that, at the interval time, Oxidized spin up as many threads as possible in order to complete data collection as rapidly as possible. Oxidized is not resource-limited in my environment, and I still have max_threads to control resource use anyway.

This would also make troubleshooting my production instance easier: when something doesn't work right and I have to restart Oxidized (e.g. after editing a custom model), it can take almost an hour before Oxidized finishes its initial single-threaded poll of all the devices and reaches steady state, at which point I can begin troubleshooting usefully.

This might make more sense when I say that many of my devices take 5-10 minutes to collect running-config, so when those get serialized, some of the built-in assumptions (e.g. jobs taking 5 seconds by default) are way out of sync with my environment.

Ideal (for me) would be a new config option along the lines of use_max_available_threads: [yes|no] that could be changed and reloaded at runtime. Although some discussion has previously occurred, I don't see anything like that today.
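To illustrate what I mean, here is a rough sketch of how such an option could influence the number of jobs the scheduler asks for. This is purely hypothetical code, not Oxidized's actual implementation; the method name and parameters are my own invention:

```ruby
# Hypothetical sketch (not actual Oxidized code): with the option on,
# request as many parallel jobs as possible, capped only by max_threads
# and the number of nodes; with it off, keep the scheduler's own count.
def desired_job_count(node_count, max_threads, use_max_available_threads:, computed_count:)
  if use_max_available_threads
    [node_count, max_threads].min
  else
    computed_count
  end
end
```

With 100 devices and max_threads set to 30, that would mean 30 parallel jobs right away instead of the slow ramp-up.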

If I'm missing something, or if this is a trivial local modification, great! - please tell me.
Otherwise, it looks like the section of code needing to be changed is jobs.want, starting at def new_count, but I'm barely able to read this code base, forget about modifying core functionality.

Alternately, if MAX_INTER_JOB_GAP (currently hard-coded as MAX_INTER_JOB_GAP = 300 # add job if more than X from last job started) were a config file parameter that could be reloaded at runtime, that might suffice? I don't think I would want it permanently set to e.g. 1 second, but I'm unclear on what negative effects that might have.
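For the sake of discussion, making the gap configurable could look something like this. The config key name and helper method here are hypothetical, not anything that exists in Oxidized today:

```ruby
# Hypothetical sketch: read the inter-job gap from the loaded config
# hash, falling back to the current hard-coded default of 300 seconds.
DEFAULT_INTER_JOB_GAP = 300

def max_inter_job_gap(config)
  config.fetch('max_inter_job_gap', DEFAULT_INTER_JOB_GAP)
end
```

A runtime config reload would then pick up a new value without a source edit and restart.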

@athompson-merlin (Author)

Adjusting MAX_INTER_JOB_GAP to 1 directly in the source doesn't produce exactly the results I want, but it gets a lot closer. It takes A seconds to spin up B parallel jobs for C devices, where A > C > B (why?), and then, once done, it appears to wait the usual interval amount of time before starting again.
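My reading of the behavior above, expressed as a sketch: a new job is only added when more than the gap has elapsed since the last job started, so even with a 1-second gap the job count still ramps up one at a time. This is my paraphrase of the decision, not Oxidized's actual code:

```ruby
# Hypothetical sketch of the add-a-job decision the gap controls:
# add another job only if we are under the thread ceiling AND more
# than `gap` seconds have passed since the last job was started.
def add_job?(now, last_job_started_at, running, max_threads, gap)
  running < max_threads && (now - last_job_started_at) > gap
end
```

If that reading is right, even gap = 1 gives at best one new job per second, which would explain the slow spin-up I'm seeing.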

This issue was closed.