RecursiveOpenStructs are heavy & slow #251

cben · 2017-06-07T21:47:59Z

[don't have time to measure soon but wanted to braindump... cc @ilackarms @simon3z]

Kubeclient wraps all parsed JSONs with RecursiveOpenStructs. ROS does several expensive things:
https://github.com/aetherknight/recursive-open-struct/blob/v1.0.4/lib/recursive_open_struct.rb#L69-L111
https://github.com/ruby/ruby/blob/v2_4_1/lib/ostruct.rb

on construction: deep dup the hash — can be mitigated by setting :mutate_input_hash, safe our case.
- it also keeps a DeepDup instance around, one per ROS object afaict.
When you access fields as methods (result.foo.bar): method_missing call, creating a singleton class for each object, and defining 3 methods foo, foo=, foo_as_a_hash (per key)! The methods also keep a closure to reference vars from new_ostruct_member. Then it proceeds to [:foo] access.
- I've seen with a sampling profiler in manageiq code that walks & accesses most of the data parsed from kubernetes, that 30%~80% CPU gets spent in new_ostruct_member! (we don't currently care about the CPU but other kubeclient users might)
- I suspect allocating singleton classes and 3 methods + closure per field wastes tons of RAM.
- Creating the methods is premature optimization to save CPU but is wasteful for once-or-twice usage patterns.
When you access as hash (result[:foo][:bar]): recursively contstructs ROS for the children. It's on-demand and memoized but as you walk the data you'll get a tree of ROSs parallel to the original hashes...
on .to_h to just get the underlying hash: deep dups it — can maybe be cheaper with :preserve_original_keys?
Anyway, result.to_h looks like cheapest access option!

I have several ideas to try making upstream ROS & OS faster.

The text was updated successfully, but these errors were encountered:

moolitayer · 2017-06-08T07:24:29Z

Maybe we need an option that tells kubeclient to return JSON instead of ROS?

moolitayer · 2017-06-08T07:26:15Z

BTW since that would require a large code change in our usage code (even if it's mostly cosmetic) we need some benchmarks

ilackarms · 2017-06-08T16:16:35Z

can we test the actual size of the things we're returning from kubeclient with ObjectSpace.memsize_of ?

cben · 2017-06-14T20:22:27Z

Is recursive-open-struct really necessary? aetherknight/recursive-open-struct#19 shows neat trick telling JSON.parse which class to create. Not clear if/how it helps us but wanted to remember it.

moolitayer · 2017-06-14T20:58:31Z

Sorry for repeating... I've never understood why we need ROS.
pod.name instead of pod["name"]?? while lots of complicated stuff is happening behind the scenes to enable it

simon3z · 2017-06-15T11:31:01Z

Sorry for repeating... I've never understood why we need ROS.

@moolitayer me neither. It's also problematic for those keys that have special symbols (e.g. -).
For me it's a 👍 to drop ROS, although we should carefully pick and balance what we're working on to improve performance, I have the feeling that the big part is still on the MIQ db side in our case.

grosser · 2017-06-15T21:51:37Z

FYI related issue #197

moolitayer · 2017-08-21T09:22:41Z

See #262, add option for raw and avoids ROS (and JSON parsing altogether)

cben · 2018-03-04T12:28:35Z

Covered by #262 as: :raw, #306 as: :parsed & as: :parsed_symbolized, and #299 allowing to set as: for whole client.

cben mentioned this issue Jun 7, 2017

[WIP] Save allocations (?) by using symbols for JSON keys #252

Closed

2 tasks

grosser mentioned this issue Jun 15, 2017

Updated outdated dependencies #253

Merged

cben added the performance label Mar 4, 2018

cben closed this as completed Mar 4, 2018

grosser mentioned this issue Sep 1, 2020

use more efficient hashes when talking to kubernetes fabric8io/fluent-plugin-kubernetes_metadata_filter#254

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RecursiveOpenStructs are heavy & slow #251

RecursiveOpenStructs are heavy & slow #251

cben commented Jun 7, 2017 •

edited

Loading

moolitayer commented Jun 8, 2017

moolitayer commented Jun 8, 2017

ilackarms commented Jun 8, 2017

cben commented Jun 14, 2017

moolitayer commented Jun 14, 2017

simon3z commented Jun 15, 2017

grosser commented Jun 15, 2017

moolitayer commented Aug 21, 2017

cben commented Mar 4, 2018

RecursiveOpenStructs are heavy & slow #251

RecursiveOpenStructs are heavy & slow #251

Comments

cben commented Jun 7, 2017 • edited Loading

moolitayer commented Jun 8, 2017

moolitayer commented Jun 8, 2017

ilackarms commented Jun 8, 2017

cben commented Jun 14, 2017

moolitayer commented Jun 14, 2017

simon3z commented Jun 15, 2017

grosser commented Jun 15, 2017

moolitayer commented Aug 21, 2017

cben commented Mar 4, 2018

cben commented Jun 7, 2017 •

edited

Loading