
sis_hash_helper more efficient #86

Draft · wants to merge 2 commits into master
Conversation

@albertz (Member) commented Jun 20, 2022

This gives me a huge speedup on a big GMM pipeline, from 12 sec runtime down to 2 sec.

Edit: Sorry, wrong numbers...

@albertz (Member, Author) commented Jun 20, 2022

Hm, OK, I need to do some debugging; this seems to change the hash.

@albertz albertz marked this pull request as draft June 20, 2022 12:39
@JackTemaki (Contributor)

Yes, my setup before the change:
error(7) runnable(5) waiting(649)

My setup after the change:
error(3) runnable(12) waiting(247)

@albertz (Member, Author) commented Jun 20, 2022

Ok, similar fix as in #89. Now for my simple test I get the same hash.

@albertz albertz marked this pull request as ready for review June 20, 2022 12:48
@albertz (Member, Author) commented Jun 20, 2022

However, the speedup is much less now. Need to do some benchmarks. I think it should still be faster than before though.

@albertz (Member, Author) commented Jun 20, 2022

> However, the speedup is much less now. Need to do some benchmarks. I think it should still be faster than before though.

Actually, no? It seems slightly slower. Maybe due to higher memory consumption? I need to investigate this a bit more...

@albertz albertz marked this pull request as draft June 20, 2022 12:53
sisyphus/hash.py — 3 resolved review threads
@albertz (Member, Author) commented Jun 21, 2022

> However, the speedup is much less now. Need to do some benchmarks. I think it should still be faster than before though.
>
> Actually, no? It seems slightly slower. Maybe due to higher memory consumption? I need to investigate this a bit more...

I'm also thinking about doing more clever caching, but I'm not exactly sure where. Unfortunately, the objects we get here are often dicts, lists, or some other config objects, and they are all mutable, so you cannot really cache the hash unless you make sure the cache is correctly invalidated, which is not always easily possible (e.g. no idea how you would do that for a dict). Maybe in JobSingleton.__call__ we can do it for the args, because afterwards the args should not change anymore.
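To illustrate the idea: if the args are frozen at job-creation time, the hash can be computed once and reused. This is only a hypothetical sketch (the class and function names are illustrative, not the sisyphus API), and it is only valid under the stated assumption that the args are never mutated after construction:

```python
import hashlib
import pickle

def sis_hash_helper_sketch(obj):
    # Stand-in for the real sis_hash_helper: hash a pickled representation.
    return hashlib.sha256(pickle.dumps(obj)).hexdigest()

class JobSketch:
    # Hypothetical job wrapper: compute the hash of the args once at
    # creation time (analogous to doing it in JobSingleton.__call__) and
    # reuse it afterwards. Invalid if args are mutated later.
    def __init__(self, args):
        self.args = args
        self._cached_hash = sis_hash_helper_sketch(args)

    def sis_hash(self):
        return self._cached_hash
```

Two jobs created from equal args then get equal hashes without re-hashing on every lookup.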

@albertz (Member, Author) commented Jun 21, 2022

Also, I think the current change in this PR does not fully capture all the recursive calls of sis_hash_helper. The recursive calls often go through obj._sis_hash calls.

@albertz albertz force-pushed the albert-sis-hash-helper-more-efficient branch from 4e6fb53 to 2f4cbe0 Compare June 21, 2022 09:01
@critias (Contributor) commented Jun 21, 2022

> Also, I think the current change in this PR does not fully capture all the recursive calls of sis_hash_helper. The recursive calls often go through obj._sis_hash calls.

You could also cache the result of obj._sis_hash.
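A minimal sketch of that suggestion, memoizing the hash on the object itself (class name and payload attribute are hypothetical, not the sisyphus API; safe only under the assumption that the object is effectively immutable once hashed, since mutating it afterwards would leave a stale cached value):

```python
import hashlib
import pickle

class CachedHashObject:
    # Hypothetical object with a memoized _sis_hash: the digest is
    # computed on first call and returned from the cache afterwards.
    def __init__(self, payload):
        self.payload = payload
        self._sis_hash_cache = None

    def _sis_hash(self):
        if self._sis_hash_cache is None:
            self._sis_hash_cache = hashlib.sha256(
                pickle.dumps(self.payload)).hexdigest()
        return self._sis_hash_cache
```

Repeated calls then skip the pickle+hash work entirely, which is where the recursive-hashing cost lives.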

@albertz albertz mentioned this pull request Jun 21, 2022
3 participants