Fix pickling of states and distributions #305

daniel-klein · 2024-02-24T02:36:12Z

Not pretty, but seems to work. Would appreciate a review. Should close #257 and now we can run MultiSim!

…ckling of states.

daniel-klein · 2024-02-24T02:45:01Z

starsim/distributions.py

+        return dct
+
+    def __setstate__(self, state):
+        self.__init__(state['_gen'], state['rng'])


I understand that it's not great form to call __init__ from __setstate__, in case init changes in some unexpected way. Better practice could be to copy the relevant guts out of init to a separate function that's call both by __init__ and __setstate__, but meh?

daniel-klein · 2024-02-24T02:45:55Z

starsim/states.py

+
+    def __getstate__(self):
+        slots_dict = {s: getattr(self, s) for s in self.__slots__ if hasattr(self, s) and getattr(self, s) is not None}
+        return (self.__dict__, slots_dict)


This is basically reimplementing __getstate__... because I don't know how to call the default state getter. Ideas?

daniel-klein · 2024-02-24T02:46:18Z

starsim/states.py

+        for st in state:
+            for k, v in st.items():
+                setattr(self, k, v)
+        return


Again, basically just the default state setter because I'm not sure what else to do.

daniel-klein · 2024-02-24T02:46:35Z

tests/test_simple.py

+    sims = ss.MultiSim([ss.Sim(pars, label='Sim1'), ss.Sim(pars, label='Sim2')])
+    sims.run()
+    s1, s2 = sims.sims
+    assert np.allclose(s1.summary[:], s2.summary[:], rtol=0, atol=0, equal_nan=True)


Switched from parallel to MultiSim, now that we have that functionality.

cliffckerr

Looks good to me, but let's see if @RomeshA or @kaa2102 have any thoughts on extra Python wizardry to do here :) Otherwise, I think we can refactor further if other things break in future. Working is a big step up from not working!

RomeshA · 2024-02-26T13:31:59Z

I was a bit unsure what the UIDArray.__getstate__ and UIDArray.__setstate__ are needed for, is there a test case that fails if they are omitted? Is it perhaps related to the new addition of UIDArray.__getattr__()? I think something isn't quite right with that, if I run

import starsim as ss
import sciris as sc
s = ss.Sim()
s.initialize()
s2 = sc.dcp(s)

then states like s2.people.female are turned into just plain arrays. @cliffckerr do we really need to override that function entirely? It feels like it might open us up to a wide range of weird side effects further down the track

daniel-klein · 2024-02-26T19:37:36Z

@RomeshA - these changes are intended to enable parallel processing, e.g. MultiSim. Initially, there was a problem with forking and collecting the ScipyDistributions that prevented us from even testing parallel processing. But with those issues addressed, I encountered all sorts of issues with states, for example when results are pickled in returning from a completed sim. Try running test_parallel in test_simple.py.

cliffckerr · 2024-02-27T00:15:32Z

@RomeshA To fix that particular issue, we could change it to

    def __getattr__(self, attr):
        """ Make it behave like a regular array mostly -- enables things like sum(), mean(), etc. """
        if attr in ['__deepcopy__', '__setstate__']:
            return self.__getattribute__(attr)
        else:
            return getattr(self.values, attr)

Buuuuut ... this is definitely ugly and probably not super performant, though not sure if this path is encountered enough for it to matter.

cliffckerr · 2024-02-27T00:31:48Z

NB: on main, the copied sim can't be run. If you comment out getattr or replace it with the code above, the sim runs, but produces bizarre results (where are the recovered people going?!):

import starsim as ss
import sciris as sc

s = ss.Sim(pars=dict(diseases='sir', networks='random'))
s.initialize()

s2 = sc.dcp(s)

s.run()
s2.run()

s.plot()
s2.plot()

So in any case, something needs to be fixed.

cliffckerr · 2024-02-27T00:38:16Z

@daniel-klein I pushed this change to getattr so you don't need the explicit getstate/setstate. I am baffled as to why it works, though, because I thought __getattr__() is only called if __getattribute__() fails, but this calls it for those special cases and it works?!?

RomeshA · 2024-02-27T02:23:01Z

Those changes above should fix @cliffckerr's example - the issue was that we were inadvertently deep-copying arrays that were supposed to be references to other existing arrays, so the discrepancy would occur because the first 10 time steps would update the inadvertently copied arrays, and then when dead agents are removed at t=10, the views were re-connected but because the proper arrays hadn't been used up to that point, they would still contain the original values (e.g., everyone susceptible). So the fix is to make sure that the arrays are re-linked when copied/unpickled

RomeshA

Seems to all work now with the latest set of changes!

daniel-klein added 2 commits February 22, 2024 16:04

Addresssing #257

e473cef

Addressing ScipyDistribution can't be pickled #257 and also fixing pi…

0e6ff76

…ckling of states.

daniel-klein requested review from RomeshA and kaa2102 February 24, 2024 02:36

daniel-klein commented Feb 24, 2024

View reviewed changes

cliffckerr approved these changes Feb 24, 2024

View reviewed changes

remove explicit getstate/setstate

91668be

RomeshA added 6 commits February 27, 2024 11:10

Update states registry when copying

b48b707

WIP

abd44c8

Correctly re-map array references when unpickling UIDArrays and States

127c109

Uncomment tests

3111504

Remove debug print commands

9b9ee5e

Uncomment __getattr__

d56e0a9

RomeshA approved these changes Feb 27, 2024

View reviewed changes

cliffckerr added 3 commits February 26, 2024 19:36

Merge branch 'main' into fix_distribution_pickle

4f80beb

tidy docstring

e3a832e

update changelog

8dd5449

cliffckerr merged commit 02c20ab into main Feb 27, 2024
2 checks passed

cliffckerr deleted the fix_distribution_pickle branch February 27, 2024 03:40

daniel-klein mentioned this pull request Feb 27, 2024

Copy still doesn't work #318

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix pickling of states and distributions #305

Fix pickling of states and distributions #305

daniel-klein commented Feb 24, 2024

daniel-klein Feb 24, 2024

daniel-klein Feb 24, 2024

daniel-klein Feb 24, 2024

daniel-klein Feb 24, 2024

cliffckerr left a comment •

edited

Loading

RomeshA commented Feb 26, 2024

daniel-klein commented Feb 26, 2024

cliffckerr commented Feb 27, 2024

cliffckerr commented Feb 27, 2024

cliffckerr commented Feb 27, 2024

RomeshA commented Feb 27, 2024

RomeshA left a comment

Fix pickling of states and distributions #305

Fix pickling of states and distributions #305

Conversation

daniel-klein commented Feb 24, 2024

daniel-klein Feb 24, 2024

Choose a reason for hiding this comment

daniel-klein Feb 24, 2024

Choose a reason for hiding this comment

daniel-klein Feb 24, 2024

Choose a reason for hiding this comment

daniel-klein Feb 24, 2024

Choose a reason for hiding this comment

cliffckerr left a comment • edited Loading

Choose a reason for hiding this comment

RomeshA commented Feb 26, 2024

daniel-klein commented Feb 26, 2024

cliffckerr commented Feb 27, 2024

cliffckerr commented Feb 27, 2024

cliffckerr commented Feb 27, 2024

RomeshA commented Feb 27, 2024

RomeshA left a comment

Choose a reason for hiding this comment

cliffckerr left a comment •

edited

Loading