Determinism through serialization cycle #332

Uriopass · 2023-07-26T09:57:43Z

I'm trying to make a replay system based on determinism. Actions on the world are saved in a replay and are applied on a hecs::World. Entity objects are stored separately in resources and serialized too.

I have a determinism problem demonstrated via the following code.
If a world has gone through a serialization cycle, the entity allocation state is not serialized, and therefore entities do not necessarily get the same IDs.

let mut w1 = World::new();
let e = w1.spawn((Comp,));
w1.despawn(e);

let mut w2 = deserialize(serialize(&w1)); // simplified for example; `mut` is needed to spawn below

let e1 = w1.spawn((Comp,));
let e2 = w2.spawn((Comp,));

assert_eq!(e1, e2); // fails: generations don't match
// A longer example can also show a set of operations where the id (not the generation) doesn't match either,
// since the freelist is not saved.

From what I understand, this is intended.
What should I do if I want reproducible worlds that survive serialization cycles? Is enough of the hecs API publicly exposed to be able to do this?
That is, doing operations1 -> ser -> deser -> operations2 results in the same World as operations1 -> operations2.
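The "longer example" mentioned above can be sketched with a toy allocator. This is a minimal model, not hecs code: `ToyEntity`, `ToyAllocator`, and `lossy_round_trip` are hypothetical names, and the round trip is assumed to keep only live entities, the way a serializer that walks live entities would. It shows both failure modes: a generation mismatch, and an id mismatch caused by the lost freelist order.

```rust
/// Toy entity handle: slot index plus generation (hypothetical, not hecs::Entity).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct ToyEntity {
    pub id: u32,
    pub generation: u32,
}

/// Toy allocator: per-slot generation and liveness, plus a LIFO freelist.
#[derive(Debug, Default)]
pub struct ToyAllocator {
    generations: Vec<u32>,
    alive: Vec<bool>,
    free: Vec<u32>,
}

impl ToyAllocator {
    /// Reuse the most recently freed slot, or grow by one slot.
    pub fn spawn(&mut self) -> ToyEntity {
        let id = match self.free.pop() {
            Some(id) => id,
            None => {
                self.generations.push(0);
                self.alive.push(false);
                (self.generations.len() - 1) as u32
            }
        };
        self.alive[id as usize] = true;
        ToyEntity { id, generation: self.generations[id as usize] }
    }

    /// Free the slot and bump its generation so stale handles stop matching.
    pub fn despawn(&mut self, e: ToyEntity) {
        let i = e.id as usize;
        if self.alive.get(i) == Some(&true) && self.generations[i] == e.generation {
            self.alive[i] = false;
            self.generations[i] += 1;
            self.free.push(e.id);
        }
    }

    /// Assumed lossy save/load: only live entities survive. Dead slots below a
    /// live one are recreated with generation 0 and freed in ascending order.
    pub fn lossy_round_trip(&self) -> ToyAllocator {
        let mut out = ToyAllocator::default();
        for (i, &live) in self.alive.iter().enumerate() {
            if live {
                while out.generations.len() < i {
                    out.free.push(out.generations.len() as u32);
                    out.generations.push(0);
                    out.alive.push(false);
                }
                out.generations.push(self.generations[i]);
                out.alive.push(true);
            }
        }
        out
    }
}

/// Scenario from the issue: spawn, despawn, round trip, spawn in both worlds.
pub fn generation_mismatch() -> (ToyEntity, ToyEntity) {
    let mut w1 = ToyAllocator::default();
    let e = w1.spawn();
    w1.despawn(e);
    let mut w2 = w1.lossy_round_trip();
    (w1.spawn(), w2.spawn())
}

/// Longer scenario: the order of despawns is lost, so the next id differs.
pub fn id_mismatch() -> (ToyEntity, ToyEntity) {
    let mut w1 = ToyAllocator::default();
    let a = w1.spawn();
    let b = w1.spawn();
    let _c = w1.spawn();
    w1.despawn(b);
    w1.despawn(a); // freelist is now [1, 0]; slot 0 is handed out next
    let mut w2 = w1.lossy_round_trip(); // freelist rebuilt as [0, 1]
    (w1.spawn(), w2.spawn())
}
```

In this model, `generation_mismatch()` returns two handles with the same id but different generations, and `id_mismatch()` returns handles whose ids differ.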


adamreichold · 2023-07-26T10:28:06Z

What about attaching a persistent ID to each entity, which could then be used to match up the new transient entity ID after deserialization? Actions would of course also have to record the persistent ID of the entities they were applied to.
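One way this suggestion could look, as a rough sketch using only the standard library. All names here (`PersistentId`, `TransientEntity`, `Action`, `IdMap`) are hypothetical; `TransientEntity` stands in for whatever handle the ECS hands out, and in a real setup the persistent ID would be stored as a component on each entity so that the pairs can be collected again after loading.

```rust
use std::collections::HashMap;

/// Stable identifier saved with the entity and recorded in replay actions.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub struct PersistentId(pub u64);

/// Stand-in for a transient ECS handle; it may change across save/load.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct TransientEntity(pub u32);

/// A replay action names its target by persistent ID, never by transient handle.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Action {
    pub target: PersistentId,
}

/// Lookup table from persistent IDs to the current transient handles.
#[derive(Debug, Default)]
pub struct IdMap {
    next: u64,
    by_persistent: HashMap<PersistentId, TransientEntity>,
}

impl IdMap {
    /// Assign a fresh persistent ID to a newly spawned entity.
    pub fn register(&mut self, entity: TransientEntity) -> PersistentId {
        let pid = PersistentId(self.next);
        self.next += 1;
        self.by_persistent.insert(pid, entity);
        pid
    }

    /// Rebuild the table after loading from (persistent, transient) pairs.
    /// Simplification: the counter is derived from the largest ID seen; a real
    /// save would store the counter itself so that IDs are never reused.
    pub fn rebuild(pairs: impl IntoIterator<Item = (PersistentId, TransientEntity)>) -> Self {
        let by_persistent: HashMap<_, _> = pairs.into_iter().collect();
        let next = by_persistent.keys().map(|p| p.0 + 1).max().unwrap_or(0);
        IdMap { next, by_persistent }
    }

    /// Find the entity an action refers to in the current world, if it exists.
    pub fn resolve(&self, action: Action) -> Option<TransientEntity> {
        self.by_persistent.get(&action.target).copied()
    }
}
```

The replay then stays valid even if the loaded world assigns different transient handles, at the cost of the extra state and one lookup per action.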

Uriopass · 2023-07-26T10:34:25Z

I would find it disappointing that I must have extra state when the information is already there (entities already have unique ids). But you're right it might be more robust. I'm not sure how much magic would be required to make this work.

Uriopass · 2023-07-26T10:59:41Z

Iteration order might also change even with the persistent ids, but I don't know enough about hecs internals to check this.

adamreichold · 2023-07-26T11:03:03Z

One other thing that might help you if it applies to your use case at all: If you basically use the serialization support as a means to copy a world (instead of really persisting and loading it into another process), this could be implemented directly.

For example, in rs-ecs (which is based on hecs, albeit simplified) we have World::clone to completely copy a World with all contained entities and components (assuming they can be cloned and the method of doing so has been registered with the given Cloner). This does preserve entity ID and iteration order and could most likely be directly ported to hecs. (In a future with stable specialization, the interface would also become much simpler by specializing the internal clone function on the Clone and Copy traits.)

Uriopass · 2023-07-26T11:24:54Z

I'm using the serialization to do classical save/loading on the filesystem, but with the constraint that I'm building a replay system and I want to ensure determinism, so sadly cloning wouldn't help here.

Ralith · 2023-08-01T17:47:17Z

I recognize the usefulness of being able to save/restore allocator state for some applications. I don't think it costs us much to expose the freelist for folks to serialize if they really want; even if we have to break it in the future, the cost of doing so is small.

Would you also need generations to be assigned consistently before/after serialization? That might be a good chunk of additional data, and a bit harder to expose ergonomically.

Uriopass · 2023-08-01T21:55:11Z

How big would it really be? It's all about saving the IDs which aren't that big, no? I'd say generation is not that expensive and ensures 100% state restoration.

Ralith · 2023-08-02T22:26:07Z

Generations would double the amount of data (which is proportional to the high watermark of the number of entities you've ever had concurrently live), and aren't conveniently laid out in a dense slice. Not a deal breaker, but would require a bit of thought to expose gracefully.

ZagButNoZig · 2024-01-05T19:05:53Z

Hey @Ralith I would be interested in this feature too. What additional data would we need to serialize to get deterministic serialization working?

I was looking through the code and it seems like the "big thing" is to fully serialize the Entities struct inside World, to get consistent ids and generations.

To get a bit by bit identical world we would also in theory need to serialize the id field of the world, do you think that would be relevant?

Would we need to serialize any of the other members of world?

Thanks for the great work btw!

Ralith · 2024-01-05T22:59:27Z

Two additional pieces of data need to be exposed, both from the entity allocator:

The freelist: required to ensure entities are allocated in consistent order
The generation counters of non-live entities: required to ensure allocated entities have the same generations

APIs are needed to both expose this information, and to allow it to be provided to a new world.
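The two pieces of state listed above can be illustrated with a toy allocator. This is a sketch of the idea only, not the hecs allocator or any existing hecs API: `ToyAllocator`, `AllocatorState`, `export_state`, and `restore` are hypothetical names, and `restore` assumes the state it receives came from `export_state`.

```rust
/// Toy entity handle: slot index plus generation (hypothetical, not hecs::Entity).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct ToyEntity {
    pub id: u32,
    pub generation: u32,
}

/// The two extra pieces of allocator state, as plain serializable data.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct AllocatorState {
    /// Free slots in stack order: the last element is handed out next.
    pub freelist: Vec<u32>,
    /// (slot, generation) for every non-live slot.
    pub dead_generations: Vec<(u32, u32)>,
}

/// Toy allocator: a slot is dead exactly when it is on the freelist.
#[derive(Debug, Default)]
pub struct ToyAllocator {
    generations: Vec<u32>,
    free: Vec<u32>,
}

impl ToyAllocator {
    /// Reuse the most recently freed slot, or grow by one slot.
    pub fn spawn(&mut self) -> ToyEntity {
        let id = match self.free.pop() {
            Some(id) => id,
            None => {
                self.generations.push(0);
                (self.generations.len() - 1) as u32
            }
        };
        ToyEntity { id, generation: self.generations[id as usize] }
    }

    /// Free the slot and bump its generation; stale handles are ignored.
    pub fn despawn(&mut self, e: ToyEntity) {
        let i = e.id as usize;
        if self.generations.get(i) == Some(&e.generation) && !self.free.contains(&e.id) {
            self.generations[i] += 1;
            self.free.push(e.id);
        }
    }

    /// Live entities, which an ordinary save is assumed to contain already.
    pub fn live(&self) -> Vec<ToyEntity> {
        (0..self.generations.len() as u32)
            .filter(|i| !self.free.contains(i))
            .map(|id| ToyEntity { id, generation: self.generations[id as usize] })
            .collect()
    }

    /// Export the freelist and the generations of dead slots.
    pub fn export_state(&self) -> AllocatorState {
        AllocatorState {
            freelist: self.free.clone(),
            dead_generations: self
                .free
                .iter()
                .map(|&i| (i, self.generations[i as usize]))
                .collect(),
        }
    }

    /// Rebuild an allocator from live entities plus the exported state.
    pub fn restore(live: &[ToyEntity], state: &AllocatorState) -> ToyAllocator {
        let slots = live
            .iter()
            .map(|e| e.id)
            .chain(state.dead_generations.iter().map(|&(i, _)| i))
            .max()
            .map_or(0, |m| m as usize + 1);
        let mut generations = vec![0; slots];
        for e in live {
            generations[e.id as usize] = e.generation;
        }
        for &(i, g) in &state.dead_generations {
            generations[i as usize] = g;
        }
        ToyAllocator { generations, free: state.freelist.clone() }
    }
}

/// Spawn in the original and in the restored allocator after the same history.
pub fn round_trip_matches() -> (ToyEntity, ToyEntity) {
    let mut w1 = ToyAllocator::default();
    let a = w1.spawn();
    let b = w1.spawn();
    let _c = w1.spawn();
    w1.despawn(b);
    w1.despawn(a);
    let mut w2 = ToyAllocator::restore(&w1.live(), &w1.export_state());
    (w1.spawn(), w2.spawn())
}
```

With both pieces restored, the two handles returned by `round_trip_matches()` agree in id and generation. Dropping `freelist` would break the id, and dropping `dead_generations` would break the generation.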

"To get a bit by bit identical world we would also in theory need to serialize the id field of the world, do you think that would be relevant?"

No, that ID's only purpose is to distinguish different World objects at runtime. Providing any way to set it would be unsafe.

Uriopass mentioned this issue Jul 26, 2023

Determinism Uriopass/Egregoria#90

Open

adamreichold mentioned this issue Aug 6, 2023

Is it possible to clone World? With use case. #333

Closed

caspark mentioned this issue May 16, 2024

Add World::try_clone() #372

Closed
