Vision: Redesign Event system #278

xlc · 2022-04-13T10:15:24Z

Related issues: paritytech/substrate#9480 paritytech/substrate#11132

Currently all the events are stored in a single array. This means while we can emit events in the runtime, we can never be able to process them in runtime or light client efficiently and safely. This prevents people from using foreign events during cross chain interactions and basically makes many async multi chain operations impossible.

We also have supports event topic that is incomplete (#371) and unused at all.

We should redesign the event system to meet at least those requirements:

Efficient to emit
- Emit event should have minimal overhead, should not increase PoV size.
Efficient to proof
- It should be possible to generate a proof of existent of some events with minimal proof size. Light client and runtime should be able to verify the proof very efficiently.
Efficient removal
- We need to purge all events from previous block at beginning of block construction so remove old events should be as efficient as possible.
Easy to query
- Clients and light clients should be able to query some particular events without iterating every single events in the block
- Clients and light clients should be able to subscribe for some particular events by monitoring for storage changes for some storage keys
O(1) for checkpointing/commit/reversion in transactional processing.

bkchr · 2022-04-13T12:26:01Z

We could probably put events into their own child trie. The main problem I would see with this is the efficient removal, which would not work with a child trie.

shawntabrizi · 2022-04-20T17:45:12Z

Why cant we have a merkle trie as a single storage item? And then just the root hash as a second item that is easy and small to read.

bkchr · 2022-04-21T08:06:06Z

We can not have a merkle trie as a single storage item. It would be too costly. It would require reading the entire trie, when we want to add a new event. However, we could add some sort of post-processing into on_finalize. So, keep the current event structure and append events over the lifetime of a block and then at the end we read all events again and put them into a trie. Then we store this trie and the root inside the state.

bkchr · 2022-04-21T08:07:58Z

But maybe we can also think again about child tries and how to speedup their deletion.

Polkadot-Forum · 2022-10-04T09:26:00Z

This issue has been mentioned on Polkadot Forum. There might be relevant details there:

https://forum.polkadot.network/t/redesign-event-system/625/1

gavofyork · 2022-10-04T09:54:51Z

I don't think events need to/should be "structured"; this would add a lot of conceptual, storage and implementation overhead. I do think we want a quick and easy means of proving that an event happened in a block, as well as proving the context in which it happened (I'm thinking especially an extrinsic index and possibly some call stack).

Some form of secondary trie structure would be the obvious way to go, but child tries (as presently implemented) are needlessly persisting their contents, which we don't want/need.

I think perhaps a specialist data-structure which gives a per-block non-persistent trie accumulator. It may or may not use a host call, depending on how fast it would be to do in wasm.

Basically, you'd be able to append an item and eventually get the root of the trie of all items [ (0, item[0]), (1, item[1]), ... ]. The actual item contents would be retained in memory somewhere and could be queried by the host in a new API call. The weight of emitting an event would need to include the crypto needed for the trie root computation, but that's fine. Clients would need to store the event content for later proof construction, but this is no different to extrinsics. This is all quite similar to how transaction receipts are done in Ethereum.

xlc · 2022-10-04T10:09:42Z

Currently it is very hard (if not impossible) to identify event-call association within a reasonably complicated call (e.g. multisig + proxy + batch). Nested event solves this.
However, it is possible to simulate nested event without implementing nested event. Similar to the ItemCompleted event in batch call, we can just insert some separator event and we just need a well defined parser to convert event array to event tree. So we don't have to put this into consideration when picking the data structure.

I can see a in memory accumulator can help with this and it also have many other use cases. Maybe we should also prioritize it as well.

gavofyork · 2022-10-04T10:21:55Z

Indeed.

gavofyork · 2022-10-24T13:57:22Z

One further requirement: O(1) for checkpointing/commit/reversion in transactional processing.

Lohann · 2023-05-17T22:22:41Z

I think perhaps a specialist data-structure which gives a per-block non-persistent trie accumulator

RSA Accumulators almost meet all those requirements:

Efficient to emit:

✅ RSA Accumulators have O(1) insertion time
❌ The issue is that it can only stores prime numbers, while is easy to map any event to an unique prime using something like:

fn hash_to_prime<E: Codec>(element: &E) -> [u8;32] {
   let mut hash = Blake2_256::hash(element.encode());
   loop {
       if is_prime(&hash) {
           return hash;
       }
       hash = Blake2_256::hash(&hash);
   }
}

For static events, it can be pre-calculated at compile-time, but for an arbitrary element such implementation may not be safe to run on-chain once an adversary can craft an element which requires many iterations to find the prime.

✅ Efficient to proof:

Using shamir's trick, RSA Accumulators proofs are constant size, regardless the number of elements you want to proof.

✅ Efficient removal:

As efficient as membership proofs if Precomputing all membership witnesses

✅ O(1) for checkpointing/commit/reversion in transactional processing.

Batch additions and deletion are also constant time

❓ Easy to query: Depends, I see a conflicting requirements here, which is check if an event is present looking at the storage key, but at the same time not storing the event in the block.. a possible solution here would be the same as ethereum and use bloom filters to tell if an event may be present in a block or not.
it may be combined with non-membership proofs, where the node must proof that a given event is not present in a block.

Lohann · 2023-05-19T01:00:24Z

Ok, I think I've found a more elegant design that meet all listed requirements, a specialized Cuckoo Filter

Cuckoo filter is very similar to Bloom Filter, but store partial fingerprints, less false positives, very good space efficiency and supports deletions!

1 - Anyone can easily skip blocks that doesn't contains the events he is looking for with a small fraction of false positives.
2 - To prevent iterating every single event, we can use the cuckoo filter's fingerprint position as storage index.
3 - On transaction revert, events can be deleted from the filter or the filter could be created only after all event are emitted (on_finalize), whatever is more efficient.
4 - Resistant against this kind of attack

References

ntn-x2 · 2023-05-29T09:39:13Z

Adding my two cents here.

We are working on a cross-chain identity use case, and we need Transact to be dispatched with parachain origin, while letting users pay for tx fees. This might be possible with XCM v4, but until then, the only solution we have found is to pre-fund the account on the target chain by sending some asset's to the sender parachain account on the receiver chain, and for this information to travel back to the sender chain before the user is able to dispatch any extrinsic.

Having efficient proof-of-event capabilities would allow users to prove they funded the parachain account, without relying on XCM to send back such information. This would be possible by using state proofs and a more efficient way of proving that a specific event happened.

kayabaNerve · 2024-02-06T06:09:38Z

RSA Accumulators almost meet all those requirements:

When I read this months ago (November?), I read up on RSA Accumulators. I was completely horrified. Not only do they have a trusted setup, the idea of hashing to prime numbers seemed like a mess. I did see there was groups of unknown order proposed as an alternative to a trusted setup, yet it was still quite fringe.

For completely unrelated reasons, I ended up working on threshold ECDSA and for that, implemented class groups of unknown order in Rust. With that, I implemented a hash to prime (and was again horrified). I ended up accepting it however for its role in removing cryptographic assumptions, having better understood them. Please note some prior works do rely on the Strong Root Assumption (not the Adaptive Root Assumption), and that'd void the need for a hash to prime.

If anyone is interested in using my work, which is in pure Rust, for experiments: https://github.com/kayabaNerve/threshold-threshold-ecdsa/blob/develop/src/class_group.rs

Please note I doubt my implementation is optimal. I believe Chia has made far more efficient reduction algorithms, and I know malachite is ~4x slower than gmp (though this would be a C dependency).

Please note hashing to prime numbers is still messy to implement, and I don't doubt it can be adversarially bombed. I do still question what the worst case would be for an algorithm designed to be used as a hash to prime is. Please note the proposed brute force above can be replaced with a sieve of some form, such as https://eprint.iacr.org/2011/481 which intends to effect a uniform distribution.

Such a proposal could use a single storage slot, and create an inclusion proof with a trivial algorithm using the accumulator of the prior block + an O(n) algorithm where all other events in the same block are included (leaving out the intended event).

I'll also note there are trustless mechanisms to generate uniform elements in a class group, per https://eprint.iacr.org/2024/034.

The above is solely to flesh out the cryptographic proposal of accumulators. My commentary on cuckoo filters is it doesn't sound to actually offer LC verification, solely efficient finding of where events may have been. Is that correct, or would the light client proof be all of the hashes in the filter (where the event's hash inclusion proved it occurred)?

This includes a critical fix for debug release versions of litep2p (which are running in Kusama as validators). While at it, have stopped the oncall pain of alerts around `incoming_connections_total`. We can rethink the metric expose of litep2p in Q1. ## [0.8.2] - 2024-11-27 This release ensures that the provided peer identity is verified at the crypto/noise protocol level, enhancing security and preventing potential misuses. The release also includes a fix that caused `TransportService` component to panic on debug builds. ### Fixed - req-resp: Fix panic on connection closed for substream open failure ([#291](paritytech/litep2p#291)) - crypto/noise: Verify crypto/noise signature payload ([#278](paritytech/litep2p#278)) ### Changed - transport_service/logs: Provide less details for trace logs ([#292](paritytech/litep2p#292)) ## Testing Done This has been extensively tested in Kusama on all validators, that are now running litep2p. Deployed PR: #6638 ### Litep2p Dashboards ![Screenshot 2024-11-26 at 19 19 41](https://github.com/user-attachments/assets/e00b2b2b-7e64-4d96-ab26-165e2b8d0dc9) ### Libp2p vs Litep2p CPU usage After deploying litep2p we have reduced CPU usage from around 300-400% to 200%, this is a significant boost in performance, freeing resources for other subcomponents to function more optimally. ![image(1)](https://github.com/user-attachments/assets/fa793df5-4d58-4601-963d-246e56dd2a26) cc @paritytech/sdk-node --------- Signed-off-by: Alexandru Vasile <alexandru.vasile@parity.io> Co-authored-by: GitHub Action <action@github.com>

This includes a critical fix for debug release versions of litep2p (which are running in Kusama as validators). While at it, have stopped the oncall pain of alerts around `incoming_connections_total`. We can rethink the metric expose of litep2p in Q1. ## [0.8.2] - 2024-11-27 This release ensures that the provided peer identity is verified at the crypto/noise protocol level, enhancing security and preventing potential misuses. The release also includes a fix that caused `TransportService` component to panic on debug builds. ### Fixed - req-resp: Fix panic on connection closed for substream open failure ([#291](paritytech/litep2p#291)) - crypto/noise: Verify crypto/noise signature payload ([#278](paritytech/litep2p#278)) ### Changed - transport_service/logs: Provide less details for trace logs ([#292](paritytech/litep2p#292)) ## Testing Done This has been extensively tested in Kusama on all validators, that are now running litep2p. Deployed PR: #6638 ### Litep2p Dashboards ![Screenshot 2024-11-26 at 19 19 41](https://github.com/user-attachments/assets/e00b2b2b-7e64-4d96-ab26-165e2b8d0dc9) ### Libp2p vs Litep2p CPU usage After deploying litep2p we have reduced CPU usage from around 300-400% to 200%, this is a significant boost in performance, freeing resources for other subcomponents to function more optimally. ![image(1)](https://github.com/user-attachments/assets/fa793df5-4d58-4601-963d-246e56dd2a26) cc @paritytech/sdk-node --------- Signed-off-by: Alexandru Vasile <alexandru.vasile@parity.io> Co-authored-by: GitHub Action <action@github.com> (cherry picked from commit 51c3e95)

This includes a critical fix for debug release versions of litep2p (which are running in Kusama as validators). While at it, have stopped the oncall pain of alerts around `incoming_connections_total`. We can rethink the metric expose of litep2p in Q1. ## [0.8.2] - 2024-11-27 This release ensures that the provided peer identity is verified at the crypto/noise protocol level, enhancing security and preventing potential misuses. The release also includes a fix that caused `TransportService` component to panic on debug builds. ### Fixed - req-resp: Fix panic on connection closed for substream open failure ([paritytech#291](paritytech/litep2p#291)) - crypto/noise: Verify crypto/noise signature payload ([paritytech#278](paritytech/litep2p#278)) ### Changed - transport_service/logs: Provide less details for trace logs ([paritytech#292](paritytech/litep2p#292)) ## Testing Done This has been extensively tested in Kusama on all validators, that are now running litep2p. Deployed PR: paritytech#6638 ### Litep2p Dashboards ![Screenshot 2024-11-26 at 19 19 41](https://github.com/user-attachments/assets/e00b2b2b-7e64-4d96-ab26-165e2b8d0dc9) ### Libp2p vs Litep2p CPU usage After deploying litep2p we have reduced CPU usage from around 300-400% to 200%, this is a significant boost in performance, freeing resources for other subcomponents to function more optimally. ![image(1)](https://github.com/user-attachments/assets/fa793df5-4d58-4601-963d-246e56dd2a26) cc @paritytech/sdk-node --------- Signed-off-by: Alexandru Vasile <alexandru.vasile@parity.io> Co-authored-by: GitHub Action <action@github.com>

bkchr added the J0-enhancement label Apr 13, 2022

mustermeiszer mentioned this issue Jul 25, 2022

Future ChildStorage design and capabilities paritytech/substrate#11913

Closed

This was referenced Oct 4, 2022

In block in memory storage paritytech/substrate#12415

Closed

Transient storage runtime host function #359

Open

gavofyork mentioned this issue Oct 28, 2022

Vision: Host functions and database support for non-Merklised-persistent data structures #245

Open

kianenigma added the Z5-epic label Mar 10, 2023

kianenigma changed the title ~~Redesign Event system~~ Vision: Redesign Event system Mar 10, 2023

juangirini added the T1-runtime label Jun 7, 2023

juangirini transferred this issue from paritytech/substrate Aug 24, 2023

the-right-joyce added I5-enhancement An additional feature request. T1-FRAME This PR/Issue is related to core FRAME, the framework. I6-meta A specific issue for grouping tasks or bugs of a specific category. and removed J0-enhancement labels Aug 25, 2023

xlc mentioned this issue Sep 13, 2023

[rpc] we need the rpc to subscribe or filter the event for substrate chain. #1532

Open

stwiname mentioned this issue Nov 17, 2023

Substrate extrinsics and events filtering subquery/developer-guild#55

Open

kianenigma added this to Parity Roadmap Feb 28, 2024

github-project-automation bot moved this to Draft in Parity Roadmap Feb 28, 2024

xlc mentioned this issue Apr 8, 2024

🧑‍💻 Wishlist: Developers #3900

Open

jonathanudd pushed a commit to jonathanudd/polkadot-sdk that referenced this issue Apr 10, 2024

Bump async-stream from 0.2.1 to 0.3.0 (paritytech#278)

4545931

kayabaNerve mentioned this issue Jan 17, 2025

Decide if we want to merklize state serai-dex/serai#647

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vision: Redesign Event system #278

Vision: Redesign Event system #278

xlc commented Apr 13, 2022 •

edited

Loading

bkchr commented Apr 13, 2022

shawntabrizi commented Apr 20, 2022 •

edited

Loading

bkchr commented Apr 21, 2022

bkchr commented Apr 21, 2022

Polkadot-Forum commented Oct 4, 2022

gavofyork commented Oct 4, 2022

xlc commented Oct 4, 2022

gavofyork commented Oct 4, 2022

gavofyork commented Oct 24, 2022

Lohann commented May 17, 2023 •

edited

Loading

Lohann commented May 19, 2023 •

edited

Loading

ntn-x2 commented May 29, 2023

kayabaNerve commented Feb 6, 2024 •

edited

Loading

Vision: Redesign Event system #278

Vision: Redesign Event system #278

Comments

xlc commented Apr 13, 2022 • edited Loading

bkchr commented Apr 13, 2022

shawntabrizi commented Apr 20, 2022 • edited Loading

bkchr commented Apr 21, 2022

bkchr commented Apr 21, 2022

Polkadot-Forum commented Oct 4, 2022

gavofyork commented Oct 4, 2022

xlc commented Oct 4, 2022

gavofyork commented Oct 4, 2022

gavofyork commented Oct 24, 2022

Lohann commented May 17, 2023 • edited Loading

Lohann commented May 19, 2023 • edited Loading

ntn-x2 commented May 29, 2023

kayabaNerve commented Feb 6, 2024 • edited Loading

xlc commented Apr 13, 2022 •

edited

Loading

shawntabrizi commented Apr 20, 2022 •

edited

Loading

Lohann commented May 17, 2023 •

edited

Loading

Lohann commented May 19, 2023 •

edited

Loading

kayabaNerve commented Feb 6, 2024 •

edited

Loading