feat: a standard for accessing the transaction log #66

roman-kashitsyn · 2022-10-17T21:50:38Z

No description provided.

sea-snake · 2022-10-28T17:11:37Z

The Transaction record has a field kind. Is this value limited to namespaced query/update method names, or can there also be transactions for events that happen during, for example, a heartbeat?

The Transaction has an optional field with the same name as the kind value. I assume a response can be deserialized based on record { kind: text; } first and then based on this text value you know how to deserialize the whole response.

But what about services that want to show the whole log and don't have all the possible Candid definitions? Without a Candid definition for each possible response type, you can't deserialize the data to its original value, since an encoded response doesn't contain, for example, the original keys of a struct. Maybe the service could fetch the Candid definition to resolve this (not sure if we can do this from an inter-canister call). Or the transaction data could use a more generic format, similar to the metadata in ICRC-1.

I'm not sure how a canister could implement certification for all transactions. If the transactions were grouped in blocks that are certified as seen in the ICP ledger, then the number of hashes would be significantly lower. Is certification something that should be part of ICRC-3 standard? Without certification, a web service would be required to make update calls.

roman-kashitsyn · 2022-10-31T10:19:17Z

Great questions, @sea-snake!

Our experience with the ICP ledger suggests that having a single API for accessing transactions from canisters and agents (e.g., from a Rosetta node) does not work well, partly because Candid is not a great encoding for certification.

So the WG agreed to split the log access into smaller specifications: one for canisters and another for agents. Canisters want structured data and don't care about certification. Agents want certification and don't care too much about Candid.

ICRC-3 is a specification for canisters.

> The Transaction has an optional field with the same name as the kind value. I assume a response can be deserialized based on record { kind: text; } first and then based on this text value you know how to deserialize the whole response.

Not quite: you can decode the whole record at once. If your code does not know about some transaction type, the Candid decoder will omit this field, and your code won't be able to access it.

For example, if your code assumes this structure:

record {
  kind: text; icrc1_mint : opt ...; icrc1_burn : opt ...; icrc1_transfer : opt ...;
}

And the ledger gives you the following:

record { kind = "icrc2_approve"; icrc2_approve = opt record { ... } }

the decoder will succeed, and you'll get record { kind = "icrc2_approve"; icrc1_mint = null ; <other fields are also null> }.
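
The decoding behaviour described above can be imitated with plain dictionaries. This is only a sketch, not a real Candid decoder: `decode_transaction` and `KNOWN_OPT_FIELDS` are hypothetical names, and the `amount` payload is a placeholder. It shows that a client that knows only the ICRC-1 fields still decodes an `icrc2_approve` record, with every known optional field set to null (`None`) and the unknown field dropped.

```python
from typing import Any, Dict, Optional

# Hypothetical: the optional fields this client's expected type declares.
KNOWN_OPT_FIELDS = ("icrc1_mint", "icrc1_burn", "icrc1_transfer")


def decode_transaction(wire: Dict[str, Any]) -> Dict[str, Optional[Any]]:
    """Imitate decoding a wider record into the narrower type the client expects."""
    decoded: Dict[str, Optional[Any]] = {"kind": wire["kind"]}
    for field in KNOWN_OPT_FIELDS:
        # An `opt` field that is absent on the wire decodes to null (None here).
        decoded[field] = wire.get(field)
    # Fields the client does not declare (e.g. icrc2_approve) are silently dropped.
    return decoded


# Placeholder payload; the real contents of icrc2_approve are not shown in this thread.
approve = {"kind": "icrc2_approve", "icrc2_approve": {"amount": 10}}
result = decode_transaction(approve)
```

The client can still branch on `result["kind"]` to detect that it received a transaction type it does not understand.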

> I'm not sure how a canister could implement certification for all transactions.

ICRC-3 will not support certification; it's a canister-oriented API. Another specification based on CBOR encoding and representation-independent hashes will follow later. We will need such a spec to implement an efficient Rosetta node.

plitzenberger

The current architecture requires the service to move data to an archive before it runs out of memory. Wouldn't it be easier to have an orchestration canister that holds the information about which canisters hold which transaction ranges? Then, whenever the latest canister runs out of memory, a new one is created and set as the latest on the orchestration canister. This way, the get_transactions request would always have the same interface, independent of which range is requested.
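
A minimal sketch of the range-to-canister index such an orchestration canister might keep, assuming each storage canister holds a contiguous range that starts where the previous one ends. `RangeIndex`, `add_canister`, and `lookup` are hypothetical names used only for illustration; none of them comes from the proposal.

```python
import bisect
from typing import List


class RangeIndex:
    """Maps a transaction index to the canister that stores it."""

    def __init__(self) -> None:
        self._starts: List[int] = []      # first transaction index of each canister
        self._canisters: List[str] = []   # canister ids, parallel to _starts

    def add_canister(self, first_tx: int, canister_id: str) -> None:
        # New canisters always take over at the tail of the log.
        if self._starts and first_tx <= self._starts[-1]:
            raise ValueError("ranges must be appended in increasing order")
        self._starts.append(first_tx)
        self._canisters.append(canister_id)

    def lookup(self, tx_index: int) -> str:
        # Find the last canister whose range starts at or before tx_index.
        pos = bisect.bisect_right(self._starts, tx_index) - 1
        if pos < 0:
            raise KeyError(tx_index)
        return self._canisters[pos]


index = RangeIndex()
index.add_canister(0, "storage-0")
index.add_canister(1000, "storage-1")
```

With this state, `index.lookup(999)` resolves to `"storage-0"` and `index.lookup(1000)` to `"storage-1"`.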

standards/ICRC-3/ICRC-3.did

plitzenberger · 2023-01-11T09:30:56Z

standards/ICRC-3/ICRC-3.did

+    //
+    // For each entry `e` in [archived_transactions], `[e.from, e.from + len)` is a sub-range
+    // of the originally requested transaction range.
+    archived_transactions : vec record {


The archived_transactions field feels a bit unrelated to me. I would expect this as the response to a metadata request. Or is this related to the specified requested range in some way?

roman-kashitsyn · 2023-01-19

> Or is this related to the specified requested range in some way?

Yes, the ledger restricts entries in the archived transactions list to the range the client requested.

We didn't want to have a separate "metadata" endpoint that returns a range -> canister assignment because this API would suffer from race conditions: by the time you received the response, the assignment of the tail might have moved already.
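
The restriction of archive entries to the requested range amounts to an interval intersection. The sketch below assumes half-open ranges given as a start index and a length; `Range`, `clip`, and `archived_entries` are hypothetical helpers, and `start` stands in for the `from` field of the interface because `from` is a Python keyword.

```python
from typing import List, NamedTuple, Optional


class Range(NamedTuple):
    start: int   # corresponds to `from` in the Candid interface
    length: int  # corresponds to `len`


def clip(archive: Range, requested: Range) -> Optional[Range]:
    """Intersect an archive's range with the requested range, or return None."""
    lo = max(archive.start, requested.start)
    hi = min(archive.start + archive.length, requested.start + requested.length)
    if lo >= hi:
        return None  # no overlap: this archive is not mentioned in the response
    return Range(lo, hi - lo)


def archived_entries(archives: List[Range], requested: Range) -> List[Range]:
    """Return only the sub-ranges of the request that live in archives."""
    clipped = (clip(a, requested) for a in archives)
    return [r for r in clipped if r is not None]


archives = [Range(0, 1000), Range(1000, 1000)]
entries = archived_entries(archives, Range(500, 1000))
```

Here `entries` contains one sub-range per archive, each lying fully inside the requested range `[500, 1500)`.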

plitzenberger · 2023-01-11T09:39:19Z

standards/ICRC-3/ICRC-3.did

+        // The function you should call to fetch the archived transactions.
+        // The range of the transaction accessible using this function is given by [from]
+        // and [len] fields above.
+        callback : QueryArchiveFn;


I am trying to understand why we have different interfaces for querying transactions from the archive. This data should point to the suitable canister where I can call get_transactions in the same way, IMO.

roman-kashitsyn · 2023-01-19

That's a possibility; this approach would allow for multi-stage archival.
The main problem with having nested get_transactions is that writing a client becomes harder: how many redirects would you want to follow?
With the current API, you won't have more than one redirect.
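
To illustrate the single-redirect property, here is a sketch of a client against an in-memory stand-in for the ledger and one archive. Everything in it is an assumption made for the example: the dictionary-shaped response, the `start`/`length` keys (standing in for `from` and `len`), the sample data, and the premise that archived transactions have lower indices than the ones the ledger still holds.

```python
from typing import Any, Callable, Dict, List

ARCHIVE = ["tx0", "tx1", "tx2"]   # transactions 0..3, already archived
LOCAL_FIRST = 3                   # index of the first transaction kept by the ledger
LOCAL = ["tx3", "tx4", "tx5"]     # transactions 3..6


def query_archive(start: int, length: int) -> List[str]:
    """Stand-in for the archive callback (QueryArchiveFn in the interface)."""
    return ARCHIVE[start:start + length]


def get_transactions(start: int, length: int) -> Dict[str, Any]:
    """Stand-in for the ledger: returns local transactions plus archive pointers."""
    end = start + length
    archived = []
    if start < LOCAL_FIRST:
        archived.append({
            "start": start,
            "length": min(end, LOCAL_FIRST) - start,
            "callback": query_archive,
        })
    lo = max(start, LOCAL_FIRST) - LOCAL_FIRST
    hi = max(end, LOCAL_FIRST) - LOCAL_FIRST
    return {"transactions": LOCAL[lo:hi], "archived_transactions": archived}


def fetch_all(ledger: Callable[[int, int], Dict[str, Any]], start: int, length: int) -> List[str]:
    """Follow each archive callback exactly once; callbacks never redirect again."""
    response = ledger(start, length)
    txs: List[str] = []
    for entry in response["archived_transactions"]:
        txs.extend(entry["callback"](entry["start"], entry["length"]))
    txs.extend(response["transactions"])
    return txs


fetched = fetch_all(get_transactions, 1, 4)
```

The client needs no loop over redirects: one call to the ledger, then at most one call per archive entry.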

dietersommer · 2023-01-19T09:26:37Z

standards/ICRC-3/README.md

+  5. The archive implementors decided not to return more than 2_000 transactions per request.
+     The archive returns the following value.
+     ```candid
+     record { transactions : vec { /* transactions 0..2_000 */ } }


Off-by-one error: this should probably be { /* transactions 0..1_999 */ }, as only 2,000 transactions can be returned.
The same applies to the rest of the example below.

roman-kashitsyn · 2023-01-19

I use Rust syntax for ranges here, which does not include the right bound: 0..2000 means [0, 2000) in standard math notation. We can also switch to the math notation to avoid confusion.

dietersommer · 2023-01-19

Ah, OK. I guess if we make it clear, it should be fine as well. But otherwise, it is confusing. I proposed a change to standard notation; feel free to reject it and instead make clear which notation is used. I'm not sure which is less confusing to most people.
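
The two notations discussed above can be checked with a short sketch. Python's `range`, like Rust's `0..2000`, excludes the right bound. `half_open_chunks` is a hypothetical helper, and the total of 5,000 transactions is an arbitrary figure chosen for illustration; only the 2,000-per-request limit comes from the example under review.

```python
from typing import List, Tuple


def half_open_chunks(start: int, end: int, max_len: int) -> List[Tuple[int, int]]:
    """Split [start, end) into consecutive half-open chunks of at most max_len items."""
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    return [(lo, min(lo + max_len, end)) for lo in range(start, end, max_len)]


# range(0, 2000) is [0, 2000): it has 2000 elements and its last element is 1999.
first_page = range(0, 2000)
chunks = half_open_chunks(0, 5000, 2000)
```

Each chunk's right bound equals the next chunk's left bound, which is why the half-open form avoids the `1_999`/`2_000` bookkeeping that closed intervals need.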

william-iclight · 2023-02-03T08:12:16Z

We have a solution for this and have been running it for some time.

DRC202 is a standard for scalable storage of token transaction records. It supports multi-token storage, automatic scaling to create storage canisters (buckets), and automatic routing of query records.

Automatic expansion: the proxy contract uses a Bloom filter.
Supports multiple ways to query transaction records: by AccountId, Principal, Txid, or BlockIndex.

More information: https://github.com/iclighthouse/DRC_standards/blob/main/DRC202/DRC202.md

roman-kashitsyn added 2 commits October 17, 2022 23:50

feat: a standard for accessing the transaction log

ac5c538

Expanded the docs and added an example

c3404e0

roman-kashitsyn marked this pull request as ready for review October 18, 2022 14:38

Fix candid tests

df1a7e1

Merge branch 'main' into roman-icrc3

9e4787e

plitzenberger reviewed Jan 11, 2023

dietersommer reviewed Jan 19, 2023

dietersommer added 2 commits January 19, 2023 10:32

Corrections to ICRC-3

46a19cb

Correction to ICRC-3 proposal

ef5a25f

roman-kashitsyn commented Oct 17, 2022

sea-snake commented Oct 28, 2022

roman-kashitsyn commented Oct 31, 2022

plitzenberger left a comment

plitzenberger Jan 11, 2023

roman-kashitsyn Jan 19, 2023

plitzenberger Jan 11, 2023

roman-kashitsyn Jan 19, 2023

dietersommer Jan 19, 2023

roman-kashitsyn Jan 19, 2023

dietersommer Jan 19, 2023

william-iclight commented Feb 3, 2023
