Implement cumulus StorageWeightReclaim as wrapping transaction extension + frame system ReclaimWeight #6140

gui1117 · 2024-10-19T16:54:37Z

(rebasing of #5234)

Issues:

Transaction extensions have weights and refund weight, so the reclaiming of unused weight must happen last in the transaction extension pipeline. Currently it happens inside CheckWeight.
The cumulus storage weight reclaim transaction extension misses the proof size of the logic that runs before it in the pipeline.

Done:

A new storage ExtrinsicWeightReclaimed in frame-system. Any logic that attempts a reclaim must use this storage to avoid double reclaim.
A new function reclaim_weight in the frame-system pallet: it takes info and post info as arguments, reads the already reclaimed weight, and calculates the new unused weight from info and post info. It performs the more accurate reclaim if that is higher.
CheckWeight is unchanged and still reclaims the weight in post dispatch.
ReclaimWeight is a new transaction extension in frame-system. For solo chains it must be used last in the transaction extension pipeline. It does the final, most accurate reclaim.

StorageWeightReclaim is moved from cumulus primitives into its own pallet (in order to define benchmarks) and is changed into a wrapping transaction extension.
It records the proof size and does the reclaim using this recording together with the info and post info. So parachains don't need to use ReclaimWeight, but using it as well does not introduce a bug.

/// The TransactionExtension to the basic transaction logic.
pub type TxExtension = cumulus_pallet_weight_reclaim::StorageWeightReclaim<
     Runtime,
     (
             frame_system::CheckNonZeroSender<Runtime>,
             frame_system::CheckSpecVersion<Runtime>,
             frame_system::CheckTxVersion<Runtime>,
             frame_system::CheckGenesis<Runtime>,
             frame_system::CheckEra<Runtime>,
             frame_system::CheckNonce<Runtime>,
             frame_system::CheckWeight<Runtime>,
             pallet_transaction_payment::ChargeTransactionPayment<Runtime>,
             BridgeRejectObsoleteHeadersAndMessages,
             (bridge_to_rococo_config::OnBridgeHubWestendRefundBridgeHubRococoMessages,),
             frame_metadata_hash_extension::CheckMetadataHash<Runtime>,
     ),
>;
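The coordination through ExtrinsicWeightReclaimed described above can be sketched with a toy model. This is not the frame-system code: `ToyWeight`, `ToyState` and their fields are hypothetical stand-ins for the real `Weight`, block weight and storage item, and the component-wise `max` is an assumption about how the two reclaims are compared.

```rust
/// Toy two-dimensional weight (ref time, proof size). Hypothetical stand-in
/// for the real `Weight` type.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct ToyWeight {
    pub ref_time: u64,
    pub proof_size: u64,
}

impl ToyWeight {
    pub const fn new(ref_time: u64, proof_size: u64) -> Self {
        Self { ref_time, proof_size }
    }

    /// Component-wise saturating subtraction.
    pub fn saturating_sub(self, other: Self) -> Self {
        Self::new(
            self.ref_time.saturating_sub(other.ref_time),
            self.proof_size.saturating_sub(other.proof_size),
        )
    }

    /// Component-wise maximum (assumed comparison rule for this sketch).
    pub fn max(self, other: Self) -> Self {
        Self::new(
            self.ref_time.max(other.ref_time),
            self.proof_size.max(other.proof_size),
        )
    }
}

/// Hypothetical stand-in for the block weight plus the
/// `ExtrinsicWeightReclaimed` storage of the current extrinsic.
pub struct ToyState {
    pub block_weight: ToyWeight,
    pub extrinsic_weight_reclaimed: ToyWeight,
}

impl ToyState {
    /// Reclaim the unused weight of the current extrinsic without reclaiming
    /// twice: only the part not already reclaimed is removed from the block.
    pub fn reclaim_weight(&mut self, info: ToyWeight, post_info: ToyWeight) {
        let unspent = info.saturating_sub(post_info);
        let already_reclaimed = self.extrinsic_weight_reclaimed;
        // Keep whichever reclaim is higher in each dimension.
        let accurate_reclaim = already_reclaimed.max(unspent);
        let to_reclaim = accurate_reclaim.saturating_sub(already_reclaimed);
        self.block_weight = self.block_weight.saturating_sub(to_reclaim);
        self.extrinsic_weight_reclaimed = accurate_reclaim;
    }
}
```

In this model a first reclaim of (20, 10) followed by a second computation of (30, 5) only removes the extra (10, 0) from the block weight, which is the double-reclaim protection the description refers to.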

bkchr · 2024-10-20T19:24:07Z

Anyway this doesn't have any vulnerability, it just wastes resources, but people should put the CheckWeight last in the pipeline.

I would say it depends. When you are required to put CheckWeight as the latest extension, it also means that you are missing a cheap, early return.

georgepisaltu · 2024-10-21T06:46:18Z

you are missing a cheap, early return

Going through the pipeline should be cheap anyway. It's just extensions which are pretty light and the "wasted" work for overweight transactions should be done off-chain when validators are building their blocks.

Because the weight check isn't hardcoded and users can build whatever extension they like to handle it, we need to have some sort of convention when we introduce other weight related logic. I skimmed through the PR and I like the approach, but I won't formally approve because I didn't review thoroughly.

gui1117 · 2024-10-21T15:05:44Z

Anyway this doesn't have any vulnerability, it just wastes resources, but people should put the CheckWeight last in the pipeline.

I would say it depends. When you are required to put CheckWeight as the latest extension, it also means that you are missing a cheap, early return.

Maybe it is time to split this transaction extension into CheckWeight and RefundWeight.

EDIT: or we can do the RefundWeight in note_applied_extrinsic.

EDIT: or we can use a storage to store the weight refunded by CheckWeight, then StorageWeightReclaim will just take this value instead of trying to guess it incorrectly.

EDIT: I decided to go with a new storage ExtrinsicWeight or ExtrinsicWeightRefunded. CheckWeight will register its refund there, then StorageWeightReclaim will undo the CheckWeight operation and do the correct refund.

Later we can introduce another RefundWeight for solo-chains or parachains that don't want to use StorageWeightReclaim. RefundWeight can be placed at the end of the pipeline, and CheckWeight is unchanged and not breaking.

gui1117 · 2024-10-31T06:39:43Z

cumulus/parachains/runtimes/assets/asset-hub-westend/src/weights/frame_system_extensions.rs

-//! HOSTNAME: `gleipnir`, CPU: `AMD Ryzen 9 7900X 12-Core Processor`
-//! WASM-EXECUTION: `Compiled`, CHAIN: `Some("asset-hub-westend-dev")`, DB CACHE: 1024
+//! HOSTNAME: `697235d969a1`, CPU: `Intel(R) Xeon(R) CPU @ 2.60GHz`
+//! WASM-EXECUTION: `Compiled`, CHAIN: `None`, DB CACHE: 1024


I used /cmd bench --pallet frame_system_extensions to run this benchmark.
Somehow this doesn't give a chain anymore.
Are there specific settings where giving a chain would change the result of the benchmarks?

afair it's expected behavior for frame-omni-bencher
right, @ggwpez ?

btw, for now it's still better to use the old command bot, as we are still testing/tweaking this one. Here are the docs for the old one: https://command-bot.parity-prod.parity.io/static/docs/latest.html?repo=polkadot-sdk, but maybe you can leave these results if they look fine

Once it's tested we'll notify everyone on the forum about the new bot

Great to know. Also, the new /cmd seems to have missed coretime-rococo, coretime-westend, people-rococo, people-westend when I wrote /cmd bench --pallet frame_system_extensions. (I might be mistaken)

Hm I think this is since we use the --runtime argument, and it is not extracting the chain spec name. But it should, good find: #6320

gui1117 · 2024-10-31T09:30:38Z

PR should be ready for review, I updated the description

georgepisaltu

Great work! I'm going to approve once the questions in the comments are answered.

georgepisaltu · 2024-11-05T16:15:05Z

substrate/frame/system/src/lib.rs

+				current_weight.accrue(already_reclaimed, info.class);
+				current_weight.reduce(accurate_reclaim, info.class);


I don't think there's any case where we could allow a negative refund, in other words a post dispatch weight greater than what was estimated. If such a case ever exists, we just take the hit and don't refund anything, the code was executed and the damage was done. There are multiple places in the weight checking code that assumes that no weight increases are possible after dispatch and take the route described above.

I'd change the code to something like this:

Suggested change

-current_weight.accrue(already_reclaimed, info.class);
-current_weight.reduce(accurate_reclaim, info.class);
+let to_reclaim = accurate_reclaim.saturating_sub(already_reclaimed);
+current_weight.reduce(to_reclaim, info.class);

The saturation will happen if accurate_reclaim is less than already_reclaimed.
Indeed, I considered it better to make ReclaimWeight authoritative and erase the previous reclaim.
But saturating sounds better.
I will update.

This is equivalent because a few lines before we do let accurate_reclaim = already_reclaimed.max(unspent);

So the saturation will never happen.

No logic change refactor: 472d63c
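The equivalence argued in this thread can be checked on scalars. The sketch below is an illustration only: it uses plain `u64` values instead of per-class `Weight`, models `accrue` and `reduce` as saturating add and subtract (an assumption), and the function names are hypothetical.

```rust
/// Original form: add back what was already reclaimed, then subtract the
/// accurate reclaim.
pub fn reclaim_two_steps(current: u64, already_reclaimed: u64, unspent: u64) -> u64 {
    let accurate_reclaim = already_reclaimed.max(unspent);
    current
        .saturating_add(already_reclaimed)
        .saturating_sub(accurate_reclaim)
}

/// Suggested form: subtract only the difference in a single operation.
pub fn reclaim_single_step(current: u64, already_reclaimed: u64, unspent: u64) -> u64 {
    let accurate_reclaim = already_reclaimed.max(unspent);
    // `accurate_reclaim >= already_reclaimed` always holds because of the
    // `max` above, so this subtraction never actually saturates.
    let to_reclaim = accurate_reclaim.saturating_sub(already_reclaimed);
    current.saturating_sub(to_reclaim)
}
```

Away from `u64::MAX` overflow, both functions return the same value for any input, which matches the "no logic change" description of the refactor.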

georgepisaltu · 2024-11-05T17:27:00Z

cumulus/pallets/weight-reclaim/src/lib.rs

+		// NOTE: `calc_actual_weight` will take the minimum of `post_info` and `info` weights.
+		// This means any underestimation of compute time in the pre dispatch info will not be
+		// taken into account.
+		let benchmarked_weight = post_info_with_inner.calc_actual_weight(info);


I'd find it easier to read if this was benchmarked_actual_weight or something, the current name implies it's from before dispatch.

Yes, I renamed and tried to keep the concept of "actual" 5322297
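The NOTE in the diff above says the actual weight is the minimum of the `post_info` and `info` weights. A minimal sketch of that rule, using a hypothetical `ToyWeight` type and a hypothetical `toy_calc_actual_weight` function rather than the real `PostDispatchInfo::calc_actual_weight`:

```rust
/// Toy two-dimensional weight (ref time, proof size); hypothetical stand-in.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct ToyWeight {
    pub ref_time: u64,
    pub proof_size: u64,
}

impl ToyWeight {
    /// Component-wise minimum.
    pub fn min(self, other: Self) -> Self {
        Self {
            ref_time: self.ref_time.min(other.ref_time),
            proof_size: self.proof_size.min(other.proof_size),
        }
    }
}

/// Actual weight as described in the NOTE: never more than the pre-dispatch
/// `info` weight, so an underestimation in `info` is not corrected here.
/// With no post-dispatch weight, the pre-dispatch weight is used as is.
pub fn toy_calc_actual_weight(info: ToyWeight, post: Option<ToyWeight>) -> ToyWeight {
    match post {
        Some(actual) => actual.min(info),
        None => info,
    }
}
```

For example, with `info` = (100, 50) and a post-dispatch weight of (120, 40), this returns (100, 40): the ref time is capped by `info`, while the smaller proof size is kept.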

georgepisaltu · 2024-11-05T17:32:24Z

cumulus/pallets/weight-reclaim/src/lib.rs

+			frame_system::ExtrinsicWeightReclaimed::<T>::put(accurate_unspent);
+		});
+
+		Ok(inner_refund)


Shouldn't we refund the overestimated proof sizes of the inner extensions? I read the function a bunch of times but I don't think we do, the proof size overestimation we'd need to refund would be something like post_info_with_inner.calc_actual_weight(info).proof_size().saturating_sub(measured_proof_size) and we'd need to add this to inner_refund. Am I missing anything?

Indeed, I didn't bother returning the accurate unspent amount.

If we include the unspent amount of the whole dispatch (the call and the inner transaction extensions), then the unspent amount we can return is info.total_weight().saturating_sub(accurate_weight).

I will return this and also ensure it is tested.

done in 597fde4
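The returned amount discussed here can be illustrated as follows. This is a sketch under assumptions: `ToyWeight` and `accurate_unspent` are hypothetical names, and `accurate_weight` is assumed to be the benchmarked actual weight with its proof size replaced by the measured one, which the thread implies but does not spell out.

```rust
/// Toy two-dimensional weight (ref time, proof size); hypothetical stand-in.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct ToyWeight {
    pub ref_time: u64,
    pub proof_size: u64,
}

impl ToyWeight {
    /// Component-wise saturating subtraction.
    pub fn saturating_sub(self, other: Self) -> Self {
        Self {
            ref_time: self.ref_time.saturating_sub(other.ref_time),
            proof_size: self.proof_size.saturating_sub(other.proof_size),
        }
    }
}

/// Unspent weight for the whole dispatch (call and inner extensions).
/// Assumption: the accurate weight keeps the benchmarked ref time and uses
/// the proof size measured during execution.
pub fn accurate_unspent(
    info_total: ToyWeight,
    benchmarked_actual: ToyWeight,
    measured_proof_size: u64,
) -> ToyWeight {
    let accurate_weight = ToyWeight {
        ref_time: benchmarked_actual.ref_time,
        proof_size: measured_proof_size,
    };
    info_total.saturating_sub(accurate_weight)
}
```

If the measured proof size exceeds the pre-dispatch estimate, the saturating subtraction yields zero unspent proof size rather than a negative refund.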

gui1117 · 2024-11-06T03:24:10Z

~~I forgot to include https://github.com/paritytech/polkadot-sdk/pull/5281/files~~

gui1117 · 2024-11-06T04:21:36Z

@georgepisaltu's concerns should be addressed now. I had also missed one change from the old transaction extension: keeping the accrue when the node proof size is higher than what is registered in the block weight (1ad6e56).

@skunert you might be interested in this PR

georgepisaltu

Good to merge from my POV 😉 when the last comment is addressed.

georgepisaltu · 2024-11-06T11:14:03Z

cumulus/pallets/weight-reclaim/src/tests.rs

+
+		assert_ok!(Tx::post_dispatch(pre, &info, &mut post_info, LEN, &Ok(())));
+
+		// TODO TODO: assert_eq!(post_info.actual_weight, Weight::from_parts(0, 650));


Need to fix this and also we probably need to test this with a post dispatch info with a Some(...) as post_info.actual_weight as well as a None, but in practice our CheckedExtrinsic::apply + dispatch_transaction implementation would guarantee that the weight is always Some.

Fixed in 363f7d3

The case where None is given as post_info.actual_weight is also tested.
In that case some unspent weight is returned (but ignored later, as we can't reduce weight from None).

storage weight reclaim

d252ce1

gui1117 mentioned this pull request Oct 19, 2024

WIP: implement StorageWeightReclaim as wrapping transaction extension with weight #5234

Closed

1 task

move prdoc

670add6

gui1117 added T9-cumulus This PR/Issue is related to cumulus. T2-pallets This PR/Issue is related to a particular pallet. labels Oct 19, 2024

gui1117 added 10 commits October 20, 2024 16:33

fmt

d1e480a

use new extension for bridge hub rococo

06903c8

fix macro

9500cd1

remove unnecessary deprecated

9bc4f6c

umbrella

c316f8a

use in template

6e7b072

Merge branch 'master' into gui-storage-proof-size-reclaim-more-accurate

b9f00fc

fix template

7c777db

prdoc semver

f336a34

complete wrapper including bare stuff

f40892f

coordinate refunds with a storage

337d5c8

gui1117 requested a review from a team as a code owner October 24, 2024 08:50

gui1117 added 4 commits October 24, 2024 21:37

Merge remote-tracking branch 'origin/master' into gui-storage-proof-size-reclaim-more-accurate

4740554

do not change check weight position

643cfda

fmt

52cc2aa

remove forgotten dbg

f4f5e19

paritytech deleted a comment from github-actions bot Oct 25, 2024

gui1117 added 2 commits October 25, 2024 15:00

fix benchmark

cf2275f

Merge branch 'master' into gui-storage-proof-size-reclaim-more-accurate

7a001e8

paritytech deleted a comment from github-actions bot Oct 25, 2024

gui1117 requested a review from a team as a code owner October 31, 2024 00:34

github-actions bot deleted a comment from gui1117 Oct 31, 2024

actions-user and others added 4 commits October 31, 2024 01:33

Update from gui1117 running command 'bench --pallet frame_system --clean'

eebb5c7

Revert "Update from gui1117 running command 'bench --pallet frame_system --clean'"

906f150

This reverts commit eebb5c7.

fix test

e5b8bf6

Update from gui1117 running command 'bench --pallet frame_system_extensions --clean'

ba1b3bf

gui1117 commented Oct 31, 2024

gui1117 added 3 commits October 31, 2024 15:45

remove unused

be6b34e

better trace and some comment

bc625fc

use weight reclaim everywhere

f792821

gui1117 requested a review from a team as a code owner October 31, 2024 07:54

gui1117 added 2 commits October 31, 2024 17:05

fmt

1704d70

add tests

10b0ef0

paritytech deleted a comment from github-actions bot Oct 31, 2024

gui1117 changed the title ~~Implement StorageWeightReclaim as wrapping transaction extension~~ Implement cumulus StorageWeightReclaim as wrapping transaction extension + frame system ReclaimWeight Nov 1, 2024

georgepisaltu reviewed Nov 5, 2024

gui1117 and others added 5 commits November 6, 2024 10:45

Update cumulus/pallets/weight-reclaim/src/lib.rs

5e8cae0

Co-authored-by: georgepisaltu <52418509+georgepisaltu@users.noreply.github.com>

Update substrate/frame/system/src/lib.rs

a0a0589

Co-authored-by: georgepisaltu <52418509+georgepisaltu@users.noreply.github.com>

more explicit names

5322297

single operation on weight

472d63c

return more accurate unspent from storage reclaim tx ext

597fde4

gui1117 added 2 commits November 6, 2024 13:17

Keep accrue when node proof size is bigger: #5281

1ad6e56

outdated comment

898caa6

georgepisaltu approved these changes Nov 6, 2024

gui1117 commented Nov 6, 2024

fix post info + tests

363f7d3
