revive: Limit the amount of static memory a contract can use #5726

athei · 2024-09-16T13:25:17Z

This will make sure that when uploading new code that the declared static memory fits within a defined limit. We apply different limits to code and data. Reason is that code will consume much more memory per byte once decoded during lazy execution.

This PR:

Remove the MaxCodeLen from the Config to we maintain tight control over it.
Defines a single STATIC_MEMORY_BYTES knob that limits the maximum decoded size.
Enforces them only on upload but not on execution so we can raise them later.
Adapt the worst case calculation in integrity_check.
Bumps the max stack depth from 5 to 10 as this will still fit within our memory envelope.
The memory limit per contract is now a cool 1MB that can be spent on data or code.
Bump PolkaVM for good measure
The blob is limited to 256kb which is just a sanity check to not even try parsing very big inputs.

substrate/frame/revive/src/limits.rs

Co-authored-by: Cyrill Leutwiler <cyrill@parity.io>

xermicus

LGTM

koute · 2024-09-17T20:47:19Z

substrate/frame/revive/src/limits.rs

+	pub fn enforce<T: Config>(blob: Vec<u8>) -> Result<CodeVec, DispatchError> {
+		fn round_page(n: u64) -> u64 {
+			debug_assert!(
+				PAGE_SIZE != 0 && (PAGE_SIZE & (PAGE_SIZE - 1)) == 0,


FYI, you have the next_multiple_of method on primitives you can use instead of this helper.

Ahh yes. This function panics on overflow, though. Hence, I forced the inputs to be u32 and do the rounding in u64. When we switch to 64bit we probably just error out when the section sizes don't fit into u32.

koute · 2024-09-17T20:50:19Z

substrate/frame/revive/src/limits.rs

+		// plus the overhead of instructions in memory which is derived from the code
+		// size itself and the number of instruction
+		let memory_size = (blob.len() as u64)
+			.saturating_add(round_page(program.ro_data_size() as u64))


For all of the u32 -> u64 casts you could use u64::from, which I think is nicer as it will give you an error if this cast ever becomes truncating (in PolkaVM I have a clippy lint set up for this).

Alternatively, instead of the saturating_* spam you could use core::num::Saturating wrapper plus a local trait to make it nicer, but this might be a complete overkill.

trait Cast { fn cast(self) -> Saturating<u64>; } impl Cast for u32 { fn cast(self) -> Saturating<u64> { return Saturating(u64::from(self)) } } impl Cast for usize { fn cast(self) -> Saturating<u64> { return Saturating(self as u64) } } let memory_size = blob.len().cast() + program.ro_data_size().cast() - program.ro_data.len().cast() + ...

Yes to the first point. I replaced by into where possible. The code then breaks when we upgrade to 64bit and we need to re-evaluate.

Looking at the saturating_* spam gives me a piece of mind. Seeing raw arithmetic is kinda triggering :)

koute · 2024-09-17T21:22:17Z

substrate/frame/revive/src/wasm/mod.rs

@@ -56,7 +57,7 @@ use sp_runtime::DispatchError;
 #[codec(mel_bound())]
 #[scale_info(skip_type_params(T))]
 pub struct WasmBlob<T: Config> {


Shouldn't the WasmBlob be renamed? (: (Not necessarily in this PR of course.)

Yes there is a lot of renaming to do. I am pushing this of because this will be a nasty PR.

pgherveou · 2024-09-18T12:16:00Z

substrate/frame/revive/src/limits.rs

@@ -58,3 +61,85 @@ pub const STORAGE_KEY_BYTES: u32 = 128;
 ///
 /// The buffer will always be disabled for on-chain execution.
 pub const DEBUG_BUFFER_BYTES: u32 = 2 * 1024 * 1024;
+
+/// The page size in which PolkaVM should allocate memory chunks.
+pub const PAGE_SIZE: u32 = 4 * 1024;


maybe just use VM_MIN_PAGE_SIZE from polkavm-common?

That constant's not for public consumption. (: (In general everything is 'polkavm-common' is private)

Besides, what matters is the page size that's configured, not what the minimum supported is, so this way is how it is actually intended to be used.

pgherveou · 2024-09-18T12:30:25Z

substrate/frame/revive/src/lib.rs

-				.saturating_sub(MAX_STACK_SIZE)
-				.saturating_div(17 * 4);
+				.saturating_sub(STATIC_MEMORY_BYTES)
+				.saturating_div(4);


Suggested change

.saturating_div(4);

.saturating_div(EXTRA_OVERHEAD_PER_CODE_BYTE);

This is to account for memory allocator overhead (see the docs above). But it's a good point: All those numbers should have names.

Replaced by constants

Limit memory consumption

8296a9f

athei mentioned this pull request Sep 16, 2024

Limit the amount of memory a contract can use #5725

Closed

athei requested a review from koute September 16, 2024 13:30

athei added the R0-silent Changes should not be mentioned in any release notes label Sep 16, 2024

athei added 4 commits September 16, 2024 16:23

Add prdoc

a0e1c52

Change formula for static memory limit

d2a6683

Simplify formula

ba2dcbd

fix docs

a97a95d

athei requested review from pgherveou and xermicus September 16, 2024 17:22

Merge branch 'master' into at/limit_mem

f689703

xermicus reviewed Sep 17, 2024

View reviewed changes

substrate/frame/revive/src/limits.rs Outdated Show resolved Hide resolved

substrate/frame/revive/src/limits.rs Outdated Show resolved Hide resolved

xermicus reviewed Sep 17, 2024

View reviewed changes

substrate/frame/revive/src/limits.rs Outdated Show resolved Hide resolved

athei and others added 4 commits September 17, 2024 09:32

Fix typo

4e0faf6

Co-authored-by: Cyrill Leutwiler <cyrill@parity.io>

Use checked Math

a568f74

Move code blob size check into the enforce function

3e1f15e

Merge branch 'master' into at/limit_mem

7093700

athei requested a review from xermicus September 17, 2024 15:54

Import vec

4495c9e

xermicus approved these changes Sep 17, 2024

View reviewed changes

koute approved these changes Sep 17, 2024

View reviewed changes

athei added 2 commits September 18, 2024 08:17

Switch to next_multiple of and remove as 64 where possible

236b358

round_page returns u64

8f6d619

pgherveou reviewed Sep 18, 2024

View reviewed changes

pgherveou approved these changes Sep 18, 2024

View reviewed changes

athei added 2 commits September 18, 2024 16:05

Merge branch 'master' into at/limit_mem

9aa739a

Remove magic numbers

af06631

athei enabled auto-merge September 18, 2024 14:25

athei added this pull request to the merge queue Sep 18, 2024

Merged via the queue into master with commit 310ef5c Sep 18, 2024
206 of 209 checks passed

athei deleted the at/limit_mem branch September 18, 2024 15:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

revive: Limit the amount of static memory a contract can use #5726

revive: Limit the amount of static memory a contract can use #5726

athei commented Sep 16, 2024 •

edited

Loading

xermicus left a comment

koute Sep 17, 2024

athei Sep 18, 2024

koute Sep 17, 2024

athei Sep 18, 2024

koute Sep 17, 2024

athei Sep 18, 2024

pgherveou Sep 18, 2024

koute Sep 18, 2024

pgherveou Sep 18, 2024

athei Sep 18, 2024

athei Sep 18, 2024

	.saturating_div(4);
	.saturating_div(EXTRA_OVERHEAD_PER_CODE_BYTE);

revive: Limit the amount of static memory a contract can use #5726

revive: Limit the amount of static memory a contract can use #5726

Conversation

athei commented Sep 16, 2024 • edited Loading

xermicus left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

athei commented Sep 16, 2024 •

edited

Loading