
bug: fix for memory issue #1066

Merged · 73 commits · Sep 12, 2023

Conversation

Contributor

@segfault-magnet segfault-magnet commented Jul 27, 2023

Closes #1058.
Co-developed with @Salka1988.
Special thanks to @nedsalk for brainstorming with us.

The issue from the example

There was a memory problem in the example, but not for the reasons listed in the issue. The issue (quoting the audit) says:

> The reason for this issue is that the decoder supports zero-sized types which do not result in progress during parsing. Therefore, the parser might run for a long time and eventually run out of memory.

Units in Sway are encoded as 1 WORD of zeroes, so the given example didn't experience the zero-sized-type issue. What it did experience is a caveat of how we decoded multiple tokens of the same type: by cloning the param type. The clone is now replaced with `std::iter::repeat`, which hands out copies of a reference to a single `ParamType` as needed. Since only one `ParamType` instance exists at any moment, the memory issue is solved.
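A minimal, self-contained sketch of the pattern; `ParamType` and `decode_multiple` here are stand-ins, not the actual fuels-rs internals:

```rust
#[derive(Debug)]
enum ParamType {
    Unit,
}

// Before the fix: something akin to `vec![param_type.clone(); count]`
// materialized `count` owned copies up front. With `std::iter::repeat`,
// the same `&ParamType` reference is handed out lazily, so only one
// instance exists no matter how many tokens are decoded.
fn decode_multiple(param_type: &ParamType, count: usize) -> Vec<String> {
    std::iter::repeat(param_type)
        .take(count)
        .map(|pt| format!("{pt:?}")) // stand-in for the real token decoding
        .collect()
}

fn main() {
    assert_eq!(decode_multiple(&ParamType::Unit, 3).len(), 3);
}
```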

Real zero-sized types

A real zero-sized type (ZST) would be an empty struct or an empty string array (i.e. `str[0]`).

Enums don't come into play, since empty enums cannot be instantiated. Also, an enum with a ZST variant would still be at least 1 WORD, since it encodes the discriminant, making it not a ZST.

You also cannot define an empty tuple (as far as we're aware).

Empty structs could also eventually be used as markers (e.g. `Tcp<Connected>` where `struct Connected {}`).
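For illustration, a Rust analogue of that marker pattern (hypothetical names, not from the SDK):

```rust
#![allow(dead_code)]

use std::marker::PhantomData;

// `Connected` is a zero-sized marker: it carries no data, only
// type-level information about the connection's state.
struct Connected;

struct Tcp<State> {
    _state: PhantomData<State>,
}

fn main() {
    // The state is tracked purely in the type system, at zero runtime cost.
    let _conn: Tcp<Connected> = Tcp { _state: PhantomData };
}
```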

The solution

The decoder can now be configured with two upper limits: one for the maximum tolerable depth, the other for the maximum number of tokens allowed.

The depth limit protects our stack (think call frames and recursion) against types such as this:

```sway
struct Parent {
    child1: Child1,
}
struct Child1 {
    child2: Child2,
}
// ...
struct Child7812340 {
    child7812341: Child7812341,
}
struct Child7812341 {}
```

The whole structure would be zero-sized, yet contain so many levels of nested zero-sized structs that we might run out of stack trying to decode it.
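Roughly how such a depth guard works; the code below is an illustrative sketch, not the fuels-rs internals:

```rust
const MAX_DEPTH: usize = 45;

// Stand-in for the real type description.
enum ParamType {
    Unit,
    Struct(Vec<ParamType>),
}

// Every recursion level consumes a call frame; bailing out once `depth`
// exceeds the limit keeps a maliciously nested type from exhausting the stack.
fn decode(param_type: &ParamType, depth: usize) -> Result<(), String> {
    if depth > MAX_DEPTH {
        return Err(format!("type is nested deeper than {MAX_DEPTH} levels"));
    }
    match param_type {
        ParamType::Unit => Ok(()),
        ParamType::Struct(fields) => fields
            .iter()
            .try_for_each(|field| decode(field, depth + 1)),
    }
}

fn main() {
    // A type nested 100 levels deep trips the guard instead of the stack.
    let mut nested = ParamType::Unit;
    for _ in 0..100 {
        nested = ParamType::Struct(vec![nested]);
    }
    assert!(decode(&nested, 0).is_err());
}
```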

The second limit (`max_tokens`) protects against:

  • collections of ZSTs
  • structures with endless fields of ZSTs
  • basically any other combination we could conceive of.

We've set the defaults to 45 (levels of nesting) and 10k max tokens.

The user can tweak the decoder used for contract and script calls (see the sketch below). Tests have been added and the docs updated.
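A hedged sketch of the new configuration surface; the module path and field names are as I understand them from this PR, so double-check them against your fuels-rs version:

```rust
use fuels::core::codec::{ABIDecoder, DecoderConfig};

fn main() {
    // Uses the defaults mentioned above: 45 levels of nesting, 10k tokens.
    let _default_decoder = ABIDecoder::default();

    // Or set the limits explicitly for untrusted schemas.
    let _strict_decoder = ABIDecoder::new(DecoderConfig {
        max_depth: 5,
        max_tokens: 100,
    });
}
```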

Checklist

  • I have linked to any relevant issues.
  • I have updated the documentation.
  • I have added tests that prove my fix is effective or that my feature works.
  • I have added necessary labels.
  • I have done my best to ensure that my PR adheres to the Fuel Labs Code Review Standards.
  • I have requested a review from the relevant team or maintainers.

@segfault-magnet segfault-magnet added the bug (Something isn't working) and enhancement (New feature or request) labels Jul 27, 2023
@segfault-magnet segfault-magnet requested a review from Dentosal July 27, 2023 12:55
@segfault-magnet segfault-magnet self-assigned this Jul 27, 2023
Salka1988
Salka1988 previously approved these changes Jul 27, 2023
digorithm
digorithm previously approved these changes Jul 27, 2023
Member

@digorithm digorithm left a comment


Alright, so correct me if I'm wrong: this is solving the memory issue only for collections and not the other cases you mentioned in the description related to other ZSTs, right? If that's the case, the title should mention it, and the memory issue is only partially solved (maybe create a separate issue for the follow-up work).

As for handling the other ZSTs... banning them would probably not be a good approach. I'd rule this one out.

Defining a timeout mechanism that's mostly implicit and only explicit when the user needs it (e.g., they just experienced the OOM issue and need to act on it) sounds like a good idea.

Now... onto defining the limits, yeah... token-based seems weird. Not sure. Time-based seems like a safe bet. I might be off here, but this type of decoding seems like it's either very fast (e.g., by and large, less than a handful of seconds most of the time) or something ill-defined or malicious that's straight-up undesirable and would take more than, say, 30 seconds. This could cover the vast majority of cases.

Then, in case of slower hardware struggling to decode reasonable and valid data for whatever reason, this user will see a timeout error, and the error should direct the user to increase the configurable timeout limit. What do y'all think?

@segfault-magnet segfault-magnet changed the title from "bug: fix memory issue" to "bug: partial fix for memory issue" Jul 27, 2023
@segfault-magnet
Contributor Author

> Alright, so correct me if I'm wrong: this is solving the memory issue only for collections and not the other cases you mentioned in the description related to other ZSTs, right?

Right.

> If that's the case, the title should mention it, and the memory issue is only partially solved (maybe create a separate issue for the follow-up work).

I'll change the title. Will create an issue as soon as we agree on further steps.

> Defining a timeout mechanism that's mostly implicit and only explicit when the user needs it (e.g., they just experienced the OOM issue and need to act on it) sounds like a good idea.

> Now... onto defining the limits, yeah... token-based seems weird. Not sure. Time-based seems like a safe bet. I might be off here, but this type of decoding seems like it's either very fast (e.g., by and large, less than a handful of seconds most of the time) or something ill-defined or malicious that's straight-up undesirable and would take more than, say, 30 seconds. This could cover the vast majority of cases.

I didn't play around with it, but my thinking was that, due to our recursive approach to decoding, every nesting level will incur a call-frame penalty on the stack, along with any other data we may leave there. Some of the recursions might be tail-call optimized, but I'm not sure all are.

If the attack were in the form of a struct with a lot of fields, that wouldn't be a problem, since we'd be freeing and reserving stack space as we decode.

But if the attack were in the form of an enormously nested struct, each nesting level would take us deeper into the recursion. This might cause us to hit the thread's stack limit.

So limiting decoding by elapsed time might prove finicky: how fast the recursion burrows depends on how loaded the CPU is at the moment. The user might be happy with a limit chosen in normal times, only to be hit by an OOM if the attack happened while the CPU load was low and decoding therefore ran faster.

I'm not sure that untangling the recursion would help all that much either, since we need to keep state for every level.

I'll play around, try to recreate what this attack might look like, and get some numbers.

Ideally, we'd limit by time and memory used, but I'm not sure how practical this is to implement.

@digorithm
Member

Yeah, it sounds like we gotta do a spike to iron out these details.

@segfault-magnet segfault-magnet marked this pull request as draft August 2, 2023 17:49
@Dentosal
Member

Dentosal commented Aug 3, 2023

> The whole structure would be zero-sized and contain so many levels of nested zero-sized structs that we might run out of stack trying to decode it.

Typically you wouldn't run out of stack space, since the parsed structure would be placed on the heap. Then you could have a memory size limit, similar to what e.g. the regex crate does.
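For reference, roughly what that mechanism looks like in the regex crate (`size_limit` is a real `RegexBuilder` option; the numbers here are arbitrary):

```rust
use regex::RegexBuilder;

fn main() {
    // The builder refuses patterns whose compiled form would exceed the
    // configured memory budget, instead of blowing up at match time.
    let result = RegexBuilder::new(r"\w{1000}")
        .size_limit(100) // absurdly small budget, to show the failure mode
        .build();
    assert!(result.is_err());
}
```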

@segfault-magnet
Contributor Author

segfault-magnet commented Aug 3, 2023

> > The whole structure would be zero-sized and contain so many levels of nested zero-sized structs that we might run out of stack trying to decode it.

> Typically you wouldn't run out of stack space, since the parsed structure would be placed on the heap. Then you could have a memory size limit, similar to what e.g. the regex crate does.

The decoding process is recursive; I meant it in the sense that we'd recurse until there was no more stack left. I'm not sure all of our decoding recursions can be tail-call optimized.

I made the PR a draft. My current idea is not to go for a time check, but rather to let the user optionally configure additional security measures for analyzing a type before even attempting the decoding.

For example: "the type mustn't be deeper than N" or "structs/enums can have at most N fields/variants" and such.

That, along with the ZST collection limits, should provide enough configurability for users to decode from untrusted schemas.
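A toy version of one such pre-flight check (`ParamType` is a stand-in, and this exact API never shipped; the merged PR enforces its limits during decoding instead):

```rust
// Stand-in type description, not the fuels-rs ParamType.
enum ParamType {
    Unit,
    Struct(Vec<ParamType>),
}

// Reject a schema up front if any struct has more than `max_fields` fields.
fn fields_within_limit(param_type: &ParamType, max_fields: usize) -> bool {
    match param_type {
        ParamType::Unit => true,
        ParamType::Struct(fields) => {
            fields.len() <= max_fields
                && fields.iter().all(|f| fields_within_limit(f, max_fields))
        }
    }
}

fn main() {
    let wide = ParamType::Struct(vec![
        ParamType::Unit,
        ParamType::Unit,
        ParamType::Unit,
    ]);
    assert!(fields_within_limit(&wide, 10));
    assert!(!fields_within_limit(&wide, 2));
}
```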

P.S. Thanks for the regex link, I'll look at its implementation more closely.

@segfault-magnet segfault-magnet dismissed stale reviews from Salka1988 and digorithm via 53dac43 August 4, 2023 17:24
Br1ght0ne
Br1ght0ne previously approved these changes Sep 6, 2023
MujkicA
MujkicA previously approved these changes Sep 6, 2023
iqdecay
iqdecay previously approved these changes Sep 6, 2023
Contributor

@iqdecay iqdecay left a comment


LGTM!

examples/codec/src/lib.rs
packages/fuels-core/src/codec.rs
Member

@digorithm digorithm left a comment


I love the new encoding/decoding section; we badly needed that. I just left a couple of grammar nits to improve flow/clarity.

I'm still going through the actual implementation, but I'd like to send these suggestions now rather than later.

docs/src/codec/decoding.md
docs/src/codec/encoding.md
docs/src/codec/index.md
docs/src/codec/index.md
docs/src/codec/index.md
Member

@digorithm digorithm left a comment


Thanks for the patience; this was a big one. The new tests helped me a lot to understand these changes. Great job, dude!

@segfault-magnet segfault-magnet merged commit c145802 into master Sep 12, 2023
@segfault-magnet segfault-magnet deleted the feat/decoding_zero_sized_types branch September 12, 2023 20:36
iqdecay added a commit that referenced this pull request Feb 13, 2024
- This PR closes #1228 by adding an `EncoderConfig`, similar to what was done in #1066.



BREAKING CHANGE: 
- `Configurables` structs now need to be instantiated through a
`::new(encoder_config)` or `::default()` method.
- `Configurables::with_some_string_config(some_string)` methods now
return a `Result<Configurables>` instead of `Configurables`.
- `Predicates::encode_data` now returns a `Result<UnresolvedBytes>`
instead of `UnresolvedBytes`.
- `PredicateEncoder` structs must be instantiated through a
`::new(encoder_config)` or `::default()` method.
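A hedged sketch of the migration implied by the list above, with the types mocked locally so the snippet stands alone; real code would use the abigen-generated Configurables struct and the SDK's actual `EncoderConfig`:

```rust
// Mocked, hypothetical shapes: not the fuels-rs definitions.
#[derive(Default)]
struct EncoderConfig;

struct MyConfigurables {
    _encoder_config: EncoderConfig,
}

impl MyConfigurables {
    // Instantiation now goes through `new(encoder_config)` or `default()`.
    fn new(encoder_config: EncoderConfig) -> Self {
        Self { _encoder_config: encoder_config }
    }

    // Setters are now fallible, since encoding the value can fail.
    fn with_some_string_config(self, _value: String) -> Result<Self, String> {
        Ok(self)
    }
}

fn main() -> Result<(), String> {
    let _configurables = MyConfigurables::new(EncoderConfig::default())
        .with_some_string_config("hello".to_string())?;
    Ok(())
}
```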

---------

Co-authored-by: MujkicA <32431923+MujkicA@users.noreply.github.com>
Co-authored-by: Rodrigo Araújo <rod.dearaujo@gmail.com>
Co-authored-by: hal3e <git@hal3e.io>
Co-authored-by: Ahmed Sagdati <37515857+segfault-magnet@users.noreply.github.com>
Labels
audit-report (Related to the audit report) · bug (Something isn't working) · enhancement (New feature or request)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

TOB-FUEL-15: Out-of-memory when decoding certain ABI types
9 participants