Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Keep parameter values out IMemoryCache in RelationalCommandCache #34803

Merged
merged 3 commits into from
Oct 14, 2024

Conversation

yinzara
Copy link
Contributor

@yinzara yinzara commented Oct 1, 2024

Only store necessary info for parameter values RelationalCommandCacheKey to prevent memory leaks.

This a relatively serious memory leak defect and needs to be ported back to the release/8.0 branch as well.

Fixes #34028

  • I've read the guidelines for contributing and seen the walkthrough
  • I've posted a comment on an issue with a detailed description of how I am planning to contribute and got approval from a member of the team
  • The code builds and tests pass locally (also verified by our automated build checks)
  • Commit messages follow this format:
        Summary of the changes
        - Detail 1
        - Detail 2

        Fixes #bugnumber
  • Tests for the changes have been added (for bug fixes / features)
  • Code follows the same patterns and style as existing code in this repo

Copy link
Member

@roji roji left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @yinzara, this looks like a good fix - see minor comments below.

@yinzara yinzara force-pushed the bugfix/34028 branch 4 times, most recently from bc4f507 to 201dfe2 Compare October 3, 2024 23:11
@yinzara
Copy link
Contributor Author

yinzara commented Oct 3, 2024

All done now. Sorry about all the force pushes :)

Store only nullness and array lengths in struct form to prevent parameters memory leaks

Fixes dotnet#34028
@yinzara
Copy link
Contributor Author

yinzara commented Oct 4, 2024

All requested modifications have been complete. All checks have passed. Ready for merge when you are.

I know that this defect has technically existed for many prior versions of .NET.

Do we still port defects to the release/6.0 branch or is that branch now completely unmaintained?

Can I help in doing that or is this something only you guys do?

@roji
Copy link
Member

roji commented Oct 7, 2024

@yinzara the enumeration of the dictionary - to create the array from it - isn't guaranteed to have the same ordering every time (dictionary ordering is indeterminate), so the arrays could have ended up having the values in different positions.

I pushed a commit that goes back to using a simple dictionary for now, and cleans up various remaining things - please take a look and tell me if you see any issues. Regarding efficiency, this whole mechanism needs to be optimized: the parameter nullness should be a single efficient bitmap, along with an additional thing for the FromSql object arrays. But for now we're looking for a simple fix - one that could potentially be backported too - so I'm trying to minimize change and risk.

@roji
Copy link
Member

roji commented Oct 7, 2024

Do we still port defects to the release/6.0 branch or is that branch now completely unmaintained?
Can I help in doing that or is this something only you guys do?

6.0 is still in support, though only until November (support policy), and the bar is quite high. We'll discuss in the team if we want to backport this and to which exact versions; in any case, that's something we'll do on our side - thanks for the offer to help though.

@roji
Copy link
Member

roji commented Oct 7, 2024

@dotnet/efteam would appreciate a review here as well, as it's a possible servicing candidate and needs to be accurate...

@yinzara
Copy link
Contributor Author

yinzara commented Oct 7, 2024 via email

@roji
Copy link
Member

roji commented Oct 7, 2024

@yinzara my bad - sorry for the extra work. Yeah, I thought about doing sorting but decided to do the simplest/safest thing here for now (again we can optimize later).

@yinzara
Copy link
Contributor Author

yinzara commented Oct 7, 2024

If you still want it:

private readonly struct CommandCacheKey
        : IEquatable<CommandCacheKey>
    {
        private readonly Expression _queryExpression;
        private readonly (string, ParameterValueInfo)[] _parameterValues;

        internal CommandCacheKey(Expression queryExpression, IReadOnlyDictionary<string, object?> parameterValues) {
            _queryExpression = queryExpression;
            _parameterValues = new (string, ParameterValueInfo)[parameterValues.Count];
            var i = 0;
            foreach (var (key, value) in parameterValues)
            {
                _parameterValues[i++] = (key, new ParameterValueInfo(value));
            }
            Array.Sort(_parameterValues);
        }

        public override bool Equals(object? obj)
            => obj is CommandCacheKey commandCacheKey
                && Equals(commandCacheKey);

        public bool Equals(CommandCacheKey commandCacheKey)
        {
            // Intentionally reference equal, don't check internal components
            if (!ReferenceEquals(_queryExpression, commandCacheKey._queryExpression))
            {
                return false;
            }

            Check.DebugAssert(
                _parameterValues.Length == commandCacheKey._parameterValues.Length,
                "Parameter Count mismatch between identical expressions");

            for (var i = 0; i < _parameterValues.Length; i++)
            {
                var (thisKey, thisValue) = _parameterValues[i];
                var (otherKey, otherValue) = commandCacheKey._parameterValues[i];

                Check.DebugAssert(
                    thisKey == otherKey,
                    "Parameter Name mismatch between identical expressions");

                if (thisValue.IsNull != otherValue.IsNull)
                {
                    return false;
                }

                if (thisValue.ObjectArrayLength != otherValue.ObjectArrayLength)
                {
                    return false;
                }
            }

            return true;
        }

        public override int GetHashCode()
            => RuntimeHelpers.GetHashCode(_queryExpression);
    }

    // Note that we keep only the nullness of parameters (and array length for FromSql object arrays), and avoid referencing the actual parameter data (see #34028).
    private readonly struct ParameterValueInfo
    {
        public bool IsNull { get; }

        public int? ObjectArrayLength { get; }

        internal ParameterValueInfo(object? parameterValue)
        {
            IsNull = parameterValue == null;
            ObjectArrayLength = parameterValue is object[] arr ? arr.Length : null;
        }
    }

@roji
Copy link
Member

roji commented Oct 7, 2024

Not quite sure if doing Array.Sort() on an array of tuples is right... In any case, I'd rather leave things closer to the current implementation for now. The null-ness information really should just be a bitmap in any case.

@yinzara
Copy link
Contributor Author

yinzara commented Oct 7, 2024

I promise this is my last rebuttal :-) and feel free to not read it and just do what you want as I completely agree this is not my project. I won't respond again.

Since C# 7, ValueTuple(s) have implemented IComparable and if you read the code always compare each element using the default Comparer. Since we know that an IReadOnlyDictionary can only contain unique keys by definition, the first element will always be the only one necessary for sorting so we can know that the order of the comparison of keys will be consistent and predictable.

The only difference in the algorithm in my solution from the existing solution is the use of the DebugAssert which could throw an exception (if debug is enabled) or potentially return true if the lengths of the keys (or their names) don't match where in your solution and the original, it would return false from the Equality test (never throwing an exception if debug is enabled). My original proposed solution (now lost in force pushes) is exactly as above except I returned false instead of true by always testing the keys and lengths (without DebugAssert) and then using DebugFail, keeping with the original behavior. However, it was suggested I change it to a DebugAssert instead. I guess that could be a reason to revisit this later after a backport merge, but it's hard to say that's enough to justify it. I'm happy to restore that commit.

While I completely understand the goal is to minimize change for ease of portability, the entirety of the CommandCacheKey (and ParameterValueInfo) must be ported as a whole back as the entirety of the class has changed save a few choice lines and if you follow the commit history of this file back to 6.0 release, the entire file was reformatted so either solution should both be equally portable. Objectively your solution is less lines of code changed, I will completely agree. I just don't think it's more/less complex to migrate to older versions. I would hope the preferred outcome would be if we could avoid having to make any future changes for this issue while still having an ideal solution given the goals below.

The goals of the solution are in order of importance:

  1. To be the most efficient computationally as this code is central to everything EF does
  2. To use the least amount of memory (for the same reason)
  3. Be the least complex code-wise for ease of portability and long term maintenance

Assumptions

  • Object allocations are particularly computationally heavy and should be avoided if not necessary
  • Storing a value in a bitmap would require a byte array property which is both an object allocation and saves 3 bytes for every 4 elements in the bitmap
  • Storing the length of the FromSql object arrays would require at minimum 1 object allocation ( int[]) in the best case
  • To store the values as arrays and not dictionaries, we must somehow order our key iteration in a consistent manner for the keys provided in the dictionary (so probably alphabetically)

Current Proposed solution

There is only one property, a Dictionary.

The dictionary has two arrays in it - int[] ( size 4xN bytes) + Entry[] ( size 8*N + 16 ( reference to string key) + (1 [nullness bool] + 5 [nullable int, length of array]) * N + 4 int properties (4x4) and a ulong property (8) = 13N + 40 bytes )

2 object allocations approx 72+18N bytes

Construction of a CommandCacheKey is an Order N (to iterate over the keys) * N Log N (to insert in dict, though might be more or less, hard to say).

Equality test is an Order N (to iterate over the keys) * Log N (to lookup in the dict) operation.

Potentially Most Efficient Solution using BitMaps

We would have a byte[] for the nullness bitmap and a int?[] for the length of the arrays.

That's 2 object allocations (16 bytes * 2) plus N/4 (byte array) plus 5*N (int? array)

32+N/4+5*N bytes - basically half of your proposed solution

In this solution we would need to iterate over the keys of the IReadOnlyByteArray during the construction of the CommandCacheKey in alphabetical order. This will require at least one object allocation and some memory (though it will be reclaimed after construction). That operation would be at least an order N operation potentially NLog N I think in the worse case. It would then be another 2O(N) operation to fill the bitmap and int array. The code to do so will also be relatively complex compared to your or my solution.

So construction of CommandCache Key would be Order N*Log N + 2N with 3 object allocations

Equality tests would be 2*O(N)

My Proposed Solution

1 object allocation (the tuple array)

That's 1 object allocation (16 bytes) plus N (bool?) + 5N (int?)

So my solution is 3 bytes more for every 4 elements in the key so 3/4 * N bytes more. For the default configuration of the IMemoryCache of 10000 with a load factor of 10L for these entries means that my solution will be approximately 1k different in total memory consumption across the entire application. (I would consider that to be "negligible" IMHO)

Construction of a CommandCacheKey would be an N (iterate over the keys) + N (best case) Log N operation (worse case)

Equality tests would be 3O(N) or 2O(N), though the difference is mostly irrelevant.

Conclusions

  1. Construction costs between all solutions are pretty much the same
  2. My solution has only 1 object allocation while the bitmap has 3 and the current proposed has 2
  3. Memory costs of my solution is 1K worse than the bitmap and less than half of the current proposed solution
  4. Equality tests of bitmap and my solution are comparable while the current proposed solution is more computationally heavy
  5. Complexity of code changes is highest in the bitmap solution and lowest in the current proposed solution

So the only real difference between my solution and the bitmap solution is less than 1k of memory for the entire application in exchange for 2 unnecessary object allocations. As an engineer, I would definitely give up the 1k for the avoidance of the object allocations.

That means I would say the bitmap solutions isn't actually ideal at all and my solution is the most ideal. It has nearly the lowest memory footprint, the lowest computational overhead, and is still of a low code complexity.

@roji
Copy link
Member

roji commented Oct 12, 2024

@yinzara thanks for engaging - that's quite a lot of thinking/analysis on this topic...

While I completely understand the goal is to minimize change for ease of portability, the entirety of the CommandCacheKey (and ParameterValueInfo) must be ported as a whole back as the entirety of the class has changed save a few choice lines and if you follow the commit history of this file back to 6.0 release, the entire file was reformatted so either solution should both be equally portable. Objectively your solution is less lines of code changed, I will completely agree. I just don't think it's more/less complex to migrate to older versions. I would hope the preferred outcome would be if we could avoid having to make any future changes for this issue while still having an ideal solution given the goals below.

Well, formatting changes aren't a big deal here; and in any case, It's not so much about the difficulty of backporting itself, as the size of the change we're backporting. Your proposal simply changes the code more significantly (switching away from a dictionary to a sorted array), so it's a bit riskier.

I won't go into all the specific size/length calculations, as I think you've made some assumptions here and overlooked some optimizations... If we really want to optimize this, we would use BitVector32 when the number of parameters is <= 32 (virtually all cases) - note that it's a value type, zero allocations - and fall back to BitArray for more than 32 parameters. The array lengths would be nullable, so that queries without an object[] parameter (again, that's the vast majority of queries) wouldn't have anything there. In other words, the very common case (<= 32 parameters, no object[]) would have zero heap allocations.

Further, from a CPU perspective, in the common case we'd be comparing BitVector32, which is equivalent to comparing two uints (as efficient as one can get). Even for the other cases, when comparing simple arrays, looping over an array and performing two equalities as you're doing above is unlikely to be efficient; doing a single memcmp on a memory block is likely to be far better. This can be done in .NET via Span.SequenceEquals - see https://stackoverflow.com/questions/43289/comparing-two-byte-arrays-in-net/48599119#48599119.

To summarize, it's really worth distinguishing between two things: (1) fixing the current bug (and backporting it), and (2) optimizing the cache key. Trying to do both at the same time is riskier, and we really try to prioritize the stability of existing versions, so we're very conservative with the changes we patch. The solution you're proposing above doesn't optimize things as much as it could, and at the same time represents more of a departure from the current implementation than is necessary (and therefore more risk).

I'll keep this open in case you have any additional comments (those are always welcome!), and also because we need to decide whether we'll be backporting this (I'll discuss this with the team tomorrow). Of course, it's fine for you (or anyone) to have a different opinion on the above! In fact, I'd be happy to accept a PR from you later which does the additional optimization pass

@yinzara
Copy link
Contributor Author

yinzara commented Oct 12, 2024

I very much appreciate the thoughtful reply. Makes me want to continue to contribute.

@roji roji dismissed their stale review October 14, 2024 20:41

No longer relevant

@roji roji merged commit af420cd into dotnet:main Oct 14, 2024
7 checks passed
roji pushed a commit to roji/efcore that referenced this pull request Oct 14, 2024
…net#34803)

Store only nullness and array lengths in struct form to prevent parameters memory leaks

Fixes dotnet#34028

Co-authored-by: Shay Rojansky <roji@roji.org>
(cherry picked from commit af420cd)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

User parameter data is referenced inside RelationalCommandCache
4 participants