-
Notifications
You must be signed in to change notification settings - Fork 4.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consider clearing JsonSerializerOptions.Caching cache using a timer, not just based on incoming calls #76548
Comments
Tagging subscribers to this area: @dotnet/area-system-text-json, @gregsdennis Issue DetailsFirst, it's totally possible I'm misinterpreting what STJ does here. We got this report saying that a Blazor Server app was holding hundreds of MB in memory for longer than expected, and using the VS memory inspection tools, it looks like at least one (possibly the only) thing pinning this data is rooted in It looks like the only thing that clears dangling entries is subsequent calls into the JSON serializer. This is likely sufficient in server scenarios where the server will never stop processing requests for any long duration, and even if it does, people probably don't care about freeing any memory until the server experiences memory pressure. However, for desktop scenarios, an app may stop doing things for a long time, and during that period, people really want it to release memory that's no longer required. SuggestionInstead of triggering This isn't blocking anything in Blazor directly, but would be a good solution for people using STJ directly in desktop apps, or Blazor Server as an implementation detail of a desktop app (like this person).
|
Also, if this caching behavior was introduced in 7.0, then it could be argued that it's a regression. (Subjective, I know.) |
Is the actual problem the shortcut described in https://github.com/dotnet/runtime/blob/main/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/JsonSerializerOptions.Caching.cs#L165-L167? Should this code rather use ConditionalWeakTable to avoid unnecessarily keeping the keys around? |
Thanks @SteveSandersonMS, your analysis is accurate. At the time when the cache was implemented, we debated as to whether there should be a timer that ran evictions in the background but ultimately found that approach to be unsavory. In hindsight, I had not anticipated that user configuration in That being said, it is almost certain that the application in question is creating single-use
I think we could definitely improve the cache based on this feedback but I don't necessarily think this requires a .NET 7 fix. I think we should wait on more user feedback in case it arrives.
Original prototypes did use ConditionalWeakTable, effectively performing a linear search over all runtime/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/JsonSerializerOptions.cs Lines 133 to 141 in f96723a
But IIRC enumeration performance for the particular type was bad enough that it almost invalidated any performance benefits gained from caching. |
I think the original author was using an unmodified (except for the ability to manually trigger GC) Blazor template to demonstrate the problem. |
Hey @eiriktsarpalis Just in case you couldn't read the complete original issue (I'm the author) and just for clarification, the application in question which you are referring to is the Blazor framework itself using the default Blazor Server template. Regards! |
Regarding what @SteveSandersonMS mentioned about a timer, you can configure the retention period of circuits for Blazor, for example:
So maybe this should be tied somehow with the serializer cache clearing call, I guess it wouldn't make much sense to configure different amounts of time for each since one depends on the other to be useful. Regards! |
Hi @chrdlx, that's interesting. If there are indeed keys in the cache that require eviction that would suggest to me that Blazor is using temporary options instances, which should be avoided under most circumstances. I'm not too familiar with Blazor internals however, is there anything in its configuration that could prompt single-use instances to be created? Perhaps some options instance only used for initialized purposes that then gets garbage collected? cc @SteveSandersonMS |
Hi @eiriktsarpalis ! I guess Steve is the right one for this kind of stuff. I use Blazor to develop Apps but sadly I don't have that deep knowledge about its inner workings to give you a proper answer on how to solve this. Regards! |
For context, here are some benchmarks comparing the current caching strategy with one performing linear traversal on a ConditionalWeakTable: public static CachingContext GetOrCreate(JsonSerializerOptions options)
{
Debug.Assert(options.IsReadOnly, "Cannot create caching contexts for mutable JsonSerializerOptions instances");
Debug.Assert(options._typeInfoResolver != null);
OptionsEqualityComparer comparer = s_equalityComparer;
foreach (KeyValuePair<JsonSerializerOptions, object?> entry in TrackedOptionsInstances.All)
{
JsonSerializerOptions otherOptions = entry.Key;
if (otherOptions._cachingContext != null && comparer.Equals(options, otherOptions))
{
return otherOptions._cachingContext;
}
}
return new CachingContext(options);
}
|
@eiriktsarpalis Blazor has a I don't think we were aware that, at a certain point, it became expensive to create If you have a recommendation for a different way that a JsonConverter can access contextual objects specific to a particular seralization process, please let us know. |
Stephen Toub's 7.0 performance behemoth had some stuff about this: https://devblogs.microsoft.com/dotnet/performance_improvements_in_net_7/#json It wouldn't have been intuitive to me that an "Options" class would be massively expensive. Obviously that article contains the somewhat optimistic promise "... with appropriate removal when no longer used in order to avoid unbounded leaks" which overlooks the old saw about cache eviction being the hardest problem in computer science. |
It's been like this since the introduction of the type -- it's essentially the same issue as |
What do you recommend as a solution? It does seem like the 7.0 caching technique change has led to an issue for non-server scenarios. Is there (or should there be) some way to signal that a particular To be honest, I don't think we'd see this as being urgent enough to fix for 7.0 within Blazor since we only have evidence that it affects one person, and they already have a workaround. What we could do in Blazor in 8.0 though is add some logic to our circuit disposal code that makes it null out the reference from the JsonConverterFactory to the circuit, so the circuits at least can be collected even if there are some JsonSerializerOptions/JsonConverterFactory instances left stuck in the cache until more calls happen. Since this could affect other (non-Blazor) desktop applications that use STJ, it might be worth having a more general solution such as making JsonSerializerOptions disposable, or clearing the cache on a timer, or going back to ConditionalWeakTable. If you think any of those are a possibility please let us know so we could make use of that in 8.0. |
I'm curious, what prompted requiring separate I know that System.Text.Json doesn't provide a good mechanism for scoping DI on a per-operation basis, here's a user story that attempts to cover this and related requirements: #63795. |
This is what I mentioned above:
Apologies if it's still unclear, but if you can clarify which part of this is unclear I'll try to clarify more!
That would be great, but don't know of a way 😄 It's what I was asking with "If you have a recommendation for a different way that a JsonConverter can access contextual objects specific to a particular seralization process, please let us know." #63795 sounds interesting but async conversion isn't an aspect of it for us; we just have to be able to perform a JsonSerializer.Serialize call within some context that allows the converters to access objects within that context (which we achieve today by having a separate JsonConverterFactory for each context). |
For the case of serialization, state could be encapsulated by the serialized value. I'm not aware of a good way to achieve the same effect in deserialization, unfortunately. |
It does avoid unbounded leaks... unbounded being the key word. |
Hi. I guess this behaviour cannot be avoided in 7.0 release? |
We might consider backporting #76607 in servicing, assuming there is substantial impact in blazor apps. |
First, it's totally possible I'm misinterpreting what STJ does here. We got this report saying that a Blazor Server app was holding hundreds of MB in memory for longer than expected, and using the VS memory inspection tools, it looks like at least one (possibly the only) thing pinning this data is rooted in
JsonSerializerOptions
'ss_cache
. The cache usesWeakReference
for values, but the problem is with the keys.It looks like the only thing that clears dangling entries is subsequent calls into the JSON serializer. This is likely sufficient in server scenarios where the server will never stop processing requests for any long duration, and even if it does, people probably don't care about freeing any memory until the server experiences memory pressure. However, for desktop scenarios, an app may stop doing things for a long time, and during that period, people really want it to release memory that's no longer required.
Suggestion
Instead of triggering
TryEvictDanglingEntries
only when there's a call toGetCachingContext
, consider also having some guarantee like "it will always clear dangling entries within 10s even if there are no such incoming calls" or similar.This isn't blocking anything in Blazor directly, but would be a good solution for people using STJ directly in desktop apps, or Blazor Server as an implementation detail of a desktop app (like this person).
The text was updated successfully, but these errors were encountered: