Don't cache canonicalize calls when `containingUrl` is available #2215

nex3 · 2024-04-09T23:13:32Z

nex3 · 2024-04-09T23:13:55Z

ntkme · 2024-04-09T23:51:04Z

lib/src/async_import_cache.dart

-        if (await _canonicalize(importer, url, baseUrl, forImport)
-            case var result?) {
+    var key = (url, forImport: forImport);
+    if (_canonicalizeCache.containsKey(key)) return _canonicalizeCache[key];


I can think of an edge case with two importers [importerA, importerB]:

Resolve an import x, with containing url u:y. importerA accessed containingUrl, but could not resolve it, then importerB resolved it without using containingUrl thus get cached.

Resolve an import x, with containing url file://a.scss. importerA would have been able to resolve it because now this has a different containingUrl that importerA can handle, however, because of the previous cache from importerB, importerA would not even get attempted.

I think this would be unexpected, because importerA is before importerB.

In this case, _canonicalize() would return (null, false) for importerA because containingUrl is passed, so cacheable will be set to false on line 188 and importerB's result won't be cached even if it would be cacheable in isolation (since that case is guarded by when cacheable).

Sorry, I forgot this is the first iteration that cacheable is based on whether it's passed or not, instead of whether it's used or not.

The logic should work the same if it just checks access—we'd still return (null, true) in the case where importerA accessed containingUrl but couldn't resolve the import.

I think this works correctly.

If I read it correctly, any non-cacheable importer will make all importers after it non-cacheable for the current import. And therefore if a FilesystemImporter is low priority it would not be cached and cost can be high? If that’s the case it might still be worth to have per importer cache key instead of single cache key for all importers.

If we have [nonCacheableImporter, loadPathImporter], with the implementation in this PR the loadPathImporter would not be cached, and repeatedly loading the same file from load path can get slower due to repeated I/O calls.

Thus I think it's worth to make the trade-off to use a little bit more memory to cache per importer (and even cache failed canonicalization if cacheable), and a bit more CPU to re-check cache per importer, like the pseudo code below:

for (var importer in importers) { var key = (url, forImport: forImport, importer: importer); if (_canonicalizeCache.containsKey(key)) { var cached = _canonicalizeCache[key]; if (cached != null) return cached; } else { var (result, cacheable) = await _canonicalize(importer, url, baseUrl, forImport); if (cacheable) { _canonicalizeCache[key] = result; } if (result != null) return result; } } return null;

The cost of CPU/memory overhead would likely be lower than the I/O overhead.

I don't think it's crazy to have a per-importer canonicalize cache in addition to the current whole-load-path cache, although I think we might want to be a bit more sophisticated about only filling it if we run into an uncacheable load in practice. Either way, let's save that for a follow-up after we fix the initial bug.

Goodwine · 2024-04-11T18:36:48Z

lib/src/async_import_cache.dart

+    if (_canonicalizeCache.containsKey(key)) return _canonicalizeCache[key];
+
+    var cacheable = true;
+    for (var importer in _importers) {


Nit: I was able to follow the caching logic, but it wasn't very obvious at the beginning. It would be great if the caching strategy was documented, but at the same time the documenting the strategy also feels like documenting an implementation detail so maybe it doesn't have to be documented at all 🤔.

Added a comment to explain in more detail

Inspired by comments on #2215

) Inspired by comments on #2215

nex3 requested a review from Goodwine April 9, 2024 23:13

Don't cache canonicalize calls when containingUrl is available

6021d0d

See #2208

nex3 force-pushed the canonicalize-cache branch from c97ef5e to 6021d0d Compare April 9, 2024 23:21

ntkme reviewed Apr 9, 2024

View reviewed changes

ntkme approved these changes Apr 10, 2024

View reviewed changes

Goodwine approved these changes Apr 11, 2024

View reviewed changes

nex3 added 2 commits April 11, 2024 15:10

Merge remote-tracking branch 'origin/main' into canonicalize-cache

d2c9bf7

Comment caching logic

67c62fa

nex3 merged commit 821b98e into main Apr 11, 2024
34 checks passed

nex3 deleted the canonicalize-cache branch April 11, 2024 22:51

nex3 added a commit that referenced this pull request Apr 12, 2024

Add a per-importer cache for loads that aren't cacheable en masse

015856c

Inspired by comments on #2215

nex3 mentioned this pull request Apr 12, 2024

Add a per-importer cache for loads that aren't cacheable en masse #2219

Merged

nex3 added a commit that referenced this pull request Apr 12, 2024

Add a per-importer cache for loads that aren't cacheable en masse

7f4c817

Inspired by comments on #2215

nex3 added a commit that referenced this pull request Apr 18, 2024

Add a per-importer cache for loads that aren't cacheable en masse (#2219

b97f26f

) Inspired by comments on #2215

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't cache canonicalize calls when `containingUrl` is available #2215

Don't cache canonicalize calls when `containingUrl` is available #2215

nex3 commented Apr 9, 2024

nex3 commented Apr 9, 2024

ntkme Apr 9, 2024 •

edited

Loading

nex3 Apr 9, 2024 •

edited

Loading

ntkme Apr 10, 2024

nex3 Apr 10, 2024

ntkme Apr 10, 2024 •

edited

Loading

ntkme Apr 10, 2024

nex3 Apr 10, 2024

Goodwine Apr 11, 2024

nex3 Apr 11, 2024

Don't cache canonicalize calls when containingUrl is available #2215

Don't cache canonicalize calls when containingUrl is available #2215

Conversation

nex3 commented Apr 9, 2024

nex3 commented Apr 9, 2024

ntkme Apr 9, 2024 • edited Loading

Choose a reason for hiding this comment

nex3 Apr 9, 2024 • edited Loading

Choose a reason for hiding this comment

ntkme Apr 10, 2024

Choose a reason for hiding this comment

nex3 Apr 10, 2024

Choose a reason for hiding this comment

ntkme Apr 10, 2024 • edited Loading

Choose a reason for hiding this comment

ntkme Apr 10, 2024

Choose a reason for hiding this comment

nex3 Apr 10, 2024

Choose a reason for hiding this comment

Goodwine Apr 11, 2024

Choose a reason for hiding this comment

nex3 Apr 11, 2024

Choose a reason for hiding this comment

Don't cache canonicalize calls when `containingUrl` is available #2215

Don't cache canonicalize calls when `containingUrl` is available #2215

ntkme Apr 9, 2024 •

edited

Loading

nex3 Apr 9, 2024 •

edited

Loading

ntkme Apr 10, 2024 •

edited

Loading