Use a combined inputs hash to check if we need to run a build #606

jakemac53 · 2017-11-14T17:22:20Z

This was more nuanced than I expected, and I ended up needing to clean up a few other things in the process. Highlights are as follows:

Changed summaries to semantic summaries (we were actually relying on the full ones before, whoops!)
Fixed some behavior around replacing SyntheticAssetNodes with real ones. Previously we would remove all the outputs from the graph and clean up their inputs sets which wasn't correct (we want to retain all those edges and nodes, and just swap out the node in question).
When invalidating nodes, we no longer pre-emptively delete them. In fact, I never explicitly delete them in this cl I just rely on the asset writer to overwrite them (unless their primary input is deleted).
The inputs set for GeneratedAssetNodes is now ordered to ensure the combined md5 hash will always be the same for the same inputs.

Fixes #371

…or all outputs of an action

natebosch · 2017-11-14T17:26:14Z

build_compilers/lib/src/summary_builder.dart

@@ -76,7 +76,7 @@ Future createUnlinkedSummary(Module module, BuildStep buildStep,
  request.arguments.addAll([
    '--build-summary-only',
    '--build-summary-only-unlinked',
-    '--build-summary-output=${summaryOutputFile.path}',
+    '--build-summary-output-semantic=${summaryOutputFile.path}',


At least a few of these changes feel like they could be justified in isolation and should be relatively easy to move to a new PR.

Can we submit this one separately?

Do we get a speedup w/ this?

I will break this out into its own cl and test in isolation, its mostly about invalidating later build steps but it should also improve the time to read and write summaries

#609, no noticeable build speedup from this alone unfortunately.

natebosch · 2017-11-14T17:28:08Z

build_runner/lib/src/asset_graph/graph.dart

@@ -66,7 +66,9 @@ class AssetGraph {
    var existing = get(node.id);
    if (existing != null) {
      if (existing is SyntheticAssetNode) {
-        _remove(existing.id);
+        // Don't call _remove, that transitively removes primary outputs. We


can we rename to _removeSubtree or something like that to make this clear?

renamed to _removeRecursive.

natebosch · 2017-11-14T17:32:19Z

build_runner/lib/src/asset_graph/node.dart

+
+  /// A digest combining all digests of all previous inputs.
+  ///
+  /// This is used to determine if a node really needs to be output or not.


[optional] I might rephrase this, the Builder still decides whether to output an asset, we decide whether to run the build.

"Used to determine whether all the inputs to a build step are identical to the previous run indicating that the previous output is still valid."

natebosch · 2017-11-14T17:38:17Z

build_runner/lib/src/generate/build_impl.dart

+    if (!await _buildShouldRun(builderOutputs, wrappedReader)) {
+      return <AssetId>[];
+    }
+    // We may have read some inputs in the call to `_buildShouldRun`, we want


should we not be going through the wrapped reader if we don't want to track this?

Or is that the only way for the lazy assets to work?

Ya, we do this because of the lazy assets and also the canRead implementation which takes into account phase numbers.

natebosch · 2017-11-14T17:39:40Z

build_runner/lib/src/generate/build_impl.dart

@@ -443,6 +515,10 @@ class BuildImpl {
        assert(inputNode != null, 'Asset Graph is missing $input');
        inputNode.outputs.add(output);
      }
+
+      // And finally compute the combined digest for all inputs.


some of these comments are "what" with no "why" - should we be refactoring to make the code easier to understand?

broke out some parts of this into appropriately named functions so its easier to follow

natebosch

As discussed offline - the fact that this change requires touching so many places, especially in subtle ways, points to it not having a cohesive implementation.

I think in the future there will be room to shift more of the logic explicitly into the AssetGraph or a parallel class rather that sit across both the AssetGraph and build_impl.

I think we can get it in as is and refactor later. Looks like test coverage is good enough to have confidence in it.

jakemac53 added 12 commits November 9, 2017 06:59

checkpoint

6ad3c24

temp hacks

2dc32a2

Merge branch 'master' into inputs-hashes

3352f45

dont pre-emptively delete generated sources

e0d91f1

Merge branch 'master' into inputs-hashes

4e2df37

use SplayTreeSet to ensure ordering, only compute input hashes once f…

9fb5f15

…or all outputs of an action

use semantic summaries and supported apis from package:crypto

b1be27e

update the asset graph test

00a7eb2

get the watch/build tests passing

1eaddf3

Merge branch 'master' into inputs-hashes

fa52e61

add unit test, fix bug

76c40e7

optimize previous digest checking, comment cleanup

82e47e0

jakemac53 added the type-enhancement A request for a change that isn't a bug label Nov 14, 2017

jakemac53 added this to the M0: Replace `pub serve` milestone Nov 14, 2017

jakemac53 assigned natebosch Nov 14, 2017

jakemac53 requested a review from natebosch November 14, 2017 17:22

googlebot added the cla: yes Google is happy with the PR contributors label Nov 14, 2017

natebosch reviewed Nov 14, 2017

View reviewed changes

jakemac53 mentioned this pull request Nov 14, 2017

use semantic output summaries #609

Merged

jakemac53 added 3 commits November 14, 2017 11:23

Merge branch 'master' into inputs-hashes

ec494b9

rename _remove to _removeRecursive

2d1de88

clean up _setOutputsState

8618c2c

natebosch approved these changes Nov 14, 2017

View reviewed changes

jakemac53 merged commit 94d034b into master Nov 14, 2017

jakemac53 deleted the inputs-hashes branch November 14, 2017 20:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a combined inputs hash to check if we need to run a build #606

Use a combined inputs hash to check if we need to run a build #606

jakemac53 commented Nov 14, 2017

natebosch Nov 14, 2017

kevmoo Nov 14, 2017

jakemac53 Nov 14, 2017

jakemac53 Nov 14, 2017

natebosch Nov 14, 2017

jakemac53 Nov 14, 2017

natebosch Nov 14, 2017

jakemac53 Nov 14, 2017

natebosch Nov 14, 2017

jakemac53 Nov 14, 2017

natebosch Nov 14, 2017

jakemac53 Nov 14, 2017

natebosch left a comment

Use a combined inputs hash to check if we need to run a build #606

Use a combined inputs hash to check if we need to run a build #606

Conversation

jakemac53 commented Nov 14, 2017

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

natebosch left a comment

Choose a reason for hiding this comment