Add triggersRerun setting to control fingerprint inheritance #238

aomarks · 2022-05-16T00:57:17Z

Background

Currently, if script A depends on script B, then script B's fingerprint is automatically included in script A's fingerprint. That means if an input file for script B changes, which causes B to re-run, then script A will re-run too — regardless of whether script B actually emitted different output.

This is a good and safe default because it means you aren't required to specify the input files for every script when those input files are generated by a dependency; Wireit assumes that any time script B runs, script A could be affected. However, it comes with the trade-off that scripts will sometimes re-run even when none of their input files changed.

This PR

This PR adds a "triggersRerun": false setting that can be annotated on dependencies. This prevents the fingerprint from being inherited, so running B won't necessarily cause A to run as well.

This can be used for more optimal builds via fewer re-runs, but requires that input files are always fully specified.

This will also be very useful for service mode, because a triggersRerun:false dependency won't require a restart across watch iterations. (Restarts will happen only if the fingerprint is different between iterations). For example, the script that builds the service itself can be a regular dependency (causing restarts), but the script that builds the assets served by the script can be a triggersRerun:false dependency (not causing restarts).

Note in order to have a place to put this annotation, dependencies can now be objects. They can still be plain strings as well, but must be objects to receive annotations. (This also opens the door for other annotations on dependency edges we may want in the future. E.g. #23 is probably better expressed as a "workspaces": true annotation instead of a magic $WORKSPACES: prefix as previously planned).

Fixes #237

Example

In this example, bundle has a triggersRerune:false dependency on build. Importantly, it also includes lib/**/*.js in its files array, which are the specific outputs from tsc that rollup consumes. Including these input files wasn't necessary before, but with triggersRerun:false it is now critical.

The advantage of this configuration is that if tsc re-runs but doesn't produce different .js files (for example, if it only produced different .d.ts files), then rollup won't need to re-run. We've also excluded the lib/test directory, because we know test files aren't included in our bundles.

{
  "scripts": {
    "build": "wireit",
    "bundle": "wireit"
  },
  "wireit": {
    "build": {
      "command": "tsc",
      "files": ["src/**/*.ts", "tsconfig.json"],
      "output": ["lib/**"]
    },
    "bundle": {
      "command": "rollup -c",
      "dependencies": [
        {
          "script": "build",
          "triggersRerun": false
        }
      ],
      "files": ["rollup.config.json", "lib/**/*.js", "!lib/test"],
      "output": ["dist/bundle.js"]
    }
  }
}

… 60 seconds

src/analyzer.ts

augustjk · 2022-05-16T20:41:04Z

src/analyzer.ts

+        }
+      } else if (maybeUnresolved.type === 'object') {
+        specifierResult = findNodeAtLocation(maybeUnresolved, ['script']);
+        if (specifierResult == null) {


Assuming this is specifically == to catch both undefined and null?

I copied this style from the other code @rictic wrote in this file. I have been in the habit of using === undefined when we know we don't need to handle null (a habit I learned from @justinfagnani) -- but maybe @rictic has a reason to use == null here?

I use == null to signify that either we do need to handle both (good), or that I'm not sure (bad - and I leave a TODO).

Switched everything to check === undefined when the type indicated that it could not be null (which is almost everything).

rictic

Please add a jump to definition and a codeaction test in an object dependency

rictic · 2022-05-17T14:25:47Z

schema.json

@@ -14,33 +14,57 @@
          },
          "command": {
            "markdownDescription": "The command to run.\n\nThis is a shell command that will be executed, with all binaries from npm dependencies and devDependencies available.\n\nFor example:\n\n```json\n\"command\": \"tsc\"\n```\n\nFor more info, see https://docs.npmjs.com/cli/v8/using-npm/scripts#environment",
-            "type": "string"
+            "type": "string",
+            "minLength": 1


rictic · 2022-05-17T14:47:44Z

schema.json

+                      "minLength": 1
+                    },
+                    "soft": {
+                      "markdownDescription": "If `false` (the default), the cache key of the dependency is automatically included in this script's cache key. This means that whenever the dependency re-runs, this script will re-run too, even if the output produced by the dependency didn't change.\n\nIf `true`, Wireit won't assume that this script needs to re-run just because the dependency re-ran. Instead, the dependency will still run first and be kept up-to-date, but whether this script runs is entirely determined by `files`. Be sure that any input files this script needs from the dependency are specified in `files`.\n\nFor more info, see https://github.com/google/wireit#soft-dependencies",


The docs on hover should lead with the operationally useful information first, then give more background in subsequent paragraphs, or link to the docs site.

What do you think about this for an opening paragraph:

When `false` (the default), whenever this dependency runs, this script (the dependent) will be marked stale and need to re-run too. When soft is `true` Wireit won't assume that the dependent is stale just because the dependency ran. This can reduce unnecessary re-building when `files` captures all of the relevant output of the dependency.

Done, I like the way this is phrased. Also updated the README docs more along these lines. PTAL.

rictic · 2022-05-17T14:47:57Z

schema.json

+                    },
+                    "soft": {
+                      "markdownDescription": "If `false` (the default), the cache key of the dependency is automatically included in this script's cache key. This means that whenever the dependency re-runs, this script will re-run too, even if the output produced by the dependency didn't change.\n\nIf `true`, Wireit won't assume that this script needs to re-run just because the dependency re-ran. Instead, the dependency will still run first and be kept up-to-date, but whether this script runs is entirely determined by `files`. Be sure that any input files this script needs from the dependency are specified in `files`.\n\nFor more info, see https://github.com/google/wireit#soft-dependencies",
+                      "enum": [true, false]


any reason to prefer enum over boolean? IDK either way, just curious

No reason, I just copied it from the "enum": [true, false, "if-file-deleted"] that was already here. boolean makes sense though.

src/analyzer.ts

rictic · 2022-05-17T14:54:06Z

src/script.ts

@@ -31,7 +31,8 @@ export interface ScriptReference extends PackageReference {

 export interface Dependency<Config extends PotentiallyValidScriptConfig> {
  config: Config;
-  astNode: JsonAstNode<string>;
+  specifier: JsonAstNode<string>;


The term specifier is much clearer, +1

rictic · 2022-05-17T14:54:58Z

src/executor.ts

@@ -310,7 +310,7 @@ class ScriptExecution {
  }

  async #executeScript(
-    dependencyStates: Array<[ScriptReference, ScriptState]>
+    dependencyStates: Array<[Dependency<ScriptConfig>, ScriptState]>


On reflection, ScriptConfig should be the default value of the generic parameter to Dependency, and only potentially invalid dependencies should be specially marked

aomarks · 2022-05-17T15:13:45Z

In a discussion @rictic and I had yesterday, we mulled over names that might be better than "soft" for this feature.

We considered some names that tried to encode more meaning, things like:

output-in-input-files: true
inherits-fingerprint: false
always-runs: false

But we had trouble thinking of one that felt really clear. One way these names can be confusing is that it can be ambiguous what the directionality is (e.g. is it the dependency that always runs, or the dependee?)

I think we agreed that there might actually be a benefit to using a fuzzy term like "soft", because it basically requires the user to look at the actual definition (in the docs or by hovering if they have the vscode extension), rather than giving them a false impression. Hopefully it is then memorable.

@rictic suggested "weak" as an alternative to "soft". I prefer "weak" as well.

Any other thoughts from reviewers here?

aomarks · 2022-05-17T16:04:31Z

Please add a jump to definition and a codeaction test in an object dependency

Added a test, but the originSelection squiggles are off by 2 spaces, and I haven't figured out why. Does anything jump out to you as to why, @rictic (see latest commit)?

2 spaces matches the indentation level, so I'm guessing it's something to do with finding the range for the parent object, instead of the string itself. But I can't see why, because we're getting the range for dependency.specifier, which is the AST node for the string, regardless of whether it was inside an object.

rictic · 2022-05-17T17:18:45Z

src/test/ide.test.ts

+    ~~~`,
+      // TODO(aomarks) The ~~~ is 2 spaces ahead of where it should be.
+      originSelection: `
+          "script": "b"


Oh, I bet I know what this is. The comparison that we use ignores leading and trailing whitespace. It probably should just ignore leading newlines. I bet if you add a couple of spaces to this line it will fix it

Oh yeah that was it, done.

rictic · 2022-05-17T17:19:54Z

We might want to release a new version of the extension before we release wireit, so that by the time people update wireit they'll probably already have the version extension that supports object deps

rictic · 2022-05-17T17:42:37Z

Ultimately, I think that soft is a little better than weak. Weak in the sense of WeakMap or WeakRef suggests to me that the dependency might not run, rather than the dependent.

Soft doesn't have that association so I think it's slightly better.

augustjk · 2022-05-17T18:15:13Z

I like soft over weak for reasons @rictic mentioned above.

Something that occurred to me though, would there be any benefit to having a script level option that treats all of its dependency as soft? That would be something like always-use-files-for-freshness or file-only-fingerprint kind of thing?

Since putting even just one dependency that is soft means the user must be sure to specify a comprehensive files input for the script, I wonder if it makes more sense that it's at the script level, rather than per dependency.

aomarks · 2022-05-17T18:24:25Z

Something that occurred to me though, would there be any benefit to having a script level option that treats all of its dependency as soft? That would be something like always-use-files-for-freshness or file-only-fingerprint kind of thing?

Since putting even just one dependency that is soft means the user must be sure to specify a comprehensive files input for the script, I wonder if it makes more sense that it's at the script level, rather than per dependency.

If a dependency is soft, you only need to be sure to specify the input files you care about from that one dependency. And there are legitimate cases where you want some dependencies soft, and some not.

I think allowing you to set it for a whole script could be a little dangerous, because you might set that flag, and then later add a dependency without actively considering whether it's safe for that dependency to be soft. With it being explicit per dependency, you're more likely to think about it on a per-dependency basis.

justinfagnani · 2022-05-17T20:21:06Z

src/analyzer.ts

+        }
+      } else if (maybeUnresolved.type === 'object') {
+        specifierResult = findNodeAtLocation(maybeUnresolved, ['script']);
+        if (specifierResult == null) {


I use == null to signify that either we do need to handle both (good), or that I'm not sure (bad - and I leave a TODO).

justinfagnani · 2022-05-17T20:22:33Z

src/executor.ts

  ): Promise<ScriptState> {
    let allDependenciesAreCacheable = true;
    const filteredDependencyStates: Array<
      [ScriptReferenceString, ScriptState]
    > = [];
    for (const [dep, depState] of dependencyStates) {
+      if (dep.soft) {


aomarks · 2022-05-17T23:25:23Z

I had a discussion with @justinfagnani about this change, and I've written up a draft proposal for an alternative way to think about achieving the same ends, but in a way that might be more intuitive and useful: #245

aomarks · 2022-11-06T19:52:27Z

This PR is ready for another review.

I've merged in the new services changes. For services, this setting controls whether the service needs to restart or not in watch mode when that particular dependency changes.
Added tests and documentation about how this setting interacts with services.
I've renamed soft:true to triggersRerun:false. Although it's more verbose, I think it gives a better impression of what the setting does. I think it also works for both standard and service scripts.

I would still like to pursue the idea of "output slices" described in Proposal: changes to output and fingerprinting #245 later, but I think we can still land this in the meantime. If we do add output slices, then that will be an additional, more sugary way to express the same thing for some use cases.

However, I think this setting will still have a place -- [1] for services that read output dynamically (where it's not a question of which output you consume, it's more about whether the binary reads it at startup time or request time), and [2] because maybe sometimes it won't make sense to have your dependency declare an output slice (maybe it's just weirdly specific, or maybe you don't own that other script).

cc @justinfagnani

aomarks requested review from rictic, justinfagnani, AndrewJakubowicz and augustjk May 16, 2022 00:57

aomarks added 7 commits May 15, 2022 18:38

Document soft dependencies

3205c17

Add "soft" to schema.json and also add minLength requirements

8d7700e

Allow dependency to be an object

a223dd2

Extract and validate "soft" property

a0335be

Add test for soft dependencies

4f78a1d

Implement soft dependencies

8e81d20

Drop concurrency of exclusive lock test, because it sometimes takes >…

29ed2d9

… 60 seconds

aomarks force-pushed the soft branch from bbdc0dd to 29ed2d9 Compare May 16, 2022 01:38

augustjk approved these changes May 16, 2022

View reviewed changes

rictic requested changes May 17, 2022

View reviewed changes

aomarks added 2 commits May 17, 2022 08:37

Address PR comments (1)

a085f5b

Add (slightly wrong) test for jump-to-def from object dep

5f98775

aomarks requested a review from rictic May 17, 2022 16:17

rictic approved these changes May 17, 2022

View reviewed changes

Fix test indentation

b4fe477

justinfagnani approved these changes May 17, 2022

View reviewed changes

aomarks added 3 commits May 17, 2022 13:40

Do undefined checks more precisely

db7e259

Merge branch 'main' into soft

52ed95d

Merge branch 'main' into soft

4c3f5e4

aomarks changed the title ~~Add soft dependencies, which don't inherit cache keys~~ Add soft dependencies, which don't inherit fingerprints May 17, 2022

aomarks enabled auto-merge (squash) May 17, 2022 20:57

aomarks disabled auto-merge May 17, 2022 20:58

aomarks mentioned this pull request May 17, 2022

Proposal: changes to output and fingerprinting #245

Open

SanderElias mentioned this pull request May 27, 2022

command gets re-executed even with no change in input files #262

Open

aomarks mentioned this pull request Sep 26, 2022

Set up separate dev and prod builds with esbuild webcomponents/webcomponents.org#1334

Merged

aomarks mentioned this pull request Nov 5, 2022

Services #508

Merged

aomarks added 2 commits November 6, 2022 08:35

Merge branch 'main' into soft

583a72f

Add test that service soft dependency does not require restart

a6df3d1

aomarks force-pushed the soft branch from 1830132 to a6df3d1 Compare November 6, 2022 18:49

Document soft dependencies under services

ae18fcb

aomarks force-pushed the soft branch from d3890c6 to 89be709 Compare November 6, 2022 19:36

aomarks changed the title ~~Add soft dependencies, which don't inherit fingerprints~~ Add rerunOnChange setting to control fingerprint inheritance Nov 6, 2022

aomarks force-pushed the soft branch from 89be709 to 44350ce Compare November 6, 2022 19:43

Rename soft:true to triggersRerun:false

3d15573

aomarks force-pushed the soft branch from 44350ce to 3d15573 Compare November 6, 2022 19:48

aomarks changed the title ~~Add rerunOnChange setting to control fingerprint inheritance~~ Add triggersRerun setting to control fingerprint inheritance Nov 6, 2022

aomarks requested a review from justinfagnani November 7, 2022 22:14

justinfagnani approved these changes Nov 7, 2022

View reviewed changes

aomarks merged commit 98e025e into main Nov 7, 2022

aomarks deleted the soft branch November 7, 2022 23:40

aomarks mentioned this pull request Nov 8, 2022

Service mode #33

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add triggersRerun setting to control fingerprint inheritance #238

Add triggersRerun setting to control fingerprint inheritance #238

aomarks commented May 16, 2022 •

edited

Loading

augustjk May 16, 2022

aomarks May 17, 2022

justinfagnani May 17, 2022

aomarks May 17, 2022

rictic left a comment

rictic May 17, 2022

rictic May 17, 2022

aomarks May 17, 2022

rictic May 17, 2022

aomarks May 17, 2022

rictic May 17, 2022

rictic May 17, 2022

aomarks May 17, 2022

aomarks commented May 17, 2022 •

edited

Loading

aomarks commented May 17, 2022 •

edited

Loading

rictic May 17, 2022

aomarks May 17, 2022

rictic commented May 17, 2022

rictic commented May 17, 2022

augustjk commented May 17, 2022

aomarks commented May 17, 2022

justinfagnani May 17, 2022

justinfagnani May 17, 2022

aomarks commented May 17, 2022 •

edited

Loading

aomarks commented Nov 6, 2022 •

edited

Loading

Add triggersRerun setting to control fingerprint inheritance #238

Add triggersRerun setting to control fingerprint inheritance #238

Conversation

aomarks commented May 16, 2022 • edited Loading

Background

This PR

Example

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rictic left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aomarks commented May 17, 2022 • edited Loading

aomarks commented May 17, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rictic commented May 17, 2022

rictic commented May 17, 2022

augustjk commented May 17, 2022

aomarks commented May 17, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aomarks commented May 17, 2022 • edited Loading

aomarks commented Nov 6, 2022 • edited Loading

aomarks commented May 16, 2022 •

edited

Loading

aomarks commented May 17, 2022 •

edited

Loading

aomarks commented May 17, 2022 •

edited

Loading

aomarks commented May 17, 2022 •

edited

Loading

aomarks commented Nov 6, 2022 •

edited

Loading