Optionally execute processes exclusively in the foreground #8974

stuhood · 2020-01-15T23:00:42Z

Problem

As #8923 explains: streaming, foreground access to processes run by Pants is a useful tool for implementing debugging facilities. But it is currently only possible inside of a @goal_rule via InteractiveProcessRequest (rather than ExecuteProcessRequest), and this makes it more challenging to implement toggling of debug facilities without parallel codepaths that deal with the independent types and APIs.

Solution

Add support for a foreground flag on ExecuteProcessRequest that forces the process to run:

- locally
- with streaming stdout/stderr (NB: and stdin: worth discussing)

Result

./pants test --debug is implemented using this facility.

stuhood · 2020-01-15T23:05:21Z

This is a working draft for discussion, with a few TODOs:
~~1. Node::cacheable (and ExecuteProcessRequest persistent caching) should be disabled for foreground processes (...that consume stdin: see the next item)~~
2. It's possible that we should turn foreground into an enum or ternary that will disable stdin and allow for streaming output while still allowing a process to be cached (because it will not have any additional inputs to the process)
~~3. We might want to block landing this until #6598 is implemented, to allow the uncacheable process to act as expected when it is deeper in the graph.~~

Commits are useful to review independently.

gshuflin · 2020-01-16T00:37:37Z

I'm concerned that this PR is too closely tied to what would've been useful for making specifically #8827 easier to implement, especially now that that change is done with and merged. I do recognize that we will probably want to add --debug flags for non-Python tests in the future though.

> with streaming stdout/stderr (NB: and stdin: worth discussing)

We definitely need streaming stdin to work for at least two subprocess usecases - ./pants run, which is currently using InteractiveProcessRunner (InteractiveProcessRunner was basically built in order to implement ./pants run), and ./pants repl, which no one has gotten to porting to v2 yet, but which I had imagined would also use InteractiveProcessRunner. So even if we merge this PR, it doesn't let us get rid of InteractiveProcessRunner.

The test --debug usecase also needs it, since we expect that debugging tests is very often going to involve firing up some kind of debugger REPL within the test and typing commands into it real-time. So if streaming stdin doesn't currently work, I don't think this PR solves the problem we need it to solve either.

> 3. We might want to block landing this until #6598 is implemented, to allow the uncacheable process to act as expected when it is deeper in the graph.

I think this PR only makes sense if #6598 is implemented first. Without it, we can't guarantee that the engine will actually run an EPR with the foreground flag rather than pulling its cached value or the cached value of one of the rule graph nodes that includes it, and this needs to happen in an interactive context.

I basically see this proposal + #6598 as a way of getting rid of the need for side-effecting types like InteractiveProcessRunner. The engine can allow side-effecting operation safely in one of two ways - it can statically enforce that types that perform side-effecting operations get pushed to the edge of the graph, which is the current system, and the reason we implemented #8922 . Or, the engine can allow side-effecting operations anywhere, provided #6598 is implemented and the engine can automatically invalidate every cached node between a top-level goal rule and an EPR with a foreground flag.

I don't think it makes sense to have two separate ways of solving the problem of running a subprocess, so we should aim either to make IPR the right way to run a foregrounded subprocess, or implement this change to EPR and remove IPR. I'm concerned that right now this PR doesn't let us replicate everything IPR does with modifications to EPR; and also that even if it did, this would necessarily involve making large chunks of the rule graph selectively uncached by flipping a boolean in an EPR data structure somewhere.

It might be desirable to force rule-writers to structure their types such that they can only do side-effecting things in a goal_rule, but that desideratum is in conflict with the goal of this PR to reduce parallel code paths. However, I do think that it's possible in principle to design rule code paths in such a way that the branch between the interactive and non-interactive states happens only at the very last moment (cf. this commit: 37eca64, which pulled InteractiveProcessRunner out of a non-goal rule and put it in a goal rule).

stuhood · 2020-01-16T01:38:47Z

> We definitely need streaming stdin to work for at least two subprocess usecases

To be clear: it already works here. I'm suggesting that in some cases folks might want streaming stdout/stderr without stdin, and so we might not want foreground to be boolean.

stuhood · 2020-01-23T21:05:53Z

> I'm concerned that this PR is too closely tied to what would've been useful for making specifically #8827 easier to implement, especially now that that change is done with and merged. I do recognize that we will probably want to add --debug flags for non-Python tests in the future though.

Yes: all of them. Which means that all test runners will have this duplication.

But additionally, there has been a desire to (conditionally) stream output from processes: you could imagine the test goal automatically pushing down a signal for foreground=True if there was exactly one test to run, for example.
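As a rough illustration of that idea, here is a minimal sketch of a test goal that sets the flag when either `--debug` is passed or exactly one test is requested. `ProcessRequestSketch` and `requests_for_tests` are hypothetical names invented for this example; they are not the Pants `ExecuteProcessRequest` API, and only the `foreground` field mirrors what this PR proposes.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class ProcessRequestSketch:
    """Hypothetical stand-in for a process request carrying a foreground flag."""

    argv: Tuple[str, ...]
    foreground: bool = False


def requests_for_tests(
    test_argvs: Sequence[Sequence[str]], debug: bool = False
) -> List[ProcessRequestSketch]:
    # Stream output if the user asked for debugging, or if there is exactly
    # one test, so that interleaved output from parallel tests is not a concern.
    stream = debug or len(test_argvs) == 1
    return [ProcessRequestSketch(tuple(argv), foreground=stream) for argv in test_argvs]


single = requests_for_tests([["pytest", "a_test.py"]])
several = requests_for_tests([["pytest", "a_test.py"], ["pytest", "b_test.py"]])
several_debug = requests_for_tests(
    [["pytest", "a_test.py"], ["pytest", "b_test.py"]], debug=True
)
```

The point of the sketch is that the caller builds one kind of request either way; only a field value changes, rather than the type or the code path.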

> I don't think it makes sense to have two separate ways of solving the problem of running a subprocess, so we should aim either to make IPR the right way to run a foregrounded subprocess, or implement this change to EPR and remove IPR. I'm concerned that right now this PR doesn't let us replicate everything IPR does with modifications to EPR; and also that even if it did, this would necessarily involve making large chunks of the rule graph selectively uncached by flipping a boolean in an EPR data structure somewhere.

I agree that it would be unfortunate to need two ways to do things, but I also agree that having side-effecting operations deeper in the graph is a no-go. (To be clear: the side-effecting part here is not stdin, but rather the fact that the process runs in the buildroot rather than in a chroot.) Is it possible that there is a middle ground where IPR can be made significantly simpler, and essentially targeted at only repl/run?

### Problem

The rust level `Node::cacheable` flag is currently only used to mark `@goal_rule`s as uncacheable (because they are allowed to operate on `@sideeffecting` types, such as the `Console` and the `Workspace`). But since the implementation of `cacheable` did not allow it to operate deeply in the Graph, we additionally needed to mark their parent `Select` nodes uncacheable, and could not use the flag in more positions.

Via #7350, #8495, #8347, and #8974, it has become clear that we would like to safely allow nodes deeper in the graph to be uncacheable, as this allows for the re-execution of non-deterministic processes, or re-consumption of un-trackable state, such as:

1. a process receiving stdin from a user
2. an intrinsic rule that pokes an un-watched file on the filesystem
3. interacting with a stateful process like git

Note that these would all be intrinsic Nodes: it's not clear that we want to expose this facility to `@rule`s directly.

### Solution

Finish adding support for uncacheable nodes. Fixes #6598.

When an uncacheable node completes, it will now keep the value it completed with (in order to correctly compute a `Generation` value), but it will re-compute the value once per `Session`. The accurate `Generation` value for the uncacheable node allows its dependents to "clean" themselves and not re-run unless the uncacheable node produced a different value than it had before.

### Result

The `Node::cacheable` flag may be safely used deeper in the graph, with the semantics that requests for any of an uncacheable node's dependents will cause it to re-run once per `Session`. The dependents will not re-run unless the value of the uncacheable node changes (regardless of the `Session`).
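The semantics described above can be modelled with a toy sketch. This is a simplified Python model written for illustration, not the Rust engine's implementation: `UncacheableNode`, `DependentNode`, and the string session ids are all hypothetical, and the real `Graph` tracks generations and dirtiness in considerably more detail.

```python
class UncacheableNode:
    """Toy model: re-computes once per session, bumps generation only on change."""

    def __init__(self, compute):
        self._compute = compute
        self._last_session = None
        self.value = None
        self.generation = 0
        self.runs = 0

    def get(self, session_id):
        if self._last_session != session_id:
            # First request in this session: re-run the underlying computation.
            self._last_session = session_id
            self.runs += 1
            new_value = self._compute()
            if self.runs == 1 or new_value != self.value:
                self.value = new_value
                self.generation += 1
        return self.value, self.generation


class DependentNode:
    """Toy model: re-runs only when the dependency's generation has changed."""

    def __init__(self, dep, compute):
        self._dep = dep
        self._compute = compute
        self._seen_generation = None
        self.value = None
        self.runs = 0

    def get(self, session_id):
        dep_value, dep_generation = self._dep.get(session_id)
        if dep_generation != self._seen_generation:
            self._seen_generation = dep_generation
            self.runs += 1
            self.value = self._compute(dep_value)
        # Otherwise the node is "cleaned": the previous value is still valid.
        return self.value


answers = iter(["a", "a", "b"])
source = UncacheableNode(lambda: next(answers))
upper = DependentNode(source, str.upper)

upper.get("s1")
upper.get("s1")  # same session: the uncacheable node does not re-run
upper.get("s2")  # new session: it re-runs, but the value is unchanged
upper.get("s3")  # new session and new value: the dependent re-runs
```

In this run the uncacheable node executes three times (once per session), while the dependent executes only twice, because the second session produced the same value as the first.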


stuhood · 2020-02-16T04:36:15Z

Still a working draft, but now based on #9015, so it has proper cache behavior.

stuhood · 2020-02-16T04:38:14Z

src/python/pants/backend/python/rules/python_test_runner.py

@@ -188,25 +187,15 @@ def get_packages_to_cover(
    description=f'Run Pytest for {target_with_origin.adaptor.address.reference()}',
    timeout_seconds=test_setup.timeout_seconds if test_setup.timeout_seconds is not None else 9999,
    env=env,
+    foreground=test_options.values.debug,


This flag replaces the debug-specific test runner code.

Eric-Arellano · 2020-05-18T03:30:57Z

Is this stale? I think you mentioned wanting to land this last week in a thread on test output.

stuhood · 2020-11-18T21:25:26Z

This is stale, but the idea isn't dead. Can close it if it would clean things up to do so.

I'd still like to unify InteractiveProcess and Process, but one complicating factor that we noticed today is that because pdb requires a TTY to implement its repl, simply copying stdout/stderr through as we do here won't work. The foreground boolean value could potentially be an enum of {background, foreground, tty}, where background and foreground could capture stdio, but tty could not.
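A minimal sketch of what such an enum might look like follows. The `ProcessMode` name and its helper properties are hypothetical; only the three variants and the rule about which of them can capture stdio come from the comment above.

```python
from enum import Enum


class ProcessMode(Enum):
    """Hypothetical replacement for a boolean `foreground` flag."""

    BACKGROUND = "background"
    FOREGROUND = "foreground"
    TTY = "tty"

    @property
    def can_capture_stdio(self) -> bool:
        # A process attached directly to the TTY (e.g. for pdb) cannot have
        # its stdio captured; the other two modes can.
        return self is not ProcessMode.TTY

    @property
    def visible_to_user(self) -> bool:
        # Assumption for this sketch: everything except BACKGROUND shows
        # output to the user while the process is running.
        return self is not ProcessMode.BACKGROUND


capturable = [mode.value for mode in ProcessMode if mode.can_capture_stdio]
```

Keeping the capture rule on the enum itself would let callers ask the mode what it supports instead of re-deriving it from a boolean in several places.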

Eric-Arellano · 2020-11-18T21:52:14Z

> Can close it if it would clean things up to do so.

It'd be great to close, if you don't mind. I think it's helpful for the # of PRs to remain low and only be work that we have an intent to merge in the near future. Helps us to stay on track with things like users submitting PRs.

stuhood · 2020-11-18T22:18:40Z

Have updated #8923 with the new information. Closing for now.

stuhood requested review from illicitonion, codealchemy and gshuflin January 15, 2020 23:00

stuhood mentioned this pull request Jan 26, 2020

Dirty the dependents of uncacheable nodes #9015

Merged

stuhood added 6 commits February 15, 2020 19:20

- f4c596a: Remove ui demo and simplify package structure.
- ef7ddcd: Do not double-log when a Console is active.
- dae0478: Add an async Mutex covering stdio, and acquire it while logging and for the Display.
- 0cd3873: Add support for spawning processes in the foreground, and a trait to temporarily relinquish stdio access.
- d5f47db: WIP: Working, but can deadlock when some other process already has stdio, and so we panic and deadlock on the logger.
- 10720dc: WIP: Remove debug runner.

stuhood force-pushed the stuhood/process-execute-foreground branch from 0e03fc1 to 10720dc Compare February 16, 2020 04:33


Eric-Arellano added the stale? label May 18, 2020

stuhood mentioned this pull request May 18, 2020

Implement interactive/foreground mode for ExecutionProduct{Request,Result} #6002

Closed

stuhood removed the stale? label May 19, 2020

stuhood mentioned this pull request Oct 27, 2020

On-the-fly streaming access to a Process's outputs #11056

Open

stuhood mentioned this pull request Nov 18, 2020

Explore adding a foreground option to Process #8923

Open

stuhood closed this Nov 18, 2020
