mutest: new output arg for out-of-order cmd execution #39

liambrady · 2024-10-23T13:45:04Z

Adds a new argument, output: bool, for the mutest method step() that allows for a cmd to be executed within a node without waiting for any output. This introduces the ability to effectively run a series of (possibly lengthy) commands in the background while the mutest is allowed to continue executing further steps (some of which may require a previous command to still be in the middle of executing).

This commit also modifies _cmd_status() in base.py to support executing a command without returning output in the first place.

codecov · 2024-10-23T13:55:35Z

Codecov Report

Attention: Patch coverage is 60.00000% with 2 lines in your changes missing coverage. Please review.

Project coverage is 59.33%. Comparing base (f0447ca) to head (67b3716).
Report is 60 commits behind head on main.

Files with missing lines	Patch %	Lines
munet/base.py	60.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #39      +/-   ##
==========================================
+ Coverage   58.94%   59.33%   +0.38%     
==========================================
  Files          18       19       +1     
  Lines        5286     5545     +259     
==========================================
+ Hits         3116     3290     +174     
- Misses       2170     2255      +85

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

choppsv1 · 2024-10-26T15:06:54Z

munet/base.py

+            o, e = p.communicate(pinput, timeout=timeout)
+        else:
+            o = ''
+            e = ''
        return self._cmd_status_finish(p, cmds, actual_cmd, o, e, raises, warn)


This call is expecting to run after the process has completed (one side effect of the p.communicate call above is that it waits until this happens). If we look at _cmd_status_finish you'll see it checks p.returncode as a boolean (expecting 0 to mean success); however, in the not-completed-running case p.returncode will be None so this also looks like success.

I figured that in the case that an immediate error occurred within the process, then maybe catching it isn't a bad thing. In retrospect though, leaving that up to chance is probably a bad idea and consistency would be preferred so I will probably modify this to skip the _cmd_status_finish entirely.

choppsv1 · 2024-10-26T15:08:01Z

munet/base.py

+        raises=False,
+        warn=True,
+        stdin=None,
+        output=True,


I understand you're looking for a "fire-and-forget" command, I just wonder if this is the right way to do that.

I think that perhaps instead of output the new param should be no_wait=False.

I'm also wondering if we should explicitly set stdout=subprocess.DEVNULL and stderr=subprocess.DEVNULL.

choppsv1 · 2024-10-26T15:08:40Z

munet/mutest/userapi.py

+        target: str,
+        cmd: str,
+        output: bool = True,
+    ) -> str:


In general creating these fire-and-forget processes is messy b/c there's no guarantee they will complete before the test exists (in which case they will probably be killed by the kernel, or have PIPE closed signals or something). For non-mutest uses one uses popen to start processes that you want to run in the background and you get back a p process object that you can kill or wait on later. I don't know if this is the right pattern to use for mutest though -- so maybe your way here is ok.

I wonder about the actual use case. Might a new API that runs multiple commands simultaneously and waits for them all to complete would be a cleaner solution (step_multi or step_parallel)? It depends on the problem we're trying to solve I guess.

The use case I have encountered twice now is the desire to create some sort of traffic generator on a node (or two nodes in the case of setting up a client/server pair situation) and then run a series of tests based on the results (perhaps on separate nodes). While it would probably be feasible to do this with some sort of new waitstep_multi or waitstep_parallel, such would require that API call to be responsible for multiple parallel steps plus multiple waiting steps which seems excessive. I figured that accepting the risk and running a few fire-and-forget cmd through a step (then using match/wait calls as usual but written by the tester to be robust to the state of the background process) was cleanest.

I do agree that letting the kernel kill the processes is a naive solution though, so perhaps the best solution would be to keep track of all fire-and-forget processes in a list and clean it up later when a node is being deleted?

mutest: new output arg for out-of-order cmd execution

67b3716

liambrady added enhancement New feature or request mutest mutest related item labels Oct 23, 2024

liambrady requested a review from choppsv1 October 23, 2024 13:45

choppsv1 reviewed Oct 26, 2024

View reviewed changes

liambrady marked this pull request as draft November 23, 2024 19:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mutest: new output arg for out-of-order cmd execution #39

mutest: new output arg for out-of-order cmd execution #39

liambrady commented Oct 23, 2024 •

edited

Loading

codecov bot commented Oct 23, 2024

choppsv1 Oct 26, 2024

liambrady Oct 28, 2024

choppsv1 Oct 26, 2024

choppsv1 Oct 26, 2024

choppsv1 Oct 26, 2024

liambrady Oct 28, 2024

mutest: new output arg for out-of-order cmd execution #39

Are you sure you want to change the base?

mutest: new output arg for out-of-order cmd execution #39

Conversation

liambrady commented Oct 23, 2024 • edited Loading

codecov bot commented Oct 23, 2024

Codecov Report

choppsv1 Oct 26, 2024

Choose a reason for hiding this comment

liambrady Oct 28, 2024

Choose a reason for hiding this comment

choppsv1 Oct 26, 2024

Choose a reason for hiding this comment

choppsv1 Oct 26, 2024

Choose a reason for hiding this comment

choppsv1 Oct 26, 2024

Choose a reason for hiding this comment

liambrady Oct 28, 2024

Choose a reason for hiding this comment

liambrady commented Oct 23, 2024 •

edited

Loading