RF: Submitter logic. #524

mgxd · 2022-04-01T21:32:05Z

A quick pass to try and clean up some of the logic in the Submitter class. This can definitely be reduced further, but hopefully this starts with the untangling.

for more information, see https://pre-commit.ci

codecov · 2022-04-02T02:28:54Z

Codecov Report

Merging #524 (a2cc367) into master (0520af5) will decrease coverage by 2.06%.
The diff coverage is 83.33%.

@@            Coverage Diff             @@
##           master     #524      +/-   ##
==========================================
- Coverage   79.04%   76.98%   -2.07%     
==========================================
  Files          20       20              
  Lines        4348     4279      -69     
  Branches     1231     1204      -27     
==========================================
- Hits         3437     3294     -143     
- Misses        720      799      +79     
+ Partials      191      186       -5

Flag	Coverage Δ
unittests	`76.88% <83.33%> (-2.07%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
pydra/engine/submitter.py	`88.32% <80.00%> (+3.65%)`	⬆️
pydra/engine/core.py	`88.94% <91.66%> (-0.39%)`	⬇️
pydra/engine/workers.py	`18.76% <100.00%> (+0.17%)`	⬆️
pydra/engine/boutiques.py	`16.50% <0.00%> (-67.97%)`	⬇️
pydra/engine/task.py	`85.24% <0.00%> (-2.61%)`	⬇️
pydra/engine/helpers_file.py	`78.72% <0.00%> (-1.83%)`	⬇️
pydra/engine/specs.py	`88.50% <0.00%> (-0.41%)`	⬇️
pydra/mark/functions.py	`100.00% <0.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 0520af5...a2cc367. Read the comment docs.

mgxd

I went through and left some thought / talking points - this is ready for a review.

mgxd · 2022-04-04T14:45:04Z

pydra/engine/core.py

@@ -1227,21 +1232,31 @@ def create_dotfile(self, type="simple", export=None, name=None):
                formatted_dot.append(self.graph.export_graph(dotfile=dotfile, ext=ext))
            return dotfile, formatted_dot

-    def _connect_and_propagate_to_tasks(self):
+    def _connect_and_propagate_to_tasks(


We're doing the "iterate through the graph, assign connections, and do something to each node based on a workflow attribute" in a few places, so let's methodize it.

mgxd · 2022-04-04T14:46:48Z

pydra/engine/submitter.py

-            self.worker = SGEWorker(**kwargs)
-        else:
-            raise Exception(f"plugin {self.plugin} not available")
+        try:


it seemed a little cleaner to just attempt to index the plugin here

mgxd · 2022-04-04T14:47:48Z

pydra/engine/submitter.py

-        if is_workflow(runnable):
-            # resetting all connections with LazyFields
-            runnable._reset()
+        self.loop.run_until_complete(self.submit_from_call(runnable, rerun))


Trying to keep task.run() successor as simple as possible

mgxd · 2022-04-04T15:25:18Z

pydra/engine/submitter.py

-    DaskWorker,
-    SGEWorker,
-)
+from .workers import WORKERS
 from .core import is_workflow
 from .helpers import get_open_loop, load_and_run_async


given that these functions are only used here, it might help with readability just having them here.

mgxd · 2022-04-04T15:26:40Z

pydra/engine/submitter.py

@@ -110,41 +98,37 @@ async def submit(self, runnable, wait=False, rerun=False):
            Coroutines for :class:`~pydra.engine.core.TaskBase` execution.

        """
+        if runnable.plugin and runnable.plugin != self.plugin:
+            raise NotImplementedError()


This was removed since it wasn't being tested.

original:

# dj: this is not tested!!! TODO await self.worker.run_el(workflow, rerun=rerun)

mgxd · 2022-04-04T15:30:15Z

pydra/engine/submitter.py

+                # job has no state anymore
+                futures.add(
+                    # This unpickles and runs workflow - why are we pickling?
+                    asyncio.create_task(load_and_run_async(task_pkl, sidx, self, rerun))


I think this bit is still excessively confusing, and could really benefit from a rework / simplification. Worker.run_el should have a streamlined behavior - currently it handles two pathways (state vs stateless)

djarecka · 2022-04-06T02:44:49Z

pydra/engine/core.py

-            if self.task_rerun and self.propagate_rerun:
-                task.task_rerun = self.task_rerun
+            if propagate_rerun:
+                task.task_rerun = True


I'm wondering if this should be always True, even if self.task_rerun is False?

it should never be False, since we call this as:

self._connect_and_propagate_to_tasks( propagate_rerun=self.task_rerun and self.propagate_rerun )

djarecka · 2022-04-06T03:19:28Z

@mgxd - thank you a lot! Any idea one of the slurm test is failing? (seems to be consistent) If not, I can try to debug tomorrow

mgxd · 2022-04-06T13:52:53Z

I haven't looked into the SLURM fail too much, but it seems like it is submitting each FunctionTask within the workflow through SLURM directly, instead CF (which I think it was doing before?).

…lugin than submitter

adding one more case to submit_from_call

djarecka · 2022-04-14T15:06:27Z

ok, I'm merging this for now, unless someone stops me We can have another round of refactoring in another PR at some point. Thanks @mgxd

mgxd and others added 5 commits April 1, 2022 17:20

RF: Clean up task/workflow submission

6d23a36

RF: Propagate some workflow settings when iterating graph

7bae1de

[pre-commit.ci] auto fixes from pre-commit.com hooks

16e9016

for more information, see https://pre-commit.ci

FIX: tuple ordering

a76ad7f

[pre-commit.ci] auto fixes from pre-commit.com hooks

3e7a871

for more information, see https://pre-commit.ci

FIX: Propagate rerun to state expansion

85f8630

mgxd force-pushed the rf/submitter branch from d100ef6 to 85f8630 Compare April 4, 2022 15:18

mgxd commented Apr 4, 2022

View reviewed changes

mgxd marked this pull request as ready for review April 4, 2022 15:31

djarecka reviewed Apr 6, 2022

View reviewed changes

djarecka and others added 3 commits April 13, 2022 20:56

adding one more case to submit_from_call, for a wf with a different p…

abc755d

…lugin than submitter

revert changes to sge_available

6e46b8f

Merge pull request #9 from djarecka/mgxd-rf/submitter

a2cc367

adding one more case to submit_from_call

djarecka merged commit 7e8232c into nipype:master Apr 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RF: Submitter logic. #524

RF: Submitter logic. #524

mgxd commented Apr 1, 2022

codecov bot commented Apr 2, 2022 •

edited

Loading

mgxd left a comment

mgxd Apr 4, 2022

mgxd Apr 4, 2022

mgxd Apr 4, 2022

mgxd Apr 4, 2022

djarecka Apr 6, 2022

mgxd Apr 4, 2022

mgxd Apr 4, 2022

djarecka Apr 6, 2022

mgxd Apr 6, 2022

djarecka commented Apr 6, 2022

mgxd commented Apr 6, 2022

djarecka commented Apr 14, 2022 •

edited

Loading

RF: Submitter logic. #524

RF: Submitter logic. #524

Conversation

mgxd commented Apr 1, 2022

codecov bot commented Apr 2, 2022 • edited Loading

Codecov Report

mgxd left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

djarecka commented Apr 6, 2022

mgxd commented Apr 6, 2022

djarecka commented Apr 14, 2022 • edited Loading

codecov bot commented Apr 2, 2022 •

edited

Loading

djarecka commented Apr 14, 2022 •

edited

Loading