Use the pympipool executor #77

Merged: liamhuber merged 51 commits into main from use_pympipool on Nov 30, 2023

Conversation

liamhuber (Member) commented Nov 15, 2023

Closes #75

Replaces Node.executor = True/False with providing actual executor instances (while keeping the infrastructure so that, in the future, nodes can instead be instructed how to build an executor, e.g. for nesting/reconnection/submitters).
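
For concreteness, here is a minimal, library-free sketch of the pattern this moves to; SimpleNode and add_one are illustrative stand-ins rather than pyiron_workflow classes, and ProcessPoolExecutor stands in for any concurrent.futures-style executor such as pympipool's:

from concurrent.futures import Executor, Future, ProcessPoolExecutor


def add_one(x):
    return x + 1


class SimpleNode:
    """Illustrative stand-in for a node that holds an executor instance."""

    def __init__(self, fn, **inputs):
        self.fn = fn
        self.inputs = inputs
        self.executor = None  # previously a bool flag; now an Executor instance (or None)

    def run(self) -> Future:
        if isinstance(self.executor, Executor):
            # Hand the work off to whatever executor instance was provided
            return self.executor.submit(self.fn, **self.inputs)
        # No executor: run locally, but wrap the result for a uniform interface
        future = Future()
        future.set_result(self.fn(**self.inputs))
        return future


if __name__ == "__main__":
    node = SimpleNode(add_one, x=41)
    node.executor = ProcessPoolExecutor(max_workers=1)  # an instance, not True
    print(node.run().result())  # -> 42
    node.executor.shutdown()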

The only big catch right now is that the tests hang on my local machine with the pympipool executor (while they run fine with the built-in and much simpler/weaker CloudpickleProcessPoolExecutor). This happens whenever a macro/workflow with children is the one getting an executor -- all the tests where function nodes have an executor work totally fine with both executors. Since this might be related to pympipool tests hanging on mac, I'm just going to naively push this PR and see if the tests pass OK on linux and windows. Additionally, the hanging tests work totally fine when run in a jupyter notebook -- so I'm optimistic this is something that can be overcome; I just need more data to figure out how.

TODO:

  • Update docs
  • Update demo notebook (at least the quickstart still uses executor = True)
  • Go look at pympipool for how the flux and slurm guys get tested and try to do something similar here. (I'm shoving this to "outside scope" for this PR; getting the pympipool executor working is enough for now.)


Binder 👈 Launch a binder notebook on branch pyiron/pyiron_workflow/use_pympipool

liamhuber (Member Author)

File "/usr/share/miniconda3/envs/my-env/lib/python3.10/site-packages/pympipool/backend/serial.py", line 30, in main
    input_dict = interface_receive(socket=socket)
  File "/usr/share/miniconda3/envs/my-env/lib/python3.10/site-packages/pympipool/shared/communication.py", line 147, in interface_receive
    return cloudpickle.loads(socket.recv())
ModuleNotFoundError: No module named 'test_macro'

This is the cause of failure for a couple of these. It might be those awkward expectations pympipool has about the PYTHONPATH that I mentioned somewhere else (the linked issue, maybe?).

liamhuber (Member Author)

Ah, perhaps it runs in the notebook because cloudpickle recognizes that things are being defined in __main__ and so knows to use by-value serialization, but when I'm running the tests they live in a .py file, so it assumes by-reference serialization will work; it doesn't realize that this .py file is not actually on the python path.
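
(For reference, cloudpickle exposes a switch for exactly this by-reference vs. by-value decision. Below is a hedged sketch: the test_macro usage is hypothetical, while register_pickle_by_value and dumps are real cloudpickle calls.)

import pickle

import cloudpickle


def dump_by_value(module, func_name):
    """Serialize module.<func_name> by value so that a worker process which
    cannot import the module can still deserialize it with plain pickle."""
    # Without this, cloudpickle serializes functions from importable modules
    # by reference (e.g. "test_macro.plus_one"), which is what produces the
    # ModuleNotFoundError in the pympipool worker above.
    cloudpickle.register_pickle_by_value(module)
    return cloudpickle.dumps(getattr(module, func_name))


# Hypothetical usage mirroring the failing tests:
#     import test_macro
#     payload = dump_by_value(test_macro, "some_node_function")
#     restored = pickle.loads(payload)  # no import of test_macro required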

# Conflicts:
#	.binder/environment.yml
#	.ci_support/environment.yml
#	docs/environment.yml
#	setup.py
liamhuber (Member Author)

Ok, failures are from referencing objects defined in the same file but different scopes, such that the executor thinks it's something that should be available by reference but really it still needs to be pickled by value. Cf. this issue over on pympipool for a minimal(ish) example.

I can get around this by modifying the existing tests so they never have this same-file-different-scope problem, but I don't want some user/node dev to run into this problem later, so let's see if it can be intelligently fixed upstream before giving up.

pyiron-runner and others added 4 commits November 27, 2023 19:02
The pympipool executor complains about receiving 'self' twice
This check just had a holdover from when the executor attribute was a bool
liamhuber added the format_black label (trigger the Black formatting bot) on Nov 27, 2023
liamhuber (Member Author)

pympipool is adding "." to the python path; the claim is that this is for flux, but could it be needed for other executors? The tests there involve a cd tests; python -m unittest discover ., so "." is directly the tests folder. In contrast, the centralized CI is directly invoking coverage run -m unittest discover ${{ inputs.test-dir }}, where the directory being passed in is the default value for the pyiron repos using the reusable workflows, i.e. tests/unit. So in my case I imagine "." is actually the main repository directory, which is not a python module.

Is there an equivalent explanation for the coverage tests? Yes, it similarly runs the central reusable workflow with its default value of tests, s.t. the final command is coverage run -m unittest discover tests -- still not cd tests; python -m unittest discover .

To test this hypothesis, let's make sure tests/unit is part of the path before we use the executor s.t. test_macro is visible for import.
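
A minimal sketch of the kind of path adjustment being attempted (the exact mechanism used in the test setup is assumed here; the tests/unit layout comes from the CI description above):

import os
import sys

unit_test_dir = os.path.join(os.getcwd(), "tests", "unit")

# Make the test modules importable in this (parent) process...
if unit_test_dir not in sys.path:
    sys.path.insert(0, unit_test_dir)

# ...and advertise them to separately spawned worker processes, which inherit
# the environment but not the parent's sys.path.
os.environ["PYTHONPATH"] = unit_test_dir + os.pathsep + os.environ.get("PYTHONPATH", "")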

liamhuber (Member Author) commented Nov 28, 2023

That did nothing; I'm getting the exact same ModuleNotFoundErrors. I should check to make sure I'm actually getting the expected path.

liamhuber (Member Author)

The path is as expected. Maybe the path needs to be added in another process too? I am now trying to (mostly) just manually mimic a pympipool test, where we first cd into the test directory and then discover tests in "."

liamhuber (Member Author)

Ok, it looks like the cd then discover . worked -- it ran into errors discovering the static package, but it finished cleanly and the only errors had nothing to do with the executor:

======================================================================
ERROR: test_creator_access_and_registration (test_composite.TestComposite.test_creator_access_and_registration)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/usr/share/miniconda3/envs/test/lib/python3.11/site-packages/pyiron_workflow/interfaces.py", line 205, in _verify_identifier
    module = import_module(package_identifier)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/share/miniconda3/envs/test/lib/python3.11/importlib/__init__.py", line 126, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<frozen importlib._bootstrap>", line 1204, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1176, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1126, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed
  File "<frozen importlib._bootstrap>", line 1204, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1176, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1140, in _find_and_load_unlocked
ModuleNotFoundError: No module named 'static'

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/home/runner/work/pyiron_workflow/pyiron_workflow/tests/unit/test_composite.py", line 71, in test_creator_access_and_registration
    self.comp.register("demo", "static.demo_nodes")
  File "/usr/share/miniconda3/envs/test/lib/python3.11/site-packages/pyiron_workflow/composite.py", line 517, in register
    cls.create.register(domain=domain, package_identifier=package_identifier)
  File "/usr/share/miniconda3/envs/test/lib/python3.11/site-packages/pyiron_workflow/interfaces.py", line 163, in register
    self._verify_identifier(package_identifier)
  File "/usr/share/miniconda3/envs/test/lib/python3.11/site-packages/pyiron_workflow/interfaces.py", line 212, in _verify_identifier
    raise ValueError(
ValueError: The package identifier is static.demo_nodes is not valid. Please ensure it is an importable module with a list of Node objects stored in the variable `nodes`.

======================================================================
ERROR: test_registration (test_interfaces.TestCreator.test_registration)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/usr/share/miniconda3/envs/test/lib/python3.11/site-packages/pyiron_workflow/interfaces.py", line 205, in _verify_identifier
    module = import_module(package_identifier)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/share/miniconda3/envs/test/lib/python3.11/importlib/__init__.py", line 126, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<frozen importlib._bootstrap>", line 1204, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1176, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1126, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed
  File "<frozen importlib._bootstrap>", line 1204, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1176, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1140, in _find_and_load_unlocked
ModuleNotFoundError: No module named 'static'

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/home/runner/work/pyiron_workflow/pyiron_workflow/tests/unit/test_interfaces.py", line 27, in test_registration
    self.creator.register("demo", "static.demo_nodes")
  File "/usr/share/miniconda3/envs/test/lib/python3.11/site-packages/pyiron_workflow/interfaces.py", line 163, in register
    self._verify_identifier(package_identifier)
  File "/usr/share/miniconda3/envs/test/lib/python3.11/site-packages/pyiron_workflow/interfaces.py", line 212, in _verify_identifier
    raise ValueError(
ValueError: The package identifier is static.demo_nodes is not valid. Please ensure it is an importable module with a list of Node objects stored in the variable `nodes`.

----------------------------------------------------------------------

So it looks like the problem is that pympipool requires you to first cd into the test directory and then discover tests from there. IMO that's a problem with pympipool and not with pyiron_workflow. I'll go over there, see if I can reproduce this behaviour more simply with some new workflows, and get help there.

liamhuber (Member Author)

Dammit, I can't invoke the reusable workflow as a step. That means (and the docs explicitly call out this case) that I can't modify the GITHUB_ENV of the called workflow.

If this problem can't be fixed directly in pympipool, that will mean adding a step like the one below to the reusable workflow itself.

    - name: "Add test dirs to pythonpath (pympipool compliance)"
      run: |
        PWD=$(pwd)
        echo "PYTHONPATH=$PWD/tests/benchmark:$PYTHONPATH" >> $GITHUB_ENV
        echo "PYTHONPATH=$PWD/tests/integration:$PYTHONPATH" >> $GITHUB_ENV
        echo "PYTHONPATH=$PWD/tests/unit:$PYTHONPATH" >> $GITHUB_ENV

liamhuber (Member Author)

Per pympipool #239, I'm trying to modify the centralized CI to allow the python path to be expanded. Targeting the branch seems to have failed and push-pull didn't even run:

error parsing called workflow ".github/workflows/push-pull.yml" -> "pyiron/actions/.github/workflows/push-pull-main.yml@tests_in_python_path": failed to fetch workflow: reference to workflow should be either a valid branch, tag, or commit

Ahaaaa, because I have a typo in "pythonpath"

liamhuber (Member Author)

Perfect! Worked for all three OSs. I updated the upstream branch to also apply the same trick for the coverage job, which is still hanging.

liamhuber (Member Author)

Note that once everything is settled, the workflow tags will need to be reset to main or a release.

This will need to be updated to point at a correct tag at the end of the day
coveralls commented Nov 29, 2023

Pull Request Test Coverage Report for Build 7052576125

  • 35 of 43 (81.4%) changed or added relevant lines in 3 files are covered.
  • 69 unchanged lines in 8 files lost coverage.
  • Overall coverage decreased (-0.8%) to 87.853%

Changes Missing Coverage:
  File                            Covered Lines   Changed/Added Lines   %
  pyiron_workflow/interfaces.py   16              24                    66.67%

Files with Coverage Reduction:
  File                      New Missed Lines   %
  function.py               1                  92.35%
  io.py                     2                  90.23%
  pyiron_workflow/io.py     2                  85.71%
  pyiron_workflow/util.py   2                  84.62%
  util.py                   2                  92.11%
  composite.py              9                  87.13%
  interfaces.py             20                 80.2%
  node.py                   31                 87.5%

Totals:
Change from base Build 7040333669: -0.8%
Covered Lines: 3515
Relevant Lines: 4001

💛 - Coveralls


codacy-production bot commented Nov 29, 2023

Coverage summary from Codacy

See diff coverage on Codacy

Coverage variation: -0.91% (target: -1.00%)
Diff coverage: 74.51%

Coverage variation details:
  Commit                             Coverable lines   Covered lines   Coverage
  Common ancestor commit (fa4d15b)   2006              1656            82.55%
  Head commit (a8201e1)              2048 (+42)        1672 (+16)      81.64% (-0.91%)

Coverage variation is the difference between the coverage for the head and common ancestor commits of the pull request branch: <coverage of head commit> - <coverage of common ancestor commit>

Diff coverage details:
  Scope                Coverable lines   Covered lines   Diff coverage
  Pull request (#77)   51                38              74.51%

Diff coverage is the percentage of lines that are covered by tests out of the coverable lines that the pull request added or modified: <covered lines added or modified>/<coverable lines added or modified> * 100%


liamhuber merged commit d0017f1 into main on Nov 30, 2023 (17 checks passed)
liamhuber deleted the use_pympipool branch on November 30, 2023 22:33
liamhuber mentioned this pull request on Dec 1, 2023 and on Jan 8, 2024