
[patch] Relocate pyiron_workflow.job #1126

Draft: wants to merge 35 commits into base `main`
Conversation

liamhuber (Member)

Move the module here using git copy and update dependencies. Note that this is waiting on a release of pyiron_workflow that doesn't exist yet -- depending on existing releases would make a cyclic dependency.

liamhuber and others added 30 commits January 31, 2024 17:07
Saving now requires that the saved nodes be importable, which would be the case if you copied and pasted the example code into a notebook, but is not the case in the dynamic context where doctest runs things. So don't define a custom "Sleep" node here; use a standard node instead.
It was just not showing up because the test name was doubled up
And test for it, hopefully, we'll find out on the CI I guess
Make NodeJob compliant with the storage interface
To clear room for a simpler NodeJob that _doesn't_ lean on pyiron_workflow's storage implementation
Rename NodeJob to StoredNodeJob
It's just a less-good version of the `NodeOutputJob`
I suspect what's happening is `DataContainer` is leaning on `h5io_browser`'s `use_state` defaults, but I won't dig into it now.
[breaking] Replace the wrapper job with a more robust job subclass
In the process of getting nicer phrasing
* Change paradigm to whether the node uses `__reduce__` and a constructor

Instead of "Meta" nodes

* Allow direct use of Constructed children

* Move and update constructed stuff

* Add new singleton behaviour so factory-produced classes can pass is-tests

* PEP8 newline

* Remove unnecessary __getstate__

The object isn't holding instance-level state, and older versions of Python bork here.

* Add constructed __*state__ compatibility for older versions

* 🐛 add missing `return`

* Format black

* Revert singleton

* Remove constructed

It's superseded by the snippets.factory stuff

* Format black

* Let the factory clear method take specific names

* Don't override __module__ to the factory function

If it was explicitly set downstream, leave that. But if the user left it empty, still default it back to the factory function's module

* Clean up storage if job tests fail

* Make tinybase the default storage backend

* Switch Function and Macro over to using classfactory

With this, everything is pickleable (unless you slap something unpickleable on top, or define it in a place that can't be reached by pickle like inside a local function scope). The big downside is that `h5io` storage is now basically useless, since all our nodes come from custom reconstructors. Similarly, for the node job `DataContainer` can no longer store the input node. The `tinybase` backend is still working ok, so I made it the default, and I got the node job working again by forcing it to cloudpickle the input node on saving. These are some ugly hacks, but since storage is an alpha feature right now anyhow, I'd prefer to push ahead with pickleability.

* Remove unused decorator

And reformat tests in the vein of usage in Function and Macro

* Format black

---------

Co-authored-by: pyiron-runner <pyiron@mpie.de>
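The "force it to cloudpickle the input node on saving" hack described above can be sketched roughly as follows. This is a minimal illustration only; `serialize_node` and `deserialize_node` are hypothetical helper names, not the actual pyiron_contrib API, and the only assumption is that the third-party `cloudpickle` package is available for the fallback path.

```python
import pickle


def serialize_node(node) -> bytes:
    """Pickle a node, falling back to cloudpickle for dynamic classes.

    Hypothetical helper illustrating the hack described above; the real
    job code differs in detail.
    """
    try:
        # Works when the node's class is importable (defined in a module)
        return pickle.dumps(node)
    except Exception:
        # Factory-produced or notebook-defined classes need cloudpickle
        import cloudpickle  # third-party dependency

        return cloudpickle.dumps(node)


def deserialize_node(blob: bytes):
    # cloudpickle output is loadable with the plain pickle module
    return pickle.loads(blob)
```

Either way the bytes round-trip through plain `pickle.loads`, so the storage backend doesn't need to know which serializer produced them.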
* Expose concurrent.futures executors on the creator

* Only expose the base Executor from pympipool

Doesn't hurt us now and prepares for the version bump

* Extend `Runnable` to use a non-static method

This is significant. `on_run` is no longer a property returning a staticmethod that will be shipped off, but we directly ship off `self.on_run` so `self` goes with it to remote processes. Similarly, `run_args` gets extended to be `tuple[tuple, dict]` so positional arguments can be sent too.

Stacked on top of pickleability, this means we can now use standard `concurrent.futures.ProcessPoolExecutor` -- as long as the nodes are all defined somewhere importable, i.e. not in `__main__`. Since working in notebooks is pretty common, the more flexible `pympipool.Executor` is left as the default `Workflow.create.Executor`.

This simplifies some stuff under the hood too, e.g. `Function` and `Composite` now just directly do their thing in `on_run` instead of needing the misdirection of returning their own static methods.

* Format black
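The `on_run`/`run_args` change can be illustrated with a minimal sketch. Class and method names follow the description above but are heavily simplified; the real `Runnable` in pyiron_workflow carries much more machinery.

```python
from concurrent.futures import Executor, Future, ThreadPoolExecutor


class Runnable:
    """Sketch: ship the bound method self.on_run, so self travels with it."""

    def on_run(self, *args, **kwargs):
        raise NotImplementedError

    @property
    def run_args(self) -> tuple[tuple, dict]:
        # Positional and keyword arguments to pass to on_run
        return (), {}

    def run_with(self, executor: Executor) -> Future:
        args, kwargs = self.run_args
        # self.on_run is a bound method: pickling it serializes self too,
        # so the instance's state is available in the remote process
        return executor.submit(self.on_run, *args, **kwargs)


class Doubler(Runnable):
    def __init__(self, x):
        self.x = x

    def on_run(self):
        return 2 * self.x


with ThreadPoolExecutor() as pool:
    result = Doubler(21).run_with(pool).result()
```

With a `ProcessPoolExecutor` the same code works as long as `Doubler` is defined in an importable module rather than `__main__`, which is exactly the constraint noted above.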


* Compute qualname if not provided

* Fail early if there is a <locals> function in the factory made hierarchy

* Skip the factory fanciness if you see <locals>

This enables _FactoryMade objects to be cloudpickled, even when they can't be pickled, while still not letting the mere fact that they are dynamic classes stand in the way of pickling.

Nicely lifts our constraint on the node job interaction with pyiron base, which was leveraging cloudpickle

* Format black

* Test ClassFactory this way too

---------

Co-authored-by: pyiron-runner <pyiron@mpie.de>
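The `<locals>` check described above could look roughly like this; a hedged sketch with an illustrative function name, since the actual implementation lives in the factory code and may differ.

```python
def defined_in_local_scope(cls: type) -> bool:
    """True if the class, or any of its bases, was defined inside a
    function body: such classes carry "<locals>" in their qualname and
    cannot be found by plain pickle's import-based class lookup."""
    return any("<locals>" in c.__qualname__ for c in cls.__mro__)


def outer():
    class Inner:  # __qualname__ == "outer.<locals>.Inner"
        pass

    return Inner


class TopLevel:
    pass
```

When this check fires, the factory machinery can skip its pickle-oriented fanciness and leave the class to cloudpickle, which serializes the class definition by value instead of by import reference.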
Except pyiron_base, which is already in the main dependencies. This includes a reference to a version of pyiron_workflow that doesn't exist yet -- we can't rely on an existing version because it would give a cyclic dependency error.
github-actions bot (Contributor) commented Aug 8, 2024

Binder 👈 Launch a binder notebook on branch pyiron/pyiron_contrib/git-copy-target

liamhuber (Member, Author)

pyiron_workflow and pyiron_base conflict over pyiron_snippets:

  Could not solve for environment specs
  The following packages are incompatible
  ├─ pyiron_base 0.9.11**  is installable and it requires
  │  └─ pyiron_snippets 0.1.3 , which can be installed;
  └─ pyiron_workflow 0.10.0**  is not installable because there are no viable options
     ├─ pyiron_workflow 0.10.0 would require
     │  └─ pyiron_snippets 0.1.4 , which conflicts with any installable versions previously reported;
     └─ pyiron_workflow 0.10.0 would require
        └─ pyiron_base 0.9.12 , which conflicts with any installable versions previously reported.

I guess I'll go try bumping base
