Rewrite KFP code generation #2993

ptitzler · 2022-10-28T22:23:55Z

This PR:

- rewrites code generation for Kubeflow Pipelines v1 (v2 is still not supported)
- re-adds a previously removed export option to the pipeline editor, enabling output of Python DSL in addition to the already supported YAML output (YAML remains the pre-selected option)
- adds a new optional --format parameter to the elyra-pipeline export CLI command:
  - if the parameter is not specified, the export format defaults to YAML for Kubeflow Pipelines and PY for Apache Airflow
  - if the parameter is specified, it must be one of YAML / PY (Kubeflow Pipelines) or PY (Apache Airflow)
  - the value is processed in a case-insensitive manner, e.g. YAML = yAML = yaml
```
$ elyra-pipeline export one-custom-node.pipeline --runtime-config cloning1 --format py
```
- fixes an existing pipeline export bug (image pull secret information was not exported, rendering the exported pipeline unusable)
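
The format rules above (per-runtime defaults, a restricted set of valid values, case-insensitive matching) can be sketched as follows. This is a minimal illustration and not the code from this PR: `SUPPORTED_FORMATS`, `resolve_export_format`, and the runtime type keys are hypothetical names chosen for the example.

```python
from typing import Optional

# Hypothetical lookup table: supported export formats per runtime type.
# The first entry in each list is treated as the default.
SUPPORTED_FORMATS = {
    "KUBEFLOW_PIPELINES": ["yaml", "py"],
    "APACHE_AIRFLOW": ["py"],
}


def resolve_export_format(runtime_type: str, requested: Optional[str] = None) -> str:
    """Return the normalized export format for the given runtime type."""
    supported = SUPPORTED_FORMATS[runtime_type]
    if requested is None:
        # No --format given: fall back to the runtime-specific default.
        return supported[0]
    # Case-insensitive comparison: YAML, yAML, and yaml are equivalent.
    normalized = requested.strip().lower()
    if normalized not in supported:
        raise ValueError(
            f"Unsupported format '{requested}' for {runtime_type}; "
            f"expected one of: {', '.join(supported)}"
        )
    return normalized
```

Keeping the default as the first list entry means a runtime's default and its valid values are declared in one place.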

Follow-up for tests: #3002

What changes were proposed in this pull request?

The Kubeflow Pipelines processor now always generates Python DSL code as intermediary output when a pipeline is submitted or exported:

- Submit: Internal Elyra pipeline representation -> Python DSL (generated by processor) -> YAML (generated by KFP argo/tekton compiler)
- Export (Python DSL): Internal Elyra pipeline representation -> Python DSL (generated by processor)
- Export (YAML): Internal Elyra pipeline representation -> Python DSL (generated by processor) -> YAML (generated by KFP argo/tekton compiler)

The pipeline documentation was updated to list the new output format.

How was this pull request tested?

- Updated existing CLI tests
- Added new server tests that validate that code generation yields the expected results. There are now dedicated tests for all code generation aspects:
  - generates expected code for the configured workflow engine
  - generates expected code for CRIO environments
  - generates expected code for a plain generic component
  - generates expected code for a plain generic component that uses a runtime image protected by a pull secret
  - generates expected code for a generic component configured with all supported elyra-owned properties
  - generates expected code for a plain custom component
  - generates expected code for data exchange between generic components
- Reviewed the output of make docs

Developer's Certificate of Origin 1.1

   By making a contribution to this project, I certify that:

   (a) The contribution was created in whole or in part by me and I
       have the right to submit it under the Apache License 2.0; or

   (b) The contribution is based upon previous work that, to the best
       of my knowledge, is covered under an appropriate open source
       license and I have the right under that license to submit that
       work with modifications, whether created in whole or in part
       by me, under the same open source license (unless I am
       permitted to submit under a different license), as indicated
       in the file; or

   (c) The contribution was provided directly to me by some other
       person who certified (a), (b) or (c) and I have not modified
       it.

   (d) I understand and agree that this project and the contribution
       are public and that a record of the contribution (including all
       personal information I submit with it, including my sign-off) is
       maintained indefinitely and may be redistributed consistent with
       this project or the open source license(s) involved.

elyra-bot · 2022-10-28T22:23:57Z

Thanks for making a pull request to Elyra!

To try out this branch on binder, follow this link:

elyra/pipeline/kfp/processor_kfp.py

+        """
+        # Load Kubeflow Pipelines Python DSL template
+        loader = PackageLoader("elyra", "templates/kubeflow")
+        template_env = Environment(loader=loader)


elyra/pipeline/kfp/processor_kfp.py

+            generic_component_template = Environment(
+                loader=PackageLoader("elyra", "templates/kubeflow/v1")
+            ).get_template("generic_component_definition_template.jinja2")


akchinSTC

small nits, CRIO options look right and exported configs (volumes and modified bootstrapper options) look good.

docs/source/user_guide/pipelines.md

elyra/pipeline/kfp/processor_kfp.py

Co-authored-by: Alan Chin <akchin@us.ibm.com>

ptitzler · 2022-11-09T19:29:32Z

Based on a discussion during today's dev meeting, I've updated code generation to "unload" the generated Python DSL module after it has been compiled. This change should address the concern that module artifacts might accumulate and increase memory consumption over long periods of time.
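
One way to load a generated module and release it again is sketched below using only the standard library. This is an assumption about the general technique, not the code that was committed: `load_then_release`, the module name, and the `generated_pipeline` function are hypothetical, and calling that function stands in for the actual compilation step.

```python
import importlib.util
import sys
import tempfile
from pathlib import Path


def load_then_release(module_name: str, source_file: Path):
    """Import a generated module, use it once, then drop it from sys.modules."""
    spec = importlib.util.spec_from_file_location(module_name, source_file)
    module = importlib.util.module_from_spec(spec)
    sys.modules[module_name] = module
    try:
        spec.loader.exec_module(module)
        # Stand-in for compiling the pipeline function defined in the module.
        return module.generated_pipeline()
    finally:
        # Release the module so repeated runs do not accumulate entries.
        sys.modules.pop(module_name, None)


with tempfile.TemporaryDirectory() as tmp:
    dsl_file = Path(tmp) / "generated_dsl.py"
    dsl_file.write_text("def generated_pipeline():\n    return 'pipeline-definition'\n")
    result = load_then_release("generated_dsl_demo", dsl_file)
```

Removing the entry in a `finally` block means the module is released even when compilation raises.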

kiersten-stokes

Wow this is awesome! Tested a ton of scenarios and all are turning out as expected 🎉

elyra/templates/kubeflow/v1/python_dsl_template.jinja2

elyra/pipeline/kfp/processor_kfp.py

Co-authored-by: Kiersten Stokes <kierstenstokes@gmail.com>

ptitzler · 2022-11-10T00:16:54Z

To address offline review feedback I've updated the Python DSL template to render a comment that identifies the node name:

    # Task for node 'Download File'
    task_8ee5ee17_1222_434e_bcf4_fd12f43e2510 = factory_8e4384f422a088e4814024df7955e952c1488bd091fa0d4873d5f611d741ceb4(
        url="https://raw.gith...",
    )

This is similar to what is done for Apache Airflow.
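
The shape of the rendered output can be reproduced with a small sketch. It uses plain string formatting as a stand-in for the Jinja2 template, and `render_task` plus its parameters are hypothetical. Deriving the task name by replacing hyphens in the node id with underscores is an assumption inferred from the snippet above.

```python
def render_task(node_label: str, node_id: str, factory_name: str, arguments: dict) -> str:
    """Render a task instantiation preceded by a comment naming the node."""
    # Collapse whitespace so a multi-line label cannot break out of the comment.
    safe_label = " ".join(node_label.split())
    # Assumption: the task variable name is derived from the node id.
    task_name = "task_" + node_id.replace("-", "_")
    lines = [
        f"# Task for node '{safe_label}'",
        f"{task_name} = {factory_name}(",
    ]
    for key, value in arguments.items():
        # repr() yields a valid Python literal for strings and numbers.
        lines.append(f"    {key}={value!r},")
    lines.append(")")
    return "\n".join(lines)


rendered = render_task(
    "Download File",
    "8ee5ee17-1222-434e-bcf4-fd12f43e2510",
    "factory_example",
    {"url": "https://example.org/data.csv"},
)
```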

Signed-off-by: Patrick Titzler <ptitzler@us.ibm.com>

Rewrite KFP code generation

85ad80c

ptitzler added the kind:enhancement label Oct 28, 2022

ptitzler added this to the 3.13.0 milestone Oct 28, 2022

ptitzler marked this pull request as draft October 28, 2022 22:24

ptitzler added the component:pipeline-runtime and platform: pipeline-Kubeflow labels Oct 28, 2022

github-advanced-security bot found potential problems Oct 28, 2022

ptitzler mentioned this pull request Oct 28, 2022

Add support for pipeline parameters #2994

Closed

ptitzler added the status:Work in Progress label Oct 28, 2022

ptitzler added 11 commits October 28, 2022 17:28

Fix issues that surfaced in CLI test

2857304

- Missing input data type - Where necessary use quotes around values - Fix incorrect debug statement

Fix bugs related to generic op inputs and outputs

6601fd3

Add missing tests and fix uncovered bug

b0ae294

Add code-gen test for custom component pipeline

8a81c0e

Add debug to failing server test

9c9befc

Update CLI and related tests

23aea4a

Fix linting

4fff5c5

Fix bugs and tests

bc10c23

Try to fix failing cypress tests

91db2e9

Rename cypress test pipeline

c94db93

Move templates and fix generic op bug

7ace98b

github-advanced-security bot found potential problems Nov 2, 2022

ptitzler added 2 commits November 2, 2022 10:39

Update documentation

1aad138

Fix failing server test

19ce9b3

ptitzler marked this pull request as ready for review November 2, 2022 19:13

Merge branch 'main' of github.com:ptitzler/elyra into rewrite-kfp-cod…

40e0f10

…e-gen

ptitzler requested review from kiersten-stokes and kevin-bates November 2, 2022 21:19

ptitzler added 2 commits November 2, 2022 16:58

Introduce new code gen test for generic pipeline components

d34b4a8

Update code gen tests for generic components

fee155e

ptitzler added 4 commits November 3, 2022 16:41

Finalize first code gen test

3363155

Update code gen tests and related assets

baf51e5

Fix failing tests, add stubs for WIP tests, and use enum

7471a22

Implement pipeline conf test and fix related issues

893ef96

kiersten-stokes mentioned this pull request Nov 7, 2022

Add support for pipeline parameters #3001

Merged

20 tasks

Add test for CRIO env and fix related issues

b4ded74

ptitzler requested a review from akchinSTC November 7, 2022 23:27

Tweak implementation

3861821

ptitzler removed the status:Work in Progress label Nov 8, 2022

ptitzler mentioned this pull request Nov 8, 2022

Add more KFP code generation tests #3002

Closed

akchinSTC reviewed Nov 8, 2022

docs/source/user_guide/pipelines.md (outdated, resolved)

elyra/pipeline/kfp/processor_kfp.py (resolved)

ptitzler and others added 2 commits November 9, 2022 07:00

Update docs/source/user_guide/pipelines.md

6e0a306

Co-authored-by: Alan Chin <akchin@us.ibm.com>

Release module after compilation

bd5c6e8

kiersten-stokes approved these changes Nov 9, 2022

elyra/templates/kubeflow/v1/python_dsl_template.jinja2 (outdated, resolved)

elyra/pipeline/kfp/processor_kfp.py (outdated, resolved)

ptitzler and others added 4 commits November 9, 2022 16:04

Add comment to task instance in generated Python DSL

1a38194

Update elyra/templates/kubeflow/v1/python_dsl_template.jinja2

9fa23b4

Co-authored-by: Kiersten Stokes <kierstenstokes@gmail.com>

Update elyra/pipeline/kfp/processor_kfp.py

b418d6b

Co-authored-by: Kiersten Stokes <kierstenstokes@gmail.com>

Fix linting errors that were caused by accepting a proposed change in…

c897c4c

… the GitHub UI

Fix codeql issue

337e3fc

Signed-off-by: Patrick Titzler <ptitzler@us.ibm.com>

akchinSTC approved these changes Nov 10, 2022

akchinSTC merged commit e682ef4 into elyra-ai:main Nov 11, 2022

ptitzler mentioned this pull request Nov 30, 2022

lint Jupyter notebooks #3031

Merged
