Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Asset op_tags are not propagated to step when materialization runs using celery-k8s executor #26720

Open
alekseik1 opened this issue Dec 24, 2024 · 4 comments
Labels
area: tags Related to tagging and labeling deployment: k8s Related to deploying Dagster to Kubernetes type: bug Something isn't working

Comments

@alekseik1
Copy link
Contributor

What's the issue?

Asset materialization can require special pods (e.g. with particular node_selector in k8s). Plugins like dagster-celery-k8s allow for such customization via dagster-k8s/config tag.
However, setting op_tags in @asset decorator does not propagate this tag to steps during run when running from UI.
Running from schedules seem to propagate these tags.

What did you expect to happen?

Setting key "dagster-k8s/config" in op_tags would customize pod creation behavior (e.g. "node_selector") when running from UI (for instance, when clicking "Materialize" button in asset tab).

How to reproduce?

Make a simple asset definition:

from dagster import AssetExecutionContext, asset


@asset(
    op_tags={
        "dagster-celery/queue": "default",
        "dagster-k8s/config": {
            "job_spec_config": {"ttl_seconds_after_finished": 600},
            "pod_spec_config": {
                "node_selector": {"dedicated": "pipelines"},
                "tolerations": [
                    {
                        "effect": "NoSchedule",
                        "key": "dedicated",
                        "operator": "Equal",
                        "value": "pipelines",
                    }
                ],
            },
        }
    }
)
def small_asset_with_tags(context: AssetExecutionContext):
    raise ValueError(f"current run tags: {context.run_tags}")

Go to Dagster UI, find asset named small_asset_with_tags, click "Materialize", wait for pod to be created. Check the pod configuration via kubectl get pod -oyaml POD_NAME | grep dedicated - it will not find such line.

Also, you would not find tag key "dagster-k8s/config" in job logs.

Dagster version

dagster, version 1.7.16

Deployment type

Dagster Helm chart

Deployment details

Deployment in ArgoCD with Celery workers, I use charts from official repo.

Additional information

No response

Message from the maintainers

Impacted by this issue? Give it a 👍! We factor engagement into prioritization.
By submitting this issue, you agree to follow Dagster's Code of Conduct.

@alekseik1 alekseik1 added the type: bug Something isn't working label Dec 24, 2024
@garethbrickman garethbrickman added deployment: k8s Related to deploying Dagster to Kubernetes area: tags Related to tagging and labeling labels Dec 26, 2024
@garethbrickman
Copy link
Contributor

To use the op_tags specified on the asset itself would require using the k8s_job_executor as it's necessary to materialize each asset in its own pod instead of the 1 run = 1 pod that the multiprocess executor does (the latter is the default behavior because it's most cost efficient for most use cases).

In the docs here "step" and "op" can be thought of as analogous to "asset materialization".

You can specify a different default executor for all jobs and assets provided to a code location, otherwise you would need to define the particular executor to use on a per job basis instead.

@ion-elgreco
Copy link
Contributor

@garethbrickman let's close this :) it's working fine if you set the executor correctly

@alekseik1
Copy link
Contributor Author

To use the op_tags specified on the asset itself would require using the k8s_job_executor as it's necessary to materialize each asset in its own pod instead of the 1 run = 1 pod that the multiprocess executor does (the latter is the default behavior because it's most cost efficient for most use cases).

In the docs here "step" and "op" can be thought of as analogous to "asset materialization".

You can specify a different default executor for all jobs and assets provided to a code location, otherwise you would need to define the particular executor to use on a per job basis instead.

Hi, and sorry for late response! We're using celery k8s executor everywhere, no job is run with multiprocess executor. So, it would seem like celery-k8s package has a bug as described in original post.

@ion-elgreco
Copy link
Contributor

@alekseik1 there might be a discrepancy then between k8s and celery-k8s executors, because the k8s executor passes the op tags properly to k8s

@garethbrickman garethbrickman changed the title Asset op_tags are not propagated to step when materialization runs Asset op_tags are not propagated to step when materialization runs using celery-k8s executor Jan 17, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area: tags Related to tagging and labeling deployment: k8s Related to deploying Dagster to Kubernetes type: bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants