Skip to content

Conversation

@schustmi
Copy link
Contributor

@schustmi schustmi commented Dec 17, 2025

Describe changes

  • Environment variable that allows configuring how many workers the DAG runner uses
  • Fetch hydrated cache candidate to avoid extra request
  • Pass step runs cache to avoid unnecessary requests
  • If step pod failed during startup, publish a failed step run

Pre-requisites

Please ensure you have done the following:

  • I have read the CONTRIBUTING.md document.
  • I have added tests to cover my changes.
  • I have based my new branch on develop and the open PR is targeting develop. If your branch wasn't based on develop read Contribution guide on rebasing branch to develop.
  • IMPORTANT: I made sure that my changes are reflected properly in the following resources:
    • ZenML Docs
    • Dashboard: Needs to be communicated to the frontend team.
    • Templates: Might need adjustments (that are not reflected in the template tests) in case of non-breaking changes and deprecations.
    • Projects: Depending on the version dependencies, different projects might get affected.

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Other (add details above)

@schustmi schustmi added the release-notes Release notes will be attached and used publicly for this PR. label Dec 17, 2025
@github-actions github-actions bot added internal To filter out internal PRs and issues enhancement New feature or request labels Dec 17, 2025
@schustmi schustmi force-pushed the feature/more-efficient-kubernetes-orchestrator-caching branch from 2d3a0c1 to f475b79 Compare December 17, 2025 16:59
@schustmi schustmi force-pushed the feature/more-efficient-kubernetes-orchestrator-caching branch from f475b79 to 05191e7 Compare December 18, 2025 08:24
@github-actions
Copy link
Contributor

github-actions bot commented Jan 2, 2026

⚠️ This PR has been inactive for 2 weeks and has been marked as stale.
Timeline:

  • Week 2 (now): First reminder - PR marked as stale
  • Week 4: PR will be automatically closed if no activity
    Please update this PR or leave a comment to keep it active. Any activity will reset the timer and remove the stale label.

@github-actions github-actions bot added the stale label Jan 2, 2026
@github-actions
Copy link
Contributor

github-actions bot commented Jan 2, 2026

LLM Finetuning template updates in examples/llm_finetuning have been pushed.

@github-actions
Copy link
Contributor

github-actions bot commented Jan 2, 2026

Classification template updates in examples/mlops_starter have been pushed.

@github-actions
Copy link
Contributor

github-actions bot commented Jan 2, 2026

E2E template updates in examples/e2e have been pushed.

@github-actions github-actions bot removed the stale label Jan 3, 2026
@schustmi schustmi changed the title Improve Kubernetes orchestrator caching efficiency Kubernetes orchestrator caching efficiency + reliability Jan 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request internal To filter out internal PRs and issues release-notes Release notes will be attached and used publicly for this PR.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants