Skip to content

Conversation

@AlexejPenner
Copy link
Contributor

@AlexejPenner AlexejPenner commented Nov 28, 2025

Describe changes

I added a section per deployment scenario - https://zenml-io.gitbook.io/alexej/zenml-pro

Pre-requisites

Please ensure you have done the following:

  • I have read the CONTRIBUTING.md document.
  • I have added tests to cover my changes.
  • I have based my new branch on develop and the open PR is targeting develop. If your branch wasn't based on develop read Contribution guide on rebasing branch to develop.
  • IMPORTANT: I made sure that my changes are reflected properly in the following resources:
    • ZenML Docs
    • Dashboard: Needs to be communicated to the frontend team.
    • Templates: Might need adjustments (that are not reflected in the template tests) in case of non-breaking changes and deprecations.
    • Projects: Depending on the version dependencies, different projects might get affected.

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Other (add details above)

@AlexejPenner AlexejPenner changed the base branch from develop to docs/pro-vs-oss November 28, 2025 10:33
@AlexejPenner AlexejPenner requested a review from htahir1 November 28, 2025 10:33
@github-actions github-actions bot added internal To filter out internal PRs and issues documentation Improvements or additions to documentation labels Nov 28, 2025
@github-actions
Copy link
Contributor

github-actions bot commented Nov 28, 2025

Documentation Link Check Results

Absolute links check failed
There are broken absolute links in the documentation. See workflow logs for details
Relative links check failed
There are broken relative links in the documentation. See workflow logs for details
Last checked: 2025-12-05 10:01:47 UTC

@github-actions
Copy link
Contributor

github-actions bot commented Nov 28, 2025

🔍 Broken Links Report

Summary

  • 📁 Files with broken links: 3
  • 🔗 Total broken links: 3
  • 📄 Broken markdown links: 2
  • 🖼️ Broken image links: 1
  • ⚠️ Broken reference placeholders: 0

Details

File Link Type Link Text Broken Path
zenml-pro/hybrid-deployment-helm.md 📄 "Set up users and teams" ../organization.md
zenml-pro/self-hosted-deployment.md 🖼️ "Self-hosted deployment architecture" ../../.gitbook/assets/air-gapped-architecture.png
zenml-pro/hybrid-deployment-ecs.md 📄 "Set up users and teams" ../organization.md
📂 Full file paths
  • /home/runner/work/zenml/zenml/scripts/../docs/book/getting-started/zenml-pro/hybrid-deployment-helm.md
  • /home/runner/work/zenml/zenml/scripts/../docs/book/getting-started/zenml-pro/self-hosted-deployment.md
  • /home/runner/work/zenml/zenml/scripts/../docs/book/getting-started/zenml-pro/hybrid-deployment-ecs.md

Comment on lines 72 to 93
1. **Code Execution**: You write code and run pipelines with your client SDK using Python
2. **Authentication & Token Acquisition**:
- Users authenticate via your internal identity provider (LDAP/AD/OIDC)
- The ZenML Pro control plane (running in your infrastructure) handles authentication and RBAC
- The ZenML client fetches short-lived tokens from your ZenML workspace for:
- Pushing Docker images to your container registry
- Communicating with your artifact store
- Submitting workloads to your orchestrator
- *Note: Your local Python environment needs the client libraries for your stack components*
3. **Authorization**: RBAC policies enforced by your control plane before token issuance
4. **Image & Workload Submission**: The client pushes Docker images (and optionally code if no code repository is configured) to your container registry, then submits the workload to your orchestrator
5. **Orchestrator Execution**: In the orchestrator environment within your infrastructure:
- The Docker image is pulled from your container registry
- Within the pipeline/step entrypoint, the necessary code is pulled in
- A connection to your ZenML workspace is established
- The relevant pipeline/step code is executed
6. **Runtime Data Flow**: During execution (all within your infrastructure):
- Pipeline and step run metadata is logged to your ZenML workspace
- Logs are streamed to your log backend
- Artifacts are written to your artifact store
- Metadata pointing to these artifacts is persisted in your workspace
7. **Observability**: The ZenML Pro dashboard (running in your infrastructure) connects to your workspace and uses all persisted metadata to provide you with a complete observability plane
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@stefannica fact check pls


The diagram above illustrates a complete air-gapped ZenML Pro deployment with all components running within your organization's VPC. This architecture ensures zero external communication while providing full enterprise MLOps capabilities.

### Architecture Components
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@stefannica fact check pls

- **Backup sites** for disaster recovery
- **Monitoring and alerting** for all components

## Pre-requisites
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@stefannica fact check pls

@AlexejPenner
Copy link
Contributor Author

https://zenml-io.gitbook.io/alexej/zenml-pro - view here to see it in action

Copy link
Contributor

@htahir1 htahir1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think its good for a first round. many comments apply to many pages

Comment on lines 59 to 60
-**Vulnerability Assessment Reports** available on request
-**Software Bill of Materials (SBOM)** available on request
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@stefannica should verify this


All three deployment scenarios follow a similar pipeline execution pattern, with differences in where authentication happens and where data resides:

### Standard Data Flow Steps
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This definitely needs a diagram

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agreed - we might even have one laying around somewhere


**SaaS**: Metadata is stored in ZenML infrastructure. Your ML data and compute remain in your infrastructure.

**Hybrid**: Metadata and control plane are split — authentication/RBAC happens at ZenML control plane, but all run metadata, artifacts, and compute stay in your infrastructure.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thnk the authentication bit is the most important here and isnt really elaborated but maybe it is later?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What more would you like to know about this at this stage?


You control this access by configuring appropriate cloud IAM permissions.

## Getting Started
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IMO super strnage to have this whole section here...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the whole section ? maybe we dont need the example pipeline - butt i like how it shows how quickly youi're ready

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm really? its in the dashboard already when you sign up

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

well somebody in the docs here wants to know what complexity awaits them - "Is it worth my time?"

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

im not sure tbh

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

in my experience these are the questions we get very early on

@github-actions
Copy link
Contributor

github-actions bot commented Dec 3, 2025

Images automagically compressed by Calibre's image-actions

Compression reduced images by 34%, saving 132.21 KB.

Filename Before After Improvement Visual comparison
docs/book/getting-started/zenml-pro/.gitbook/assets/pro-workload-managers.png 388.76 KB 256.55 KB -34.0% View diff

383 images did not require optimisation.

Update required: Update image-actions configuration to the latest version before 1/1/21. See README for instructions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation internal To filter out internal PRs and issues

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants