From d0a85b47f419fd936a0290641498fa671055ea67 Mon Sep 17 00:00:00 2001 From: Shirshanka Das Date: Mon, 15 Apr 2024 23:30:18 -0700 Subject: [PATCH 1/4] docs: add content describing diff between datahub and acryl datahub --- .../managed-datahub-overview.md | 134 +++++++++++++++++- 1 file changed, 130 insertions(+), 4 deletions(-) diff --git a/docs/managed-datahub/managed-datahub-overview.md b/docs/managed-datahub/managed-datahub-overview.md index 79ffc2ce98451..21a764af9214e 100644 --- a/docs/managed-datahub/managed-datahub-overview.md +++ b/docs/managed-datahub/managed-datahub-overview.md @@ -1,8 +1,134 @@ -# Managed DataHub Exclusives +# How Acryl DataHub compares to DataHub -Acryl DataHub offers a slew of additional features on top of the normal OSS project. +DataHub is the #1 open source metadata platform for developers. +Acryl DataHub takes DataHub to the next level by offering features that allow +you to roll out the product to the entire organization beyond your central data +platform team. -## Chrome Extension +We’ve organized this comparison using a few categories: +* Search and Discovery: the ability to discover data assets at your organization and understand relationships between them +* Data Governance: features that help you govern the crown jewels of your organization, and trim out the datasets that seem to grow like weeds when no one’s looking +* Data Quality: features that help you ensure your data pipelines are producing high quality assets, and if they’re not, you’re the first to know +* Enterprise Grade: features needed to roll out at scale to large enterprises +* Implementation and Support: features related to ease of deployment and maintenance + +## How to read this list + +Not a fan of checkboxes? Neither are we. +As a result we simply color coded this list to give you a simple visual representation of things that are present in these platforms. +* Features available in *both* open source DataHub and Acryl DataHub are in green, +* Features available only in Acryl DataHub are in purple, and +* Features that are only available in open source DataHub are in blue. + +## Search and Discovery + + +Integrations for 50+ Data Sources
+Table level, Column-level, Job-level lineage
+Search across all metadata (technical, operational, business)
+
+ + +Usage and graph-based search ranking
+
+ + +Table and column-level lineage and impact analysis
+Support for domains, data products, data contracts
+Developer friendly experiences (for data engineers, AI engineers, etc.)
+
+ +Business User friendly experiences (for data analysts, BI analysts, data +governance leads, PMs, etc.)
+Personalization across the product
+BI tool browser extension
+No-code metadata automations
+Slack integration
+Subscribe to changes and notify affected owners
+
+ +## Data Governance + + +Shift-Left governance
+Dataset ownership management
+Business glossary basics
+
+ + +Human-assisted Asset Certification Workflows (data owners, stewards)
+Computational Governance standards as continuous tests
+Approval Workflows - Business glossary modifications
+Approval Workflows - Associating glossary terms with assets
+Approval Workflows - Ownership
+
+ +## Data Observability + + +Surface data quality results
+
+ + +Build and enforce continuous data SLAs
+Continuous monitoring of dataset health
+Data observability alerts and notifications
+Data Incident management
+
+ +## Enterprise Grade + + +Battle-tested open source metadata platform
+Metadata change events as a real-time stream
+
+ + +99.5% Uptime SLA
+Data Lake and Data Warehouse off-ramp for metadata analytics
+Fine-grained access control policies (e.g., default DENY)
+Pre-defined roles for permissions
+Shared audit logs
+
+ + +## Implementation and Support + + +Your own engineering team
+Your private fork of DataHub
+
+ + +Community support
+
+ + +Cloud-hosted instance (AWS, GCP, BYOC)
+Support for AWS PrivateLink, IP address restrictions, etc.
+Accelerators for your code contributions to DataHub
+Dedicated customer success team
+Implementation Support
+Ingestion Support
+Dedicated Slack Support
+
+ +## More Questions? + +Have more questions and want to talk to someone? Fill out +the form using the link below, and someone from the Acryl team will reach +out to set up a chat. + + + Learn about Acryl DataHub + + + From 5b4ca469f6ceb2f6f27784f6e6463dfb33100819 Mon Sep 17 00:00:00 2001 From: Shirshanka Das Date: Tue, 16 Apr 2024 22:27:24 -0700 Subject: [PATCH 2/4] review comments --- .../managed-datahub-overview.md | 162 ++++++++---------- 1 file changed, 68 insertions(+), 94 deletions(-) diff --git a/docs/managed-datahub/managed-datahub-overview.md b/docs/managed-datahub/managed-datahub-overview.md index 21a764af9214e..34792dd5c5438 100644 --- a/docs/managed-datahub/managed-datahub-overview.md +++ b/docs/managed-datahub/managed-datahub-overview.md @@ -1,117 +1,91 @@ # How Acryl DataHub compares to DataHub DataHub is the #1 open source metadata platform for developers. + Acryl DataHub takes DataHub to the next level by offering features that allow you to roll out the product to the entire organization beyond your central data platform team. -We’ve organized this comparison using a few categories: -* Search and Discovery: the ability to discover data assets at your organization and understand relationships between them -* Data Governance: features that help you govern the crown jewels of your organization, and trim out the datasets that seem to grow like weeds when no one’s looking -* Data Quality: features that help you ensure your data pipelines are producing high quality assets, and if they’re not, you’re the first to know -* Enterprise Grade: features needed to roll out at scale to large enterprises -* Implementation and Support: features related to ease of deployment and maintenance - -## How to read this list - -Not a fan of checkboxes? Neither are we. -As a result we simply color coded this list to give you a simple visual representation of things that are present in these platforms. -* Features available in *both* open source DataHub and Acryl DataHub are in green, -* Features available only in Acryl DataHub are in purple, and -* Features that are only available in open source DataHub are in blue. +We’ve organized this comparison using a few categories. We love well-organized +checklists and we hope you love them too! ## Search and Discovery - - -Integrations for 50+ Data Sources
-Table level, Column-level, Job-level lineage
-Search across all metadata (technical, operational, business)
-
- - -Usage and graph-based search ranking
-
- - -Table and column-level lineage and impact analysis
-Support for domains, data products, data contracts
-Developer friendly experiences (for data engineers, AI engineers, etc.)
-
- -Business User friendly experiences (for data analysts, BI analysts, data -governance leads, PMs, etc.)
-Personalization across the product
-BI tool browser extension
-No-code metadata automations
-Slack integration
-Subscribe to changes and notify affected owners
-
+Features aimed at making it easy to discover data assets at your organization and understand relationships between them. + +| Feature | DataHub | Acryl DataHub | +| ---------------------------------------------- | ------- | ------------- | +| Integrations for 50+ data sources | ✅ | ✅ | +| Table level, Column-level, Job-level lineage | ✅ | ✅ | +| Search across all metadata (technical, operational, business) | ✅ | ✅ | +| Usage and graph-based search ranking | ❌ | ✅ | +| Table and column-level lineage and impact analysis | ✅ | ✅ | +| Support for domains, data products, data contracts | ✅ | ✅ | +| Developer friendly experiences (for data engineers, AI engineers, etc.) | ✅ | ✅ | +| Business User friendly experiences (for data analysts, BI analysts, data governance leads, PMs etc.) | ❌ | ✅ | +| Personalization across the product | ❌ | ✅ | +| Browser extension for BI Tools | ❌ | ✅ | +| No-code metadata automations for automated enrichment | ❌ | ✅ | +| Slack integration | ❌ | ✅ | +| Subscribe to assets, activity, and notifications | ❌ | ✅ | ## Data Governance - - -Shift-Left governance
-Dataset ownership management
-Business glossary basics
-
- - -Human-assisted Asset Certification Workflows (data owners, stewards)
-Computational Governance standards as continuous tests
-Approval Workflows - Business glossary modifications
-Approval Workflows - Associating glossary terms with assets
-Approval Workflows - Ownership
-
+Features that help you govern the crown jewels of your organization, and trim +out the datasets that seem to grow like weeds when no one’s looking. + +| Feature | DataHub | Acryl DataHub | +| ---------------------------------------------- | ------- | ------------- | +| Shift-Left governance | ✅ | ✅ | +| Dataset ownership management | ✅ | ✅ | +| Business glossary basics | ✅ | ✅ | +| Human-assisted Asset Certification Workflows (data owners, stewards) | ❌ | ✅ | +| Computational Governance standards as continuous tests | ❌ | ✅ | +| Approval Workflows - Business glossary modifications | ❌ | ✅ | +| Approval Workflows - Associating glossary terms with assets | ❌ | ✅ | +| Approval Workflows - Ownership | ❌ | ✅ | ## Data Observability +Features that help you ensure your data pipelines are producing high quality +assets, and if they’re not, making sure you and impacted users are the first to +know. - -Surface data quality results
-
+| Feature | DataHub | Acryl DataHub | +| ---------------------------------------------- | ------- | ------------- | +| Surface data quality results | ✅ | ✅ | +| Build and enforce continuous data SLAs | ❌ | ✅ | +| Continuous monitoring of dataset health | ❌ | ✅ | +| Data observability alerts and notifications | ❌ | ✅ | +| Data Incident management | ❌ | ✅ | - -Build and enforce continuous data SLAs
-Continuous monitoring of dataset health
-Data observability alerts and notifications
-Data Incident management
-
## Enterprise Grade +Features needed to roll out at scale to large enterprises. - -Battle-tested open source metadata platform
-Metadata change events as a real-time stream
-
- - -99.5% Uptime SLA
-Data Lake and Data Warehouse off-ramp for metadata analytics
-Fine-grained access control policies (e.g., default DENY)
-Pre-defined roles for permissions
-Shared audit logs
-
+| Feature | DataHub | Acryl DataHub | +| ---------------------------------------------- | ------- | ------------- | +| Battle-tested open source metadata platform | ✅ | ✅ | +| Metadata change events as a real-time stream | ✅ | ✅ | +| 99.5% Uptime SLA | ❌ | ✅ | +| Data Lake and Data Warehouse off-ramp for metadata analytics | ❌ | ✅ | +| Fine-grained access control policies (e.g., default DENY) | ❌ | ✅ | +| Pre-defined roles for permissions | ❌ | ✅ | +| Shared audit logs | ❌ | ✅ | ## Implementation and Support - - -Your own engineering team
-Your private fork of DataHub
-
- - -Community support
-
- - -Cloud-hosted instance (AWS, GCP, BYOC)
-Support for AWS PrivateLink, IP address restrictions, etc.
-Accelerators for your code contributions to DataHub
-Dedicated customer success team
-Implementation Support
-Ingestion Support
-Dedicated Slack Support
-
+Features related to ease of deployment and maintenance. + +| Feature | DataHub | Acryl DataHub | +| ---------------------------------------------- | ------- | ------------- | +| Community support | ✅ | ✅ | +| Your own engineering team | ✅ | ❌ (they can instead focus on high-value work like contributing features to the open source product, or build amazing data applications with the APIs!)| +| Your private fork of DataHub | ✅ | ❌ (you won't need to manage and maintain your own fork, upgrade to latest releases etc.)| +| Accelerators for your code contributions to DataHub | ❌ | ✅ | +| Cloud-hosted instance (AWS, GCP, BYOC) | ❌ | ✅ | +| Support for AWS PrivateLink, IP address restrictions, etc. | ❌ | ✅ | +| Dedicated customer success team | ❌ | ✅ | +| Implementation Support | ❌ | ✅ | +| Ingestion Support | ❌ | ✅ | +| Dedicated Slack Support | ❌ | ✅ | ## More Questions? @@ -119,7 +93,7 @@ Have more questions and want to talk to someone? Fill out the form using the link below, and someone from the Acryl team will reach out to set up a chat. - + Learn about Acryl DataHub From 365f62eeae59edd2b1b3f85fb022c1be143532b9 Mon Sep 17 00:00:00 2001 From: Shirshanka Das Date: Tue, 16 Apr 2024 23:48:05 -0700 Subject: [PATCH 3/4] Update managed-datahub-overview.md Co-authored-by: Hyejin Yoon <0327jane@gmail.com> --- docs/managed-datahub/managed-datahub-overview.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/managed-datahub/managed-datahub-overview.md b/docs/managed-datahub/managed-datahub-overview.md index 34792dd5c5438..464ba3b70f0ce 100644 --- a/docs/managed-datahub/managed-datahub-overview.md +++ b/docs/managed-datahub/managed-datahub-overview.md @@ -78,7 +78,7 @@ Features related to ease of deployment and maintenance. | ---------------------------------------------- | ------- | ------------- | | Community support | ✅ | ✅ | | Your own engineering team | ✅ | ❌ (they can instead focus on high-value work like contributing features to the open source product, or build amazing data applications with the APIs!)| -| Your private fork of DataHub | ✅ | ❌ (you won't need to manage and maintain your own fork, upgrade to latest releases etc.)| +| Your private fork of DataHub | ✅ | ❌ (You won't need to manage and maintain your own fork, upgrade to latest releases etc.)| | Accelerators for your code contributions to DataHub | ❌ | ✅ | | Cloud-hosted instance (AWS, GCP, BYOC) | ❌ | ✅ | | Support for AWS PrivateLink, IP address restrictions, etc. | ❌ | ✅ | From 4be526c101782d9ed68704ffe1688fb4c4746ca0 Mon Sep 17 00:00:00 2001 From: Shirshanka Das Date: Tue, 16 Apr 2024 23:48:16 -0700 Subject: [PATCH 4/4] Update managed-datahub-overview.md Co-authored-by: Hyejin Yoon <0327jane@gmail.com> --- docs/managed-datahub/managed-datahub-overview.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/managed-datahub/managed-datahub-overview.md b/docs/managed-datahub/managed-datahub-overview.md index 464ba3b70f0ce..c463da7e01678 100644 --- a/docs/managed-datahub/managed-datahub-overview.md +++ b/docs/managed-datahub/managed-datahub-overview.md @@ -77,7 +77,7 @@ Features related to ease of deployment and maintenance. | Feature | DataHub | Acryl DataHub | | ---------------------------------------------- | ------- | ------------- | | Community support | ✅ | ✅ | -| Your own engineering team | ✅ | ❌ (they can instead focus on high-value work like contributing features to the open source product, or build amazing data applications with the APIs!)| +| Your own engineering team | ✅ | ❌ (They can instead focus on high-value work like contributing features to the open source product, or build amazing data applications with the APIs!)| | Your private fork of DataHub | ✅ | ❌ (You won't need to manage and maintain your own fork, upgrade to latest releases etc.)| | Accelerators for your code contributions to DataHub | ❌ | ✅ | | Cloud-hosted instance (AWS, GCP, BYOC) | ❌ | ✅ |