Replies: 1 comment 1 reply
-
I'm definitely in favor of the easier option for now. It looks like the 3rd party tools are really sleek with some potentially super useful features, but we also have lots of other big infrastructure work to do right now and the client side search seems "good enough" for the time being. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Problem
We have 227 tables and several thousand columns in our data warehouse. While we've attempted to provide some structure via the
raw
/core
/out
naming scheme, it's still confusing and overwhelming to figure out whether we have data that's relevant to someone's needs.Currently the only real option to answer "do you have data I want?" is to go into the Data Dictionary page and do a browser-based search.
Options
3rd party data catalog
We could spin up a service like Open Metadata or DataHub, which have many delicious features that help us:
etc.
These mostly operate on the following model:
Client-side search
We could also throw some client-side search at the data dictionary. This lets people do a full-text search of our data dictionary & do various filtering, without spinning up any new services.
The model would be:
Here's a prototype video & the code lives on the
docs-search
branch.Analysis
Here's a very simplistic analysis. Overall, I think the client-side search will meet our needs and be simpler to set up as well as maintain - but open to further discussion!
3rd party data catalog
Pros:
Cons:
Client-side search
Pros:
Cons:
Beta Was this translation helpful? Give feedback.
All reactions