-
-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve the way we find and load a dataset/table/indicator #2648
Comments
Some background on why
In principal, we can make whatever we like here, so we should just get the interface right and then solve the (trivial) code to make it work smoothly. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
I'd keep this open for a bit longer, although it doesn't seem to be a priority at the moment. |
We still want this, though it might be a part of a larger API redesign. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Summary
We need a better way to quickly access an ETL dataset/table/indicator from within Python.
Goal: Improve our
catalog.find*
function(s) (or create other another function).Current workflow to search and load data
Currently, every time I need to access an ETL table (for example, to do a quick check) I need to go through one or more of the following steps:
Then, once you know which dataset/table/indicator you need, you have to write quite a lot of code to load it, e.g.:
Note that here there are multiple things that need to be manually given (the namespace, version, short name of the dataset, and short name of the table).
Then, if I need a specific indicator, I'd need to do all this, plus then something like
[c for c in tb.columns if "solar" in c]
and then figure out which indicator I need.Overall, it takes a few minutes to "quickly" access some data, which is not ideal.
Issues with
catalog.find*
We currently have a
catalog.find
(and other related functions) that are supposed to help with this, but this function has issues. From the top of my head:So, in practice,
catalog.find
is not useful. Therefore, we need a better function to quickly find and access our data. Superusers would also benefit from these improvements.The text was updated successfully, but these errors were encountered: