-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Replace Pelican with Cardinal in pipeline (and make detailed coverage available) #291
Comments
Edit: Moved tangential comment to #292 |
From Pelican we get field counts and also some collection metadata. We can get the latter via an HTTP request to Kingfisher Process in the Process task's |
|
Sentry Issue: REGISTRY-PELICAN-FRONTEND-B |
I linked a Sentry issue where a Pelican API request is quite slow on some collections (20s). |
We can maybe also (semi-)automate the |
Once that's done, we can simplify the field list notebook: https://github.com/open-contracting/notebooks-ocds/blob/main/component_get_field_list_all_registry.ipynb |
Or, add a button somewhere in the data registry to export the list of fields from all publications (if there is demand for that) |
We are only using Pelican for field coverage, for which Cardinal is much faster.
We can store the output as part of the job, and make it available as part of the API in #268. We can also consider designing a report for the dataset's page, where a user can opt to view the detailed coverage.
We can then also use the output to either report:
The text was updated successfully, but these errors were encountered: