Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clean-up and generalize TDR source specs #6426

Closed
hannes-ucsc opened this issue Jul 19, 2024 · 1 comment
Closed

Clean-up and generalize TDR source specs #6426

hannes-ucsc opened this issue Jul 19, 2024 · 1 comment
Assignees
Labels
- [priority] Medium code [subject] Production code debt [type] A defect incurring continued engineering cost demo [process] To be demonstrated at the end of the sprint demoed [process] Successfully demonstrated to team enh [type] New feature or request orange [process] Done by the Azul team

Comments

@hannes-ucsc
Copy link
Member

Current spec format:

tdr:datarepo-dev-6fdac3db:snapshot/hca_dev_e8808cc84ca0409680f2bba73600cba6__20210902_20210907:/1

  1. Rename TDRSourceSpec.project to TDRSourceSpec.subdomain.

  2. Remove is_snapshot field (dataset/ or snapshot/) and remove support for /dataset:

    tdr:datarepo-dev-6fdac3db:hca_dev_e8808cc84ca0409680f2bba73600cba6__20210902_20210907:/1

  3. Add type field (enum of bigquery or parquet, but reject parquet for now):

    tdr:bigquery:datarepo-dev-6fdac3db:hca_dev_e8808cc84ca0409680f2bba73600cba6__20210902_20210907:/1

  4. Add domain field (enum of gcp or azure, but reject azure for now) and reject the combination of type bigquery and provider azure:

    tdr:bigquery:gcp:datarepo-dev-6fdac3db:hca_dev_e8808cc84ca0409680f2bba73600cba6__20210902_20210907:/1

The order of the attributes of TDRSourceSpec should be: prefix (inherited), type, domain, subdomain and name.

@hannes-ucsc hannes-ucsc added orange [process] Done by the Azul team enh [type] New feature or request debt [type] A defect incurring continued engineering cost code [subject] Production code - [priority] Medium labels Jul 19, 2024
@nadove-ucsc nadove-ucsc added the demo [process] To be demonstrated at the end of the sprint label Aug 3, 2024
@nadove-ucsc
Copy link
Contributor

For demo, show new source specs in the service response. Show that the update_subgraph_counts.py script still works. Try filtering an index endpoint using a source spec.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
- [priority] Medium code [subject] Production code debt [type] A defect incurring continued engineering cost demo [process] To be demonstrated at the end of the sprint demoed [process] Successfully demonstrated to team enh [type] New feature or request orange [process] Done by the Azul team
Projects
None yet
Development

No branches or pull requests

2 participants