-
Notifications
You must be signed in to change notification settings - Fork 597
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add flag for cost_observability table writing to support sub-cohort use case [VS-521] #8093
Merged
Merged
Changes from 15 commits
Commits
Show all changes
16 commits
Select commit
Hold shift + click to select a range
131161e
first draft of doc
rsasch 6d0fc9a
add cost tracking flag
rsasch 5270c87
fun with string interpolation
rsasch 2ea1d42
d'oh
rsasch ec1f47d
d'oh, again
rsasch 80d7f2f
a bit more project id logic
rsasch 4231776
always pass write_cost_to_db
rsasch a2bb04e
use empty string for False
rsasch 06cccba
make cost-observability-tablename truly optional
rsasch f5deaa8
add gatk_override for GvsExtractCohortFromSampleNames.wdl
rsasch 2ca0dbf
formatting nit
rsasch 2267b62
PR feedback
rsasch 9a11e26
clarification from PR comment
rsasch 140ae61
Merge branch 'ah_var_store' into rsa_vs_521_subcohort_cost
rsasch 1420577
update to new docker image
rsasch ecc4220
cleanup dockstore
rsasch File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,28 @@ | ||
# How to Run the GvsExtractCohortFromSampleNames Workflow | ||
|
||
The purpose of `GvsExtractCohortFromSampleNames.wdl` is to take advantage of an existing GVS (Genomic Variant Store) in BigQuery, complete with filter model, to generate a callset with a subset of specified samples' data. It calls existing WDLs for the "Prepare" and "Extract" steps and allows for the data required to create the subset callset to be stored and queried (and, therefore, paid for) by a different Google project than the "parent" GVS (most probably paid for by AoU). | ||
|
||
Required inputs: | ||
|
||
| Parameter | Description | | ||
|------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | ||
| call_set_identifier | a unique name for this cohort (used for cost tracking) | | ||
| cohort_table_prefix | a unique name for this cohort (can be the same as `call_set_identifier`) should only contain letters and underscores | | ||
| destination_dataset_name | name of the BigQuery dataset that's in the destination Google project ID | | ||
| destination_project_id | Google project ID where the `destination_dataset_name` lives | | ||
| extraction_uuid | unique name used as a query label for the "Prepare" step, can be the same as call_set_identifier and/or cohort_table_prefix (requested as an input by Verily for cost-trackings; if this isn't Verily-related, the value doesn't matter) | | ||
| filter_set_name | the same input that was used for `GvsCreateFilterSet` (or `GvsJointVariantCalling`) | | ||
| gvs_dataset | the same input value that was used for `dataset_name` in the GVS step WDLs (or `GvsJointVariantCalling`) | | ||
| gvs_project | the same input value that was used for `project_id` in the GVS step WDLs (or `GvsJointVariantCalling`) | | ||
| output_file_base_name | the base file name for the VCFs that will be created in the "Extract" step | | ||
| query_project | Google project ID for the permissions/billing of the "Prepare" and "Extract" steps for this sub-cohort (can be the same as `destination_project_id`) | | ||
| scatter_count | Number of shards to scatter "Extract" step across | | ||
| control_samples | GCS path to a file that contains a list of the samples (`sample_name` field in GVS) to include in the sub-cohort; if not set, `cohort_sample_names_array` must be set | | ||
| cohort_sample_names_array | Array of sample identifiers (`sample_name` field in GVS) to include in the sub-cohort; if not set, `control_samples` must be set | | ||
|
||
Optional inputs of interest: | ||
|
||
| Parameter | Description | | ||
|------------------------|--------------------------------------------| | ||
| drop_state | This should correspond to the same value set in `GvsImportGenomes` (or `GvsJointVariantCalling`) | | ||
| output_gcs_dir | GCS path to a directory to copy the interval list files, the extract VCFs and a sample manifest into | |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TOL can this be simplified (actual question, I don't know)?