Reduce memory and CPU for CreateImportTsvs task, check for files before attempting load #7121

Merged
mmorgantaylor merged 3 commits into ah_var_store from mmt_reduce_mem
Mar 4, 2021

@mmorgantaylor mmorgantaylor commented Mar 3, 2021

Both of these are updates to the ImportGenomes WDL:

  • Reduce the resources for the CreateImportTsvs task from 10 GB to 3.75 GB of memory and from 2 CPUs to 1 CPU. These settings were tested on 3000 gVCFs and none errored out because of memory. This ties out spec-ops issues #211 and #233.
  • Before loading files with bq load, check for existing files in the GCS bucket, and run bq load only if there are matching files in the bucket. This prevents an error when you run a subset of samples from a larger sample map, such that a pet_002 table has been created but there aren't any samples to load into pet_002 yet. This was tested in Terra and worked as expected.
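
The check-before-load behavior described above could be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the function names, the table name, the bucket pattern, and the schema file are all hypothetical. It relies on `gsutil ls` exiting non-zero when a wildcard matches no objects.

```shell
#!/usr/bin/env bash
# Hypothetical helpers; names and arguments are illustrative assumptions,
# not taken from the ImportGenomes WDL.

# Succeeds (exit 0) only if at least one object matches the gs:// pattern.
has_matching_files() {
  local pattern="$1"
  # gsutil ls returns a non-zero status when the pattern matches nothing.
  gsutil ls "$pattern" > /dev/null 2>&1
}

# Runs bq load for a table only when there is something to load.
load_if_present() {
  local table="$1" pattern="$2" schema="$3"
  if has_matching_files "$pattern"; then
    bq load --source_format=CSV --field_delimiter=tab --skip_leading_rows=1 \
      "$table" "$pattern" "$schema"
  else
    # An empty table is acceptable here, so skip rather than fail.
    echo "no files match ${pattern}; skipping load for ${table}"
  fi
}
```

A call such as `load_if_present my_dataset.pet_002 "gs://my-bucket/pet_002_*" pet_schema.json` would then print a skip message instead of failing when no pet_002 files have been written yet.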

@mmorgantaylor mmorgantaylor requested a review from ahaessly March 3, 2021 18:24
.dockstore.yml Outdated
@@ -57,6 +57,7 @@ workflows:
branches:
- master
- ah_var_store
- mmt_reduce_mem
Member Author
will remove this before merging

Contributor

@ahaessly ahaessly left a comment

👍

@mmorgantaylor mmorgantaylor merged commit 2e4884a into ah_var_store Mar 4, 2021
@mmorgantaylor mmorgantaylor deleted the mmt_reduce_mem branch March 4, 2021 15:41
kcibul pushed a commit that referenced this pull request Mar 9, 2021
…re attempting load (#7121)

* reduce memory for ImportGenomes and add this branch to dockstore yml

* do not fail bq load if no files to ingest

* remove feature branch from dockstore
mmorgantaylor added a commit that referenced this pull request Apr 6, 2021
…re attempting load (#7121)
This was referenced Mar 17, 2023