Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HPC/Biowulf memory overallocation #85

Closed
samarth8392 opened this issue Apr 22, 2024 · 0 comments · Fixed by #86
Closed

HPC/Biowulf memory overallocation #85

samarth8392 opened this issue Apr 22, 2024 · 0 comments · Fixed by #86

Comments

@samarth8392
Copy link
Collaborator

samarth8392 commented Apr 22, 2024

Issue

I ran >100 whole exome seq samples sequenced at average 300-400x through the XAVIER pipeline and I received a warning from HPC/Biowulf about memory overallocation.

Coverage histogram from MultiQC

qualimap_coverage_histogram

Message from HPC:

We have noticed that many of your jobs on Biowulf ALLOCATED
considerably MORE MEMORY than they used during a recent time period:

JOBS:         13345


Resources:    Needed: 15,608 GBh   Alloc: 232,901 GBh  Efficiency: 6.7%
From:         2024-04-12 10:02:08
Until:        2024-04-13 10:02:09

GBh = Gigabyte hours (i.e. GB * runtime in hours)

Please check on the resource usage of types of jobs you run regularly
and adjust memory allocations to reduce idle memory.

Memory that is allocated by a job but not used reduces cluster
efficiency and prevents the jobs of your NIH colleagues from using
this memory. In addition, allocating more resources than needed
will reduce the priority of your future jobs resulting in longer queue
wait times.

I looked at the runtime and max memory used for all the rules and noticed that all rules were only using 5-10GB of max memory and most memory was used by gatk_recal, LearnReadOrientationModel, and a few others.

usage

Default memory:

rule alloc_mem (gb)
fc_lane 64
fastq_screen 64
gatk_recal 72
bam_check 64
bam2fastq 48
haplotypecaller 48
samtools_flagstats 64
fastqc_bam 64
pileup_single 64
mutect2_single 64
varscan_single 64
vardict_single 64
mutect_single 64
somatic_mafs 64
LearnReadOrientationModel 64
somatic_merge_callers 64
mergegvcfs 48
genotype 96

Proposed solution

Reduce memory allocation for non-GATK and other rules that don't need a whole lot of memory.

samarth8392 added a commit that referenced this issue Apr 22, 2024
@samarth8392 samarth8392 mentioned this issue Apr 22, 2024
1 task
dnousome added a commit that referenced this issue May 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant