-
Notifications
You must be signed in to change notification settings - Fork 424
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
minimap2-2.20(r1061) gets killed #749
Comments
What's your input? I tested the recent versions on E. coli reads and got the right output. |
I don't have access to dnanexus, but on another HG002 Nanopore dataset, minimap2 doesn't stop at "collected minimizers". Do you have enough memory? Does older minimap2 work? BTW, these days, it is not recommended to use minimap2 for assembly. There are better Nanopore assemblers for large datasets. |
@lh3 |
Could you make your dataset available? 32GB should be enough for 100k reads. v2.20 shouldn't use more memory than v2.18 in the ava-ont mode. |
I have also been facing a similar problem and the available disk space is 712GB, with 32GB RAM. I have been trying to align the nanopore datasets to the mouse genome (using minimap v2.18), however the process is killed. Could @lh3 you please suggest on the possible solutions? |
I'm having a similar problem aligning reads against a 16S index. I was able to run this earlier with ~v2.17 on an older iMac with 16GB memory. Now I've tried running the same job on my newer MacBook Air running Big Sur with 8GB memory and an EC2 t2.2xlarge instance running ubuntu with 32GB memory and they both get killed. I do not get a message saying insufficient memory, just the following after running for a few minutes (version 2.21-r1071): ubuntu@ip-172-31-1-218:~/Classification$ cat run_minimap2.out |
Hello, I am also have similar issue. I am trying to align two different versions genome assemblies of the same species using the following command and my job is getting killed similar to other people posted here. Please see the error message below. Error Message: [M::mm_idx_gen::9.428*1.31] collected minimizers [M::mm_idx_gen::10.605*1.83] sorted minimizers [M::main::10.607*1.83] loaded/built the index for 177 target sequence(s) [M::mm_mapopt_update::11.006*1.80] mid_occ = 50 [M::mm_idx_stat] kmer size: 19; skip: 19; is_hpc: 0; #seq: 177 [M::mm_idx_stat::11.283*1.78] distinct minimizers: 20039135 (95.27% are singletons); average occurrences: 1.130; average spacing: 9.947; total leng th: 225250884 /cm/local/apps/slurm/var/spool/job1799308/slurm_script: line 14: 13532 Killed minimap2 -t 15 -ax asm5 /work/user/thu/genomes /AmelHAv3.1/GCF_003254395.2_Amel_HAv3.1_genomic.fna /work/user/thu/HB_assemblies/asm.v2.FINAL.fasta > asm_vs_AmelHAv3.1.mMap2.out Script used to run minimap2: #SBATCH --cpus-per-task 14 #SBATCH --mem-per-cpu 20000 #SBATCH -J minMap2 cd $SLURM_SUBMIT_DIR cd ../results/AmelHAv3.1_vs_asm module load minimap2/7025b0b minimap2 -t 13 -ax asm5 /work/user/thu/genomes/AmelHAv3.1/GCF_003254395.2_Amel_HAv3.1_genomic.fna /work/user/thu/HB_assemblies/asm.v2.FINAL.fasta > asm_vs_AmelHAv3.1.mMap2.out |
Do you have sequences such that I can reproduce the issue? Thanks. |
Heng,
The sequences are here: https://www.dropbox.com/s/lyn8d7vvsjkfoqt/barcode06_all.fastq.gz?dl=0
The index is here: https://www.dropbox.com/s/36mmhvtloyjcb15/ncbi_16SRNA.mmi?dl=0
Here’s the command: ./minimap2 -ax map-ont -o barcode06_fast.sam --secondary=no ncbi_16SRNA.mmi barcode06_all.fastq.gz
It appears to be a memory leak - I ran top while it was running and the memory usage keeps going up and then it gets killed when there is no more memory available. I’ve tried it on the current version (2.21-r1071) and minimap2.2.17-r974-dirty with the same results. If I only run 100k of the 975k reads it will complete. 500k will not.
I do not have access to the iMac I ran it on a few months back where it worked.
Thanks for taking a look!
—Mike Sierk
… On Aug 3, 2021, at 9:34 AM, Heng Li ***@***.***> wrote:
Do you have sequences such that I can reproduce the issue? Thanks.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub <#749 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ANL5LMKL2TD5SJMYM5R27RTT27WADANCNFSM45VDEX5A>.
Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&utm_campaign=notification-email>.
|
This is more apparent when there are many candidate chains. Although only a small numbers of them are extended, they are still occupying memory. A realloc() solves this problem. This is a long existiing issue.
Thanks, @msierk. This is indeed caused by an unusual memory leak. Please try the dev branch. It should have this issue fixed. All versions of minimap2 have this problem. For mapping against a large number of 16S, you may add @donkirkby Your issue looks different because your reference is not repetitive (based on |
I suspect that @lh3's comment above is for @donthuanalyst instead of me. |
@donkirkby Yes. Thanks, and sorry for mentioning you instead. |
That seems to have fixed it - thanks! |
This is more apparent when there are many candidate chains. Although only a small numbers of them are extended, they are still occupying memory. A realloc() solves this problem. This is a long existiing issue.
Dr Li, (@lh3 ),
With the two most recent releases,
./minimap2 -t 1 -x ava-ont reads.fq reads.fq > /dev/null
gets killed right after the print for "collected minimizers" is completed.
The text was updated successfully, but these errors were encountered: