You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for making this wonderful tool! I have run into an issue:
After running the following command... samtools view ./WT_96_2.bam | BAFExtract -generate_compressed_pileup_per_SAM stdin ./CaSpER/dm6.list ./CaSpER/baf/wt96_2 50 0; BAFExtract -get_SNVs_per_pileup ./CaSpER/dm6.list ./CaSpER/baf/wt96_2 ./CaSpER/dm6_fasta_pileup/ 20 4 0.1 ./CaSpER/baf/wt96_2/WT_96_2.baf
...I get the following std output:
Generating pileup from SAM file with min mapp qual 50, min base qual 0
Dumping the pileups per SAM file stdin
Allocating pileup memory.
Done.
Maximum # of alleles at each position: 65536
Phred+33 encoding @ 14. read (1):
CACTCAATTATATACTTTATATGGTCGGAAAAGCTTCCTTCTGCCTGTAA
D@@@@EHHEEHEHHEHHGGIH1G1D1FHHHGHGC@<C@HHHHECHHCE@H
Finished reading.00. read
27379256 0 signal, 4037446 4-bit, 636538 8-bit, 26073 12-bit, 18 14-bit, 0 16-bit positions.
Skipping loading check.
24265304 0 signal, 3313416 4-bit, 517137 8-bit, 14343 12-bit, 27 14-bit, 0 16-bit positions.
Skipping loading check.
21308450 0 signal, 3421544 4-bit, 532190 8-bit, 24291 12-bit, 461 14-bit, 0 16-bit positions.
Skipping loading check.
19142111 0 signal, 3866743 4-bit, 516616 8-bit, 16734 12-bit, 67 14-bit, 0 16-bit positions.
Skipping loading check.
20211830 0 signal, 2818271 4-bit, 463435 8-bit, 20107 12-bit, 69 14-bit, 0 16-bit positions.
Skipping loading check.
3663413 0 signal, 3939 4-bit, 0 8-bit, 0 12-bit, 0 14-bit, 0 16-bit positions.
Skipping loading check.
1060875 0 signal, 256237 4-bit, 28894 8-bit, 2125 12-bit, 0 14-bit, 0 16-bit positions.
Skipping loading check.
Calling SNVs with minimum 20 total and 4 alternate read coverage and 0.100 min alternate frequency.
Procesing 3R
Loading pileup of length 32079331
Loaded 32079331 long pileup.
Procesing 3L
Loading pileup of length 28110227
Loaded 28110227 long pileup.
Procesing 2R
Loading pileup of length 25286936
Loaded 25286936 long pileup.
Procesing X
Loading pileup of length 23542271
Loaded 23542271 long pileup.
Procesing 2L
Loading pileup of length 23513712
Loaded 23513712 long pileup.
Procesing Y
Loading pileup of length 3667352
Loaded 3667352 long pileup.
Procesing 4
Loading pileup of length 1348131
Loaded 1348131 long pileup.
Question 1: what does the 20 4 0.1 parameter for BAFExtract -get_SNVs_per_pileup mean?
It seems to have worked and I can see the .baf file along with the allele_counts.bin files for each chromosome in the output folder. This is the output:
HOWEVER, given that dm6 (fruit-flies) has 2L, 2R, 3L, 3R, 4, X, Y chromosomes, and that the generated baf file only ends up containing the values for chromosome 4, I don't quite understand what happened there.
So, Question 2: How can I fix this?
This is for every sample run for the overall experiment. Any help would be greatly appreciated!
Regards,
Deep
The text was updated successfully, but these errors were encountered:
@akdess I think I may have figured out both the problems (first one is explained within the source code for main.cpp). But I need a solution for the second one.
I think BAFExtract is only pulling/writing for chromosome 4 because it is the only integer in the lot - fly chromosomes are written as 2L, 2R, 3L, 3R, X, Y and 4 and so there's an issue with detection or with writing somewhere. Am I wrong to assume so? Could you please help me fix this issue?
Hi,
Thanks for making this wonderful tool! I have run into an issue:
After running the following command...
samtools view ./WT_96_2.bam | BAFExtract -generate_compressed_pileup_per_SAM stdin ./CaSpER/dm6.list ./CaSpER/baf/wt96_2 50 0; BAFExtract -get_SNVs_per_pileup ./CaSpER/dm6.list ./CaSpER/baf/wt96_2 ./CaSpER/dm6_fasta_pileup/ 20 4 0.1 ./CaSpER/baf/wt96_2/WT_96_2.baf
...I get the following std output:
Question 1: what does the 20 4 0.1 parameter for BAFExtract -get_SNVs_per_pileup mean?
It seems to have worked and I can see the .baf file along with the allele_counts.bin files for each chromosome in the output folder. This is the output:
HOWEVER, given that dm6 (fruit-flies) has 2L, 2R, 3L, 3R, 4, X, Y chromosomes, and that the generated baf file only ends up containing the values for chromosome 4, I don't quite understand what happened there.
So, Question 2: How can I fix this?
This is for every sample run for the overall experiment. Any help would be greatly appreciated!
Regards,
Deep
The text was updated successfully, but these errors were encountered: