Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

make_snps parameter generates empty output #13

Open
bounlu opened this issue Aug 4, 2022 · 7 comments
Open

make_snps parameter generates empty output #13

bounlu opened this issue Aug 4, 2022 · 7 comments

Comments

@bounlu
Copy link

bounlu commented Aug 4, 2022

I have set make_snps = True at the extract step in the config file as below to get SNP output. Although it runs the process and generates the specified *_snps.txt.gz file, it is empty and there is no error code. It seems there is a bug with writing the output to the file.

base = "/home"

index_dir = ${base}/reference
sequence_dir = ${base}/fastq
bam_dir = ${base}/mapping
bcf_dir = ${base}/calling
extract_dir = ${base}/extract
report_dir = ${base}/report

# hg38 genome index prepared in advance
reference = ${base}/reference/genome.fasta
index = ${base}/reference/genome.gemBS.gem

jobs = 8
cores = 32
threads = 64
memory = 256G


[dbsnp]

# dbsnp index prepared in advance
dbsnp_files = ${base}/reference/dbsnp_146.hg38.vcf.gz
dbsnp_index = ${base}/reference/dbsnp_146.hg38.gemBS.vcf.idx
dbsnp_type = "VCF"

[mapping]

non_stranded = False
remove_individual_bams = True

[calling]

jobs = 8
mapq_threshold = 10
qual_threshold = 13
reference_bias = 2
left_trim = 5
right_trim = 0
keep_improper_pairs = False
keep_duplicates = False
haploid = False
conversion = 0.01,0.05
remove_individual_bcfs = True
contig_pool_limit = 25000000

[extract]

jobs = 8
strand_specific = True
bigwig_strand_specific = True
phred_threshold = 10
make_cpg = True
make_non_cpg = True
make_bedmethyl = True
make_bigwig = True
make_snps = True
@MareikeJaniak
Copy link

Hi @bounlu,
Did you ever find a solution to this issue? I'm experiencing the same thing - gemBS extract seems to run without error and generates the _snps.txt.gz output file, index, and md5, but the snps file is always empty.

Thanks!
Best,
Mareike

@bounlu
Copy link
Author

bounlu commented Oct 17, 2023

Hi Mareike,

Unfortunately not, it seems the tool is not maintained anymore either. So I switched using BISCUIT.

@MareikeJaniak
Copy link

Thanks for the quick reply!

@heathsc
Copy link
Owner

heathsc commented Oct 17, 2023 via email

@MareikeJaniak
Copy link

Hi Simon,

Thanks for confirming that the tool is still maintained. We're currently building it into our methylation analysis pipeline, so glad to hear that! All other steps are working well, we're just running into the issue with the snp file.

The dbSNP index file is generated and is not empty (~4.2G) but the output file includes this line, which suggests that maybe something is going wrong?

        /lb/project/mugqic/analyste_dev/software/gemBS-rs/bin/dbsnp_index --loglevel info --threads 1 --output /lb/project/mugqic/projects/mjaniak/test/methylseq/alignment/index/dbSNP_gemBS.idx /cvmfs/soft.mugqic/CentOS6/genomes/species/Homo_sapiens.GRCh38/annotations/Homo_sapiens.GRCh38.dbSNP156.vcf.gz
n_snps 890444194, n_selected_snps 0

Is this expected or are 0 snps actually being selected?

Best,
Mareike

@heathsc
Copy link
Owner

heathsc commented Oct 17, 2023 via email

@bounlu
Copy link
Author

bounlu commented Oct 17, 2023

Hi Simon,

Good to hear that it's still maintained. I have 2 more issues pending reply, could you please help me with those as well?

I will later check your suggestions on this issue. Also it would greatly help if you can share your in-house config file.

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants