Calling the same physical positions between sequencing runs #60
Comments
Hi @gertzem, My recommendation would be to make a unified VCF file that contains the union of all the variants that you've detected. You could do this by merging the BAM files (with Then run vartrix against the BAM file of each sample separately, but use the unified VCF file. That way each vartrix output will contain quantification of the reference and alt alleles for all the variants. You'll need to do a bit of work to collate these vartrix results while keeping track of which sample they came from, but that should be straightforward because they'll be in different outputs. Does that make sense?
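A rough sketch of the collation step described above, assuming each sample's vartrix result has already been loaded into a mapping from (variant, barcode) to (ref count, alt count). The loading step, the `collate` helper, the variant key format, and the sample names are all hypothetical and not part of vartrix. Tagging each barcode with its sample name keeps barcodes that are reused between runs distinct.

```python
from typing import Dict, Tuple

# (variant, barcode) -> (ref count, alt count); a hypothetical in-memory layout
Counts = Dict[Tuple[str, str], Tuple[int, int]]


def collate(per_sample: Dict[str, Counts]) -> Counts:
    """Combine per-sample counts, tagging each barcode with its sample name."""
    combined: Counts = {}
    for sample, counts in per_sample.items():
        for (variant, barcode), ref_alt in counts.items():
            # "sample:barcode" stays unique even if two runs share a barcode
            combined[(variant, f"{sample}:{barcode}")] = ref_alt
    return combined


# Toy data: the same barcode appears in two regions (separate sequencing runs)
per_sample = {
    "region_A": {("chr1:100:A>T", "AAAC-1"): (3, 0)},
    "region_B": {("chr1:100:A>T", "AAAC-1"): (1, 2)},
}
combined = collate(per_sample)
```

Because the same variant key is used for every sample, a cell with only reference reads in one region ends up next to cells carrying the alt allele in another.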
Thanks for your response. I think that makes sense, but let me write out my understanding of what you said to make sure. I also have a syntactic question about implementing your solution. You are suggesting that the VCF file passed to vartrix using the If my understanding is correct, I'd prefer to do this by merging the VCF files rather than variant calling on a merged BAM, because I think calling on the merged BAM would unfairly disfavor calling region-specific alternative alleles. Using a merged VCF file leads to a syntactic question. I see that the VCF file I have been using contains one "sample", which happens to be the name of the grandparent directory (the parent is "outs"). The obvious way of merging, which is
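One way to sidestep the sample-column question is to take a sites-only union of the per-region VCF records. The sketch below is plain Python with no VCF library; `union_sites` is a hypothetical helper, and whether vartrix accepts a VCF with the sample columns dropped is an assumption that this thread does not confirm. Header lines would still need to be written separately.

```python
from typing import Dict, Iterable, List, Tuple


def union_sites(vcf_texts: Iterable[str]) -> List[str]:
    """Return unique variant records (first 8 VCF columns only).

    Records are keyed on (CHROM, POS, REF, ALT) and sorted by that key,
    so chromosomes are ordered as plain strings, not in genome order.
    """
    seen: Dict[Tuple[str, int, str, str], str] = {}
    for text in vcf_texts:
        for line in text.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and header lines
            fields = line.split("\t")
            chrom, pos, _id, ref, alt = fields[:5]
            key = (chrom, int(pos), ref, alt)
            # keep the first record seen for each site; drop FORMAT/sample columns
            seen.setdefault(key, "\t".join(fields[:8]))
    return [seen[key] for key in sorted(seen)]


# Toy per-region VCFs; the sample names are made up
header = "##fileformat=VCFv4.2\n#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\t"
region_a = header + "region_A\nchr1\t100\t.\tA\tT\t50\tPASS\t.\tGT\t0/1\n"
region_b = (
    header
    + "region_B\nchr1\t100\t.\tA\tT\t40\tPASS\t.\tGT\t0/1\n"
    + "chr1\t200\t.\tG\tC\t60\tPASS\t.\tGT\t0/1\n"
)
sites = union_sites([region_a, region_b])
```

The shared variant at position 100 appears once in `sites`, and the variant private to `region_B` is retained.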
Correct.
There are pros & cons to each approach. Probably the most important thing is to correctly tune the variant caller parameters and do some visual QC of the calls in IGV to be sure you trust them.
The VCF file you give to
Hi, and thanks for the project.
We have a metastasis project in which we sample from different physical sites, which we call regions, and it is very plausible due to metastasis that mutations will be shared between regions or private to a region. The separate regions are separate sequencing runs, and we'd like to have the reference allele called in the regions in which the alt allele doesn't appear. But, while I may be missing something, I don't see any way to combine the runs into one mega-run for variant calling. The issue is that the barcodes may be reused between sites.