-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
very low number of mapped read when using --read-format #136
Comments
@mourisl Is this way of using read-format supported? |
@nouroddinc Is your barcode starting from 60-67 followed by 22-29? Chromap will internally sort it to 22-29 followed by 60-67 to avoid typos in ranges. Maybe this is an undesirable feature now. |
@haowenz @mourisl thanks for helping me on this. yes, my barcode starts from 60-67 and ends with 22-29. I have tested with
I have both combinations of barcodes AACGTGAT and AACGTGAT which is |
@nouroddinc I have removed the sort in the branch "li_dev4". Could you please checkout this branch, "make clean; make" to recompile chromap, and give the new version a try? |
@mourisl I have tested the new branch and the output still shows the low number of mapped reads! |
I rechecked your running log. For the "_without_read_format" run, you used the files
For the files with "_with_read_format", the files are:
Since some of the files are in cellranger_inputs, and some are not, I just want to make sure you used the right input files. |
@mourisl I think you might be right. I tested with a new data set with smaller size to get faster output and sounds like they are very similar but still not identical. Is that normal? |
The "Number of barcodes in whitelist" is still different "580224" vs "539675". So I guess the manually extracted barcode and the automatic extracted barcode are still different. Could you please try:
Thank you for helping us debug the issue. |
@mourisl is this what you asked for? 1- I have tried reverse barcode like 2- I have extracted CB:Z tag barcodes from both sam files and they match the whitelist. I mean I can't see any wrong barcode in neither of them. |
Thank you for the testing. |
alright, I have two outputs.
sam files output in normal barcode is like this:
chromap_with_read_format_rev.log and reverse barcode sam file is like :
and the read2 file is this:
hopefully, this is what you were looking for. Thanks |
Seems the barcode order issue is indeed fixed. Are the barcodes from readformat run the same as your manually-created barcode file (without readformat) based on the SAM file? |
it sounds like NOT; they are in reverse of each other!
the out put is
and without
this means that reverse barcode that i already tested is aligned with the one with out |
Could you please be more specific about whether the "without --read-format" run had mistakes due to the input order being wrong, or the "with --read-format" run generated wrong barcodes? Thank you. |
@mourisl I think it works now pretty well with my manual barcode extraction pipeline. If I use |
Hi
chromap
team,Thank you so much for preparing this helpful tools and I'm very impressed by its performances. I am testing it on my scATAC-seq data and it works pretty good when I don't use
--read-format
.I have a scATAC-seq with R1 and R2 to be mapped and cell barcodes in R2 ( at positions bc1:22:29 and bc2:60-67)
Looking at my R2 file:
using the followoing command:
gives me very low number of the mapped reads as the chromap_with_read_format.log
shows.
but if I extract barcodes from read_R2.fq.gz into read_R3.fq and leave the genome in read_R2.fq as a new file like the below:
and
and then run like this:
it works very good as log file chromap_without_read_format.log shows. my
chromap
version isand this is my barcode whitelist file:
I would appreciate if you could help me to find what i am doing wrong because using --read-format argument can make it much faster and easier to run my pipeline.
many Thanks,
Noori
The text was updated successfully, but these errors were encountered: