-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
should we be filtering for mapping quality? #110
Comments
Hi James! That's a good question, and I did some experiments in the past, mostly with respect to transcript discovery. IsoQuant has no general MAPQ cutoff, but filters some alignments by quality in some specific cases. Constants are a bit ad-hoc and may seem to be complicated, but I certainly did some experiments to implement this filtering. Although I cannot say I did an excessive analysis of MAPQ values in general. Also, some long-read RNA aligners to not report meaningful MAPQs, setting 60 for all alignments. However, I presume minimap2 is used in 99% of cases. In order not to complicate the usage, I can simply add MAPQ filtering option to IsoQuant. It should be also fairly easy to perform some experiments on simulated data to figure out how transcript discovery is affected by the global MAPQ filtering. Best |
I thought you were using secondary alignments based on the help text that said I noticed this because I was comparing multiple runs of the same sample, using different library preps, and I saw a very large difference in read counts at a pseudogene. It seemed like they were primarily low-quality alignments, which made me suspicious that this was the right assignment. |
Yes, secondary alignments are not discarded by default. I meant that MAPQ filtering is only applied to the primary alignments. |
Implemented now in version 3.4. |
Hi Andrey 👋
We've been running IsoQuant a lot lately. This week I was looking at some of our data in IGV and noticed a discrepancy between the number of reads assigned by isoquant and the pile-up I could see there. It turned out that this was because IGV had a filter on MAPQ and was only showing high-quality mapping, whereas IsoQuant will use (and quantify?) all the reads regardless of mapping quality.
Is this intended behavior, and if so were there any experiments to suggest that this provides better results? Or perhaps we should be filtering our BAM file before running isoquant, to remove the low-quality alignments?
The text was updated successfully, but these errors were encountered: