-
Notifications
You must be signed in to change notification settings - Fork 424
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cigars of secondary alignments are always set to the primary one #157
Comments
What do you mean by "informative cigars"? "100M" is the right CIGAR for all three alignments. |
SAM spec v1.5 includes I see now that you have The Lines 136 to 142 in aef7b07
This additional data ( qseq and tseq ) is, in turn, generated inLines 220 to 234 in aef7b07
I am not quite sure what exactly is being done there since this definition Lines 24 to 26 in aef7b07
is a bit cryptic |
It looks like minimap uses 4 bits for cigar codes and only 4 values are actually used, so one could have separate codes for |
I see there is a PR #156 actually |
Then this is a duplicate to #124. I think X/= is the second biggest mistake in the SAM spec (the top one being the bit flag). It is against the principle of alignment, inflates file sizes and has little use in practice as it doesn't tell us what are the deleted and mismatching bases. Typically I wouldn't consider X/= at all. Nonetheless, pacbio required their BAMs to have X/=. I could make an exception for them as a vendor, even though that is a wrong decision. Now #156 is pending. I will merge some form of it in future, but mappy won't support X/=.
That is the typical way to encode bases to integers.
Positions of insertions/deletions are encoded in CIGAR. For now, you can retrieve reference sequences with |
Thanks for the explanations. Exposing This intermediate representation could be used by library users and it would make it easier to expose to Python in meaningful way. |
With #156 effectively merged (with small changes), I am closing this issue. Thanks a lot for participating the discussions, which has helped the decision making. |
I have noticed that the secondary alignments do not have informative cigars:
Is it a bug or by design? Is there a way to get informative cigars for all alignments?
The text was updated successfully, but these errors were encountered: