You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for reporting this @pauca! We will look into this in the next week. We have some separate logic to extract the OQ field, and I think this isn't getting handled properly.
We had a bug in `org.bdgenomics.adam.util.AttributeUtils` where the regex for
splitting out the formatting string for array attributes was applied to all
attributes. In an array attribute (SAM "B" tags), the type of the array elements
is encoded before the attribute values, and is split off by commas. E.g.,
"B:i,1,2,3". If the attribute is a string (SAM "Z" tags), commas are allowed.
To resolve this, I split this regex into two regexes. We only apply the
regex for splitting out the array type if we are working on an array
attribute. This resolvesbigdatagenomics#1061.
We had a bug in `org.bdgenomics.adam.util.AttributeUtils` where the regex for
splitting out the formatting string for array attributes was applied to all
attributes. In an array attribute (SAM "B" tags), the type of the array elements
is encoded before the attribute values, and is split off by commas. E.g.,
"B:i,1,2,3". If the attribute is a string (SAM "Z" tags), commas are allowed.
To resolve this, I split this regex into two regexes. We only apply the
regex for splitting out the array type if we are working on an array
attribute. This resolvesbigdatagenomics#1061.
line
adam/adam-core/src/main/scala/org/bdgenomics/adam/util/AttributeUtils.scala
Line 31 in b29d204
with
val attrRegex = RegExp("([^:]{2,4}):([AifZHB]):([cCiIsSf]{1},)?(.*)")
does not handle properly alignmentrecords with attributes like
OQ:Z:C55/15D:::::::.7GFFAFDA442.40F=AGHHE
ie. have colons in the value part
some problematic reads are contained in gatk bundle file CEUTrio.HiSeq.WGS.b37.NA12878.bam
The text was updated successfully, but these errors were encountered: