-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
comethylation
may not work with soft-clipped reads or reads containing indels
#45
Comments
Bismark does not allow soft-clipping of reads. Bismark with Bowtie1 Bismark with Bowtie2
|
This is a (non-bisulfite) read aligned with
The last 10 cycles of the read have been soft-clipped: >>> read.cigarstring
'90M10S' Here's a comparison of >>> read.alen
90 read.inferred_length (not available in PySam v < 0.7.6) >>> read.inferred_length
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
AttributeError: 'pysam.csamtools.AlignedRead' object has no attribute 'inferred_length' read.qlen >>> read.qlen
90 read.rlen >>> read.rlen
100 |
This is a (non-bisulfite) read aligned with
There is a one base insertion at cycle 71 in the read. >>> read.cigarstring
'70M1I29M' Here's a comparison of >>> read.alen
99 read.inferred_length (not available in PySam v < 0.7.6) >>> read.inferred_length
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
AttributeError: 'pysam.csamtools.AlignedRead' object has no attribute 'inferred_length' read.qlen >>> read.qlen
100 read.rlen >>> read.rlen
100 |
This is a (non-bisulfite) read aligned with
The first 5 cycles are soft-clipped and there is a deletion at cycle 66 of the read. >>> read.cigarstring
'5S60M1D35M' Here's a comparison of >>> read.alen
96 read.inferred_length (not available in PySam v < 0.7.6) >>> read.inferred_length
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
AttributeError: 'pysam.csamtools.AlignedRead' object has no attribute 'inferred_length' read.qlen >>> read.qlen
95 read.rlen >>> read.rlen
100 |
This is a (non-bisulfite) read aligned with
There is a deletion at cycle 25 of the read. >>> read.cigarstring
'24M1D76M' Here's a comparison of >>> read.alen
101 read.inferred_length (not available in PySam v < 0.7.6) >>> read.inferred_length
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
AttributeError: 'pysam.csamtools.AlignedRead' object has no attribute 'inferred_length' read.qlen >>> read.qlen
100 read.rlen >>> read.rlen
100 |
This is a (non-bisulfite) read aligned with
The first 40 cycles are soft-clipped, there is an insertion at cycle 63 and a deletion at cycle 79 of the read. >>> read.cigarstring
'40S22M4I12M1D22M' Here's a comparison of >>> read.alen
57 read.inferred_length (not available in PySam v < 0.7.6) >>> read.inferred_length
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
AttributeError: 'pysam.csamtools.AlignedRead' object has no attribute 'inferred_length' read.qlen >>> read.qlen
60 read.rlen >>> read.rlen
100 |
Another point to check when using soft-clipped reads will be the definition of n_overlap = read_1.alen + read_2.alen - abs(read_1.tlen) |
|
|
Closing as |
comethylation.
usesread.alen
to infer read length. This may give odd results for soft-clipped reads or reads containing indels. I need to investigate which of the length methods is most appropriate:alen
,inferred_length
,qlen
orrlen
.To be on the safe side, the initial public release of
comethylation.py
will simply skip reads containing indels.The text was updated successfully, but these errors were encountered: