-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
nonCG, single context reporting #102
Comments
Hi @amvondras, Thanks for the report and my apologies for it taking me a while to reply. This is how the option methtuple/methtuple/scripts/methtuple Lines 102 to 106 in 33e425a
You can see that the default value is CG and that if the user supplies So far, the suggested ways to deal with this seem to be setting the default value to reprex(for my own records; note the pip install methtuple
git clone git@github.com:PeteHaitch/methtuple.git
# This is v1.6.0
peter@PC1331:~/GitHub/methtuple (master)*$ ~/.local/bin/methtuple -m 2 --methylation-type CHH data/se_directional.fq.gz_bismark_bt2.bam
[E::idx_find_and_load] Could not retrieve index file for 'data/se_directional.fq.gz_bismark_bt2.bam'
methtuple (v1.5.3)
Input BAM file = data/se_directional.fq.gz_bismark_bt2.bam
Output file of CG/CHH 2-tuples = data/se_directional.fq.gz_bismark_bt2.CG_CHH.2.tsv
Reads that fail to pass QC filters will be written to = data/se_directional.fq.gz_bismark_bt2.reads_that_failed_QC.txt
Assuming quality scores are Phred33
Assuming BAM file was created with Bismark version >= 0.8.3.
Ignoring improper read-pairs
Using all positions of each read (if data are single-end) or of each read_1 (if data are paired-end).
Using all positions of each read_2 (if data are paired-end).
Ignoring methylation calls with base-quality less than 0
Ignoring reads with mapQ less than 0
Ignoring any overlapping positions of paired-end reads where the XM-tags disagree and counting once the remaining overlapping positions.
Creating CG/CHH 2-tuples
WARNING: 2-tuples may still have intervening methylation loci (i.e. NIC > 0). Such 2-tuples generally occur in paired-end reads with non-overlapping mates but can also be caused by filtering methylation calls by base quality, read-position, etc. You may wish to post-hoc filter 2-tuples with NIC > 0.
Verified that the XR-, XG- and XM-tags are set for the first mapped read.
Finished extracting CG/CHH 2-tuples.
Now writing output to data/se_directional.fq.gz_bismark_bt2.CG_CHH.2.tsv ...
Summary of the number of DNA fragments processed by methtuple
Number of DNA fragments in file = 8266
Number of DNA fragments skipped due to being marked as PCR duplicates = 0
Number of DNA fragments skipped due to failing the --min-mapq filter = 0
Number of DNA fragments skipped due to the read or its mate being unmapped = 0
Number of DNA fragments skipped due to the read-pair being improperly paired = 0
Number of DNA fragments skipped due to mates mapping to different chromosomes = 0
Number of DNA fragments skipped due to a complicated CIGAR in the read or its mate = 0
Number of DNA fragments informative for CG/CHH 2-tuples = 8254 (99.9% of total fragments)
Writing histogram with the number of CG/CHH methylation loci per DNA fragment that passed QC filters to data/se_directional.fq.gz_bismark_bt2.CG_CHH_per_read.hist ...
|
I've realised this probably won't give you what you want because the way the tuples are created in |
This is currently available on https://github.com/PeteHaitch/methtuple/tree/issue-102 |
Hi @amvondras, I've made a new version of methtuple ( Cheers, |
I will try this using Python3 and report back. Thank you! |
I'm using methtuple (v1.6.0) with the
—mt
flag, ex.I noticed that
—mt CG
is outputting CG tuples, ex. CS06.h1.CG.2.tsv.gzbut
—mt CHH
is outputting CHH and CG, ex. CS06.h1.CG_CHH.2.tsv.gzAm I making a use or conceptual error? I had thought
—mt CHH
would only output CHH tuples.The text was updated successfully, but these errors were encountered: