-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Output multiple artifacts per primer, similar to Cutadapt's demultiplexing method #60
Comments
This is totally possible now with |
Great to know, thanks @ebolyen! Yes, I would be more than happy to work on it. Is |
Hey @lina-kim, I am actually working on some tutorial content that includes You can also find the new API docs on Want to take a look at that and let us know if you have questions about how to use |
Perfect, thanks for the resources @gregcaporaso! I'll check them out and get back to you with any questions. |
Addition Description
It would be useful to bin reads by primer prior to primer removal. I'd like to separate a single FASTQ-based artifact (containing several different primers) into multiple output artifacts by primer; each output artifact would be characterized by a single primer. This would be helpful for meta-analyses in which sequences with multiple primers/variable regions may be found in a single QIIME artifact.
This is possible with native Cutadapt (as of
v4.5
) using steps to demultiplex, but not in the QIIME 2 plugin as its inputs are restricted to specific semantic types.Current Behavior
qiime cutadapt demux
(based on adapter sequence), but generates only a single output for demultiplexed sequences. It also requires an input artifact of typeMultiplexedSingleEndBarcodeInSequence
and does not acceptSampleData[Single/PairedEndSequencesWithQuality]
.qiime cutadapt trim
could technically perform this by running the command once per primer (pair), but that is quite inefficient.Proposed Behavior
q2-cutadapt
would take as input 1) a FASTQ artifact ofSampleData[Single/PairedEndSequencesWithQuality]
, which contains N different primer sequences among its many reads, and 2) a tab-separated metadata file containing the N primer names and corresponding primer sequences.SampleData[Single/PairedEndSequencesWithQuality]
; each output artifact would contain reads of the same primer sequence. There would also be an output artifact (alsoSampleData[Single/PairedEndSequencesWithQuality]
) of sequences that did not have any of the N primer names.Questions
References
qiime cutadapt trim-paired
The text was updated successfully, but these errors were encountered: