Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Take care with samples with no matching tags&primers #1

Open
tobiasgf opened this issue Sep 12, 2016 · 0 comments
Open

Take care with samples with no matching tags&primers #1

tobiasgf opened this issue Sep 12, 2016 · 0 comments

Comments

@tobiasgf
Copy link

Dear Frederic (& others)

I just realized a potential pitfall with this pipeline (and similar approaches).
Like me, you may include empty/blank samples (or have real samples with no reads matching tags and primer sequences). When processing such a sample, Cutadapt will eventually have nothing to work on (so to speak), resulting in an empty file after removal of tags and primers. However vsearch will just work on the (fasta) tmp-file from the previous sample, resulting in the final dereplicated file (S00x.fas) for the current empty/blank/negative being identical to the previous "real" sample. This is of course easy to spot in a project with only a few samples, but may be less apparent with large datasets.

You will need to identify samples that were devoid of reads matching the actual tags for that sample, and remove them before downstream processing. They can be identified eg by searching for the sentence "Unable to read from file" in the logfiles. For example like this:

grep -c "Unable to read from file" S[0-9][0-9][0-9]*log

Regards
Tobias

@tobiasgf tobiasgf changed the title Tak care with empty files Take care with empty files Sep 12, 2016
@tobiasgf tobiasgf changed the title Take care with empty files Take care with samples with no matching tags&primers Sep 12, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant