Non-fitted error rates #1135

GianyA · 2020-09-17T16:27:25Z

Hello there,

I was running the pipeline and during the error rate modeling, I got a quite bizarre results during the modeling (find attached the graphs).
forward-err.pdf
reverse-error.pdf

This data was sequenced with Iseq-Illumina and it's 16s amplicons from a bioreactor.
So, is this kind of deviation normal for the error learning or it's just my data?

Cheers,
Giany.

benjjneb · 2020-09-18T13:48:13Z

The issue here is that the iSeq uses binned quality scores, rather than the normal quality scores 1-40, and this interacts with error model learning. See this issue for a longer discussion: #791

Short answer, things still seem to work fine as far as we can tell, but there are tweaks you can pursue to improve the monotonicity of the fitted error rates.

benjjneb · 2020-09-18T13:55:24Z

See also this analysis of DADA2 performance on iSeq data by @ong8181: #1083 (comment)

diriano · 2020-12-10T15:59:05Z

Hi,
I think I am having a problem with binned quality scores.
I have 56 samples, each with over 400K paired-end reads (2x150bp, using 16S rDNA V4 region), after filtering.
When trying to run learnErros using loessErrfun, the R1 was OK, could fit the data. But learnErrors was always failing for R2, no matter which sample I was using. The error mesage is:

139227062 total bases in 926543 reads from 2 samples will be used for learning the error rates.
Error rates could not be estimated (this is usually because of very few reads).
Error in getErrors(err, enforce = TRUE) : Error matrix is NULL.

See the aggregate quality plots for the 56 samples forward reads:

And reverse reads:

From these figs, the quality valies seems to be binned.

Following are the results of plotErrors for the foward reads, where loessErrfun works. That seems OK to me.

AS the reverse reads dis not work with loessErrfun , I tried with noqualErrfun, and that was able to finish fitting the data. Here is the plotErrors

Would it be OK to go on with the learnErrors for the reverse reads? Can I mix the learnErrors of the forward reads using loessErrfun and the reverse reads with noqualErrfun?^

Thanks,
Diego

benjjneb · 2020-12-14T22:27:08Z

I think I might just use the forward read error model for both forward and reverse reads in this case. The observed data looks very similar, so it should work well enough.

benjjneb closed this as completed Oct 30, 2020

JacobRPrice mentioned this issue Mar 24, 2021

Binned quality scores and their effect on (non-decreasing) trans rates #1307

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Non-fitted error rates #1135

Non-fitted error rates #1135

GianyA commented Sep 17, 2020

benjjneb commented Sep 18, 2020

benjjneb commented Sep 18, 2020

diriano commented Dec 10, 2020 •

edited

Loading

benjjneb commented Dec 14, 2020

Non-fitted error rates #1135

Non-fitted error rates #1135

Comments

GianyA commented Sep 17, 2020

benjjneb commented Sep 18, 2020

benjjneb commented Sep 18, 2020

diriano commented Dec 10, 2020 • edited Loading

benjjneb commented Dec 14, 2020

diriano commented Dec 10, 2020 •

edited

Loading