Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error! Quality score file missing values for score range #4

Closed
kingdynasty opened this issue Jan 22, 2019 · 10 comments
Closed

Error! Quality score file missing values for score range #4

kingdynasty opened this issue Jan 22, 2019 · 10 comments

Comments

@kingdynasty
Copy link

My fastq file is Illumina-1.8 Phred+33 format, how to solve this problem?

@jsh58
Copy link
Owner

jsh58 commented Jan 22, 2019

As described here, the quality score profiles of NGmerge are limited to the range [0, 40]. If you have scores outside that range, you would need to provide alternative profiles. This can be accomplished by modifying qual_profile.txt, adding rows and columns to each matrix.

@mchimenti
Copy link

kingdynasty, I had this same issue. Let me give a clearer answer:

You need to use Phred+33 (0,41). That means setting '-u 41' instead of the default (40). This will allow "J" to be an acceptable quality score character.

@jsh58 jsh58 closed this as completed Apr 14, 2019
@tiantaibird
Copy link

As described here, the quality score profiles of NGmerge are limited to the range [0, 40]. If you have scores outside that range, you would need to provide alternative profiles. This can be accomplished by modifying qual_profile.txt, adding rows and columns to each matrix.

I have the same issues. What numbers shoul I add for the rows and columns to each matrix?

@marcgall
Copy link

marcgall commented Nov 5, 2019

I have the same issue, and it is not solved by setting '-u 41' by itself: The -w parameter must also be supplied with a modified qual_profile.txt. Modifying the matrixes with an extra row and column worked (I simply copied the outermost values). But isn't it odd that this problem crops up for such a common quality scheme?

@jsh58
Copy link
Owner

jsh58 commented Nov 5, 2019

Not odd at all; the matrices were created by analyzing sequence data that was limited to [0, 40] (cf. Figure 2).

@czhu
Copy link

czhu commented Apr 23, 2020

can you provide an option for quality score 41 since this is so common nowadays? I get the reason but it's quite common now to have Illumina 1.9 quality score, this issue really prevent us from using this nice tool.

@damienmarsic
Copy link

I tried NGmerge for the first time today and I have the same problem. the option '-u 41' does not help. Adding a row and a column to qual_profile.txt does not help either, unless I did not do it correctly. It would be nice to have a valid qual_profile.txt file to download.

@Jdbeck66
Copy link

Same issue. Seems like some explanation of how a custom profile is constructed would be a useful addition to the documentation. Suggesting that adding rows and columns to the matrix probably isn't sufficiently detailed to be helpful and pointing to a chart as the basis for the default profile doesn't indicate how that data was used to create the profile. This tool is a really useful tool for getting the relative abundances correctly identified in my downstream counting algorithms - so I'm hopeful you'll see this as a useful suggestion.

@jsh58
Copy link
Owner

jsh58 commented Apr 17, 2021

The format of a file with custom quality score profiles is described in the README under -w <file>. The process of creating the default profiles is thoroughly described in the NGmerge manuscript.

@damienmarsic
Copy link

Easy fix: add both -u 41 and -g arguments.
Example:
NGmerge -1 Reads_1.fq.gz -2 Reads_2.fq.gz -o Merged_reads.fq.gz -u 41 -g

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants