-
Notifications
You must be signed in to change notification settings - Fork 79
This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
N bases #1799
Comments
Hi, what command do you run to analyze the data? |
preset of analyze rnaseq-full-length with MiXCR version 4.3.2 I believe. Is there a possibility where updating to newer version of MiXCR might solve the issue? Also, if updating to new MiXCR version is hard to do(predefined sets of workflow), is there a way to modify the parameter to handle this? |
Not exactly. In the example above, there is no ambiguity, but rather a single nucleotide deletion in FR3, which will shift the reading frame, rendering the clone non-productive. The appearance of “N” occurs during the
|
Did you try using: Regarding the first case: a deletion of A nucleotide in FR3 will lead to a frameshift in translation of CDR3, FR4 and C gene and this clone will not be functional. |
I have tried using -MassembleContigs.parameters.discardAmbiguousNucleotideCalls=true and it does discard ambiguous nucleotides and replaces with the reference sequence. Possibility that I am considering here is that the variants captured in these reads are mutations rather than sequencing error and therefore i was wondering if there is a way to output all possible variation at those N base positions (rather than getting replaced with ambiguous base). |
I see. Generally speaking, there is an algorithm behind
These are the default values for MiXCR v4.7 with the That said, based on our experience, the default parameters work best, as they have been empirically evaluated on hundreds of different datasets. |
This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
i have noticed in that in the reports files, if there is a base N in the gene feature sequences, it translates to amino acid X.
I was wondering if there is a way to handle those base Ns. is there a way to replace those bases based on the reference?
The text was updated successfully, but these errors were encountered: