Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gubbins - Ns in output alignment #192

Closed
mstagliamonte opened this issue Jun 19, 2017 · 5 comments
Closed

Gubbins - Ns in output alignment #192

mstagliamonte opened this issue Jun 19, 2017 · 5 comments

Comments

@mstagliamonte
Copy link

mstagliamonte commented Jun 19, 2017

Dear Gubbins Team,

I am trying to use Gubbins to scan for recombination a SNP alignment which contains no missing data. The input alignment has 21,134 sites, the output file (my_output.filtered_polymorphic_sites.fasta) contains 20,174 sites. I initially thought that this was due to the potential recombinant sites being excluded; however I noticed that there are 'Ns' in the output alignment. What are these due to?
I apologize in case this is a duplicated topic, as I found a similar question here: #182 , but there is no answer yet.

Thank you for your kind attention,
Max Tagliamonte

@andrewjpage
Copy link
Contributor

If at a particular site there are SNPs somewhere in the tree and at a different point there is a recombination, the recombination will be masked out with Ns.

@mstagliamonte
Copy link
Author

Thank you, @andrewjpage for responding back so quickly.

I am sorry, but I am not sure I understand the answer. If I understand, the alignment is shorter, as some sites were removed because of recombination, but some other sites undergoing recombination were masked instead? I am not sure I understand the difference between the two.

Thank you,
Max

@andrewjpage
Copy link
Contributor

andrewjpage commented Jun 19, 2017 via email

@mstagliamonte
Copy link
Author

Thank you,
I understand now. I appreciate you taking the time to write the explanation, it was really useful.

Have a nice day,
Max

@wanyuac
Copy link

wanyuac commented Oct 31, 2020

I reckon coord 1 in the example above should also be removed and only leave coord 3 in the final alignment, as coord 1 is invariant.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants