assembly issues in arcC loci for Staphylococcus Aureus #164

Open
1 of 3 tasks
talnor opened this issue Jan 8, 2024 · 0 comments
talnor commented Jan 8, 2024

Describe the bug
Since the switch from NovaSeq 6000 to NovaSeq X, Staphylococcus aureus samples have started failing in the analysis. The failure is systematic and affects a relatively large percentage of the Staphylococcus aureus samples in the same way: the contig covering the arcC locus is split in the middle of the region, so no MLST type can be reliably assigned because none of the spanning contigs covers enough of the locus. See more info in the deviation here.

The issue needs to be fixed so that these samples can be typed in microSALT.

To Reproduce
Steps to reproduce the behavior:

  1. Run microSALT on a Staphylococcus aureus sample sequenced on the NovaSeq X.
  2. Check the loci results in the "MLST" table for the sample.
  • The field Längd (HSP) % (HSP length %) will show a low span of around 70%.
  3. Check position 2631741 in AP017922.1 coordinates for an A->G minority SNP.
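
The check in step 2 can be sketched in Python as below. This is only an illustration, not microSALT code: the `LocusHit` type, the function names, the 90% threshold and the example lengths are all assumptions made up for this sketch. It computes the HSP span as a percentage of the reference allele length and lists the loci that fall below the threshold.

```python
from dataclasses import dataclass


@dataclass
class LocusHit:
    """Hypothetical record for one row of the MLST table (not a microSALT type)."""
    locus: str          # locus name, e.g. "arcC"
    hsp_length: int     # aligned length of the best HSP, in bases
    allele_length: int  # full length of the reference allele, in bases


def hsp_span_percent(hit: LocusHit) -> float:
    """Return the share of the reference allele covered by the HSP, in percent."""
    return 100.0 * hit.hsp_length / hit.allele_length


def flag_short_hits(hits, min_span=90.0):
    """Return the loci whose HSP covers less than min_span percent of the allele.

    The 90.0 default is an assumed threshold chosen for illustration only.
    """
    return [h.locus for h in hits if hsp_span_percent(h) < min_span]


# Made-up example values: one locus split across contigs, one fully covered.
hits = [
    LocusHit("arcC", 319, 456),
    LocusHit("aroE", 456, 456),
]
print(flag_short_hits(hits))
```

A sample affected by this issue would be expected to list arcC here, since its span sits around 70%.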

Expected behavior
To circumvent the issue described above, there are a number of options:

  • A. Implement the --uncareful flag in cg and rerun samples that fail. Notify customer in ticket.
  • B. Update microSALT to skip "careful" assembly for all S. aureus samples.
  • C. Update microSALT to remove the "careful" assembly for all species by default. [preferred option]

With the data we have to work with, we think it is better to skip the SPAdes --careful flag. Provided that option C gives the same results as before, it can be applied to all samples, which makes it clear how the analysis is performed and simplifies the handling of microbial samples.
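
As a rough sketch of what option C amounts to, the SPAdes command line could be built with the --careful flag off by default and available only on request. The function name and parameters below are hypothetical and do not reflect how microSALT actually assembles its command. Only `spades.py` and its `--careful`, `-1`, `-2` and `-o` options are taken from SPAdes itself.

```python
def build_spades_command(forward_reads, reverse_reads, output_dir, careful=False):
    """Build a paired-end SPAdes command as an argument list.

    Hypothetical helper for illustration; careful=False mirrors option C,
    where "careful" assembly is no longer the default.
    """
    cmd = ["spades.py", "-1", forward_reads, "-2", reverse_reads, "-o", output_dir]
    if careful:
        # Insert the flag right after the executable name.
        cmd.insert(1, "--careful")
    return cmd


# Placeholder file names, for illustration only.
print(" ".join(build_spades_command("sample_R1.fastq.gz", "sample_R2.fastq.gz", "assembly")))
```

Keeping the flag as an explicit parameter rather than removing it entirely would still allow a "careful" rerun for comparison against earlier results.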

Test with e.g. ticket 121778.

Software version

  • microSALT version 3.3.5

Additional context

As a side note, microSALT still gives an estimate of the allele at each locus for samples that fail typing QC, but because of the limited data, this allele estimate can be expected to vary when resequencing the sample.

@talnor talnor added the bug label Jan 8, 2024
@talnor talnor self-assigned this Jan 8, 2024
@talnor talnor removed their assignment Feb 7, 2024