-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue when using AlleleCall with a large number of genomes #176
Comments
Greetings @ccrinconc, I downloaded the S. enterica schema available in Chewie-NS and performed allele calling with 7,744 genome assemblies downloaded from the NCBI. The process completed without any errors or warnings. The warning that chewBBACA is printing to the stdout indicates that it cannot find any FASTA files in the input directory, which can happen for several reasons. Some common causes that might lead to this issue are the following:
You might have already checked several or all of the possible causes, but it was important to provide a list to rule out common causes and close in on the issue leading to the problem you found. The warning you are getting is triggered when chewBBACA cannot detect the input path as either a file with a list of paths or a directory containing FASTA files. This might indicate that there's an incomplete or missing path or that it is an issue related to system configuration or permissions. Let me know if my suggestions help solve the issue. Kind regards, Rafael |
The command you shared includes the |
Dear Rafael, Thank you very much for the quick answer. I had indeed verified most of the things you mentioned, files exists, paths are correct, file extension is ".fasta". I did not think about the permissions, this was indeed the problem as I'm using symlinks. Thank you once again, Best regards, Cristian |
Dear team,
I have tried to do an allele call using a large number of genomes (>3000) and I ran into an issue.
chewie AlleleCall -i Genomes/ -g senterica_INNUENDO_cgMLST/ -o Results --cpu 30 --ptf ../senterica_INNUENDO_cgMLST/Salmonella_enterica.trn --output-novel --no-inferred
chewBBACA version: 3.2.0
Authors: Rafael Mamede, Pedro Cerqueira, Mickael Silva, João Carriço, Mário Ramirez
Github: https://github.com/B-UMMI/chewBBACA
Documentation: https://chewbbaca.readthedocs.io/en/latest/index.html
Contacts: imm-bioinfo@medicina.ulisboa.pt
==========================
chewBBACA - AlleleCall
Started at: 2023-05-31T21:09:57
Input argument is not a valid directory or file with a list of paths to FASTA files. Please provide a valid input, either a folder with FASTA files or a file with the list of full paths to FASTA files (one per line and ending with one of the following file extensions: ['.fasta', '.fna', '.ffn', '.fa', '.fas']).
When I provide a folder with 1000 genomes, the same command works
The install was done using conda
I guess an option is to run the allele call by parts and then use "chewBBACA.py JoinProfiles". What would be your suggestion when using this command?
Thanks for having a look into this
Best regards,
Cristian
The text was updated successfully, but these errors were encountered: