Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error during Casboundary #5

Open
almutwerner opened this issue Apr 26, 2022 · 2 comments
Open

Error during Casboundary #5

almutwerner opened this issue Apr 26, 2022 · 2 comments

Comments

@almutwerner
Copy link

Hello everyone!

I am running CRISPRloci in dna mode over a multi fasta and get the following error:
`
/work_beegfs/sunam157/CRISPRloci/CRISPRloci-1.0.0/Casboundary
Running Prodigal on input Fasta file.
Generating Genome DataFrame
Searching for potential signature proteins
Extracting potential regions.
Traceback (most recent call last):
File "Casboundary.py", line 384, in
regions_fasta_file = extract_regions_sequences(proteins_fasta_file, regions_dataframes, args.output_dir)
File "Casboundary.py", line 125, in extract_regions_sequences
proteins_ids_regions = set.union(*regions_prot_ids)
TypeError: descriptor 'union' of 'set' object needs an argument

###################################################################################
########## CRISPR: Master Script #########
###################################################################################

mkdir /work_dagan/sunam157/BGR_2013_2015/CRISPRloci/output/NODE_457_length_14276_cov_6173406__bin10_BGR2_DNA_150204_S0
res local folder /work_dagan/sunam157/BGR_2013_2015/CRISPRloci/output/NODE_457_length_14276_cov_6173406__bin10_BGR2_DNA_150204_S0
input file /work_dagan/sunam157/BGR_2013_2015/CRISPRloci/output/NODE_457_length_14276_cov_6173406__bin10_BGR2_DNA_150204_S0.fasta
Traceback (most recent call last):
File "/work_beegfs/sunam157/CRISPRloci/CRISPRloci-1.0.0/CRISPRloci_standalone.py", line 1447, in
if len(os.listdir(dirname_cas + '/predictions/raw_cassettes/')) != 0:
FileNotFoundError: [Errno 2] No such file or directory: '/work_dagan/sunam157/BGR_2013_2015/CRISPRloci/output/NODE_457_length_14276_cov_6173406__bin10_BGR2_DNA_150204_S0/tmp/output-Casboundary/predictions/raw_cassettes/'
`

I briefly checked the output-Casboundary/hmmsearch_output_dir/Sig_HMM/ contents and didn't see any hits in the files. So I maybe, Casboundary was confused because there were no hits?
I also ran it on the example data. There it completed that step without issues. I will add the failed fasta to this issue, so you can better trace the error.

NODE_457_length_14276_cov_6173406__bin10_BGR2_DNA_150204_S0.zip

@Alexander-Mitrofanov
Copy link
Collaborator

Yes it seems that the problem exactly with the Casboundary module.
It seems that the output is empty and therefore the error is occurring.

Can I kindly ask you to provide the labled intervals for the given input sequence if you know that certain cassettes are missing?

@almutwerner
Copy link
Author

Hello, thank you for your reply!
I have no idea whether the sequence contains anything or not. That was just my guess based on the HMM output. I will attach the whole output folder, maybe that is helpful to you.

NODE_457_length_14276_cov_6173406__bin10_BGR2_DNA_150204_S0.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants