Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue with Ortholog abundance correction #8

Open
santoshe1 opened this issue Oct 7, 2021 · 4 comments
Open

Issue with Ortholog abundance correction #8

santoshe1 opened this issue Oct 7, 2021 · 4 comments

Comments

@santoshe1
Copy link

Hello,

I am getting an error shown below with the test dataset provided by Metalaffa. I tried increasing the latency period to 180 seconds and still I face the same error:-

Running MUSiCC... Loading data using pandas module... Traceback (most recent call last): File "/miniconda3/envs/metalaffa/bin/run_musicc.py", line 26, in <module> correct_and_normalize(vars(given_args)) File "/miniconda3/envs/metalaffa/lib/python3.6/site-packages/musicc/core.py", line 208, in correct_and_normalize genes = genes[genes_sort_ind] IndexError: arrays used as indices must be of integer (or boolean) type gzip: /tmp//ortholog_abundance_correction/orthologs.tab: No such file or directory mv: cannot stat '/tmp//ortholog_abundance_correction/orthologs.tab.gz': No such file or directory Error in job ortholog_abundance_correction while creating output file output/ortholog_abundance_correction/host_filter_type_default/duplicate_filter_type_default/quality_filter_type_default_max_info_90:0.7_min_len_60/map_reads_to_genes_type_default_method_blastx_top_percentage_1_evalue_cutoff_0.001_sensitivity_/hit_filter_type_default_method_best_hit/hit_filter_combine_type_default/gene_map_type_default_method_fractional/ortholog_map_type_default_method_fractional/ortholog_map_combine_type_default/ortholog_abundance_correction_type_default_method_musicc_musicc_method_['-n',_'-c',_'learn_model']/orthologs.tab.gz. MissingOutputException in line 357 of /scratch_ws/santosh_ws_env/metalaffa/metalaffa_test_example/Snakefile: Missing files after 5 seconds: output/ortholog_abundance_correction/host_filter_type_default/duplicate_filter_type_default/quality_filter_type_default_max_info_90:0.7_min_len_60/map_reads_to_genes_type_default_method_blastx_top_percentage_1_evalue_cutoff_0.001_sensitivity_/hit_filter_type_default_method_best_hit/hit_filter_combine_type_default/gene_map_type_default_method_fractional/ortholog_map_type_default_method_fractional/ortholog_map_combine_type_default/ortholog_abundance_correction_type_default_method_musicc_musicc_method_['-n',_'-c',_'learn_model']/orthologs.tab.gz This might be due to filesystem latency. If that is the case, consider to increase the wait time with --latency-wait. Exiting because a job execution failed. Look above for error message

@engal
Copy link
Collaborator

engal commented Oct 7, 2021

It looks like the issue is occurring when running MUSiCC (see first couple of lines in the traceback). When you say test dataset provided by MetaLAFFA, do you mean you downloaded one or more samples from HMP? Also, could you share the file that was being used as input to the abundance correction step? You should be able to find it in the following directory:

output/ortholog_map_combine/host_filter_type_default/duplicate_filter_type_default/quality_filter_type_default_max_info_90:0.7_min_len_60/map_reads_to_genes_type_default_method_blastx_top_percentage_1_evalue_cutoff_0.001_sensitivity_/hit_filter_type_default_method_best_hit/hit_filter_combine_type_default/gene_map_type_default_method_fractional/ortholog_map_type_default_method_fractional/ortholog_map_combine_type_default/

@santoshe1
Copy link
Author

Hello Engal,

Thank you for the response.

I am referring to the test dataset which you have provided in metalaffa page - <wget http://downloads.hmpdacc.org/data/Illumina/stool/SRS017307.tar.bz2 >

I am also attaching the requested input file to abundance correction step file.
orthologs.tab.gz

@engal
Copy link
Collaborator

engal commented Oct 14, 2021

Thanks for the additional info. I suspect that is is related to #7, so once that issue is resolved, I think you should try rerunning the prepare_databases.py script and then rerunning MetaLAFFA on your data. I'll keep you updated on the status of that issue.

@engal
Copy link
Collaborator

engal commented Nov 16, 2021

Sorry for the delay. I've just released an update for MetaLAFFA that should address the underlying issue. Please let me know if the issue persists or if there are any further issues.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants