Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The output files don't update #198

Open
Betula0422 opened this issue May 13, 2024 · 4 comments
Open

The output files don't update #198

Betula0422 opened this issue May 13, 2024 · 4 comments

Comments

@Betula0422
Copy link

Hi there,
I have been successfully run the script with your test dataset within two days. But when I use my data, my output files have not been updated for a week, but there are no errors in the log file. I can't figure out what happened. There are 46,200,228 contigs in my data. Is it because the data is too large? Looking forward to your reply.

Here is my script:

#!/bin/bash
source activate base
conda activate vs
virsorter run -w /data2/Betula/virus/virsorter2/test.out2 -i /data1/wxh/contigs.rnm.fa -d ~/database/virsorter2/db --min-length 5000 -j 28 --include-groups dsDNAphage,ssDNA --min-score 0.5 --use-conda-off all

Here is my log:

[2024-04-30 15:11 INFO] VirSorter 2.2.4
[2024-04-30 15:11 INFO] /data2/Betula/miniconda3/envs/vs/bin/virsorter run -w /data2/Betula/virus/virsorter2/test.out -i /data1/wxh/contigs.rnm.fa -d /data2/Betula/database/virsorter2/db --min-length 1500 -j 28 --use-conda-off all
[2024-04-30 15:11 INFO] Using /data2/Betula/miniconda3/envs/vs/VirSorter2/virsorter/template-config.yaml as config template
[2024-04-30 15:11 INFO] conig file written to /data2/Betula/virus/virsorter2/test.out/config.yaml

[2024-04-30 15:11 INFO] Executing: snakemake --snakefile /data2/Betula/miniconda3/envs/vs/VirSorter2/virsorter/Snakefile --directory /data2/Betula/virus/virsorter2/test.out --jobs 28 --configfile /data2/Betula/virus/virsorter2/test.out/config.yaml --latency-wait 600 --rerun-incomplete --nolock --quiet all
[2024-04-30 15:18 INFO] # of seqs < 1500 bp and removed: 43178270
[2024-04-30 15:18 INFO] # of circular seqs: 1017
[2024-04-30 15:18 INFO] # of linear seqs : 3020941
[2024-04-30 15:19 INFO] Finish spliting circular contig file with common rbs
[2024-04-30 15:20 INFO] Finish spliting linear contig file with common rbs
[2024-04-30 16:26 INFO] Step 1 - preprocess finished.
[2024-05-05 17:10 INFO] Step 2 - extract-feature finished.

@jiarong
Copy link
Owner

jiarong commented May 14, 2024

Hi, your data is quite large. I see the step 2 (the longest step) was finished a week ago. The last step should NOT take this long. Can you check which process it is stuck at? (using top command). Also note that if you this run errors out, you can just rerun the same command, VirSorter2 can pick up where it stops and continue.

@Betula0422
Copy link
Author

Betula0422 commented May 14, 2024 via email

@jiarong
Copy link
Owner

jiarong commented May 15, 2024

It shows that VirSorter2 is still processing the prophage extraction step, NOT stuck due to errors. For future runs, you can split the large input file and process them in parallel.

@Betula0422
Copy link
Author

Betula0422 commented May 15, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants