-
Notifications
You must be signed in to change notification settings - Fork 111
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BUSCO update #77
Comments
If all possibilities would be implemented, I'd think the most requested cases would be |
UPDATE: With BUSCO version 4.0.6 there are frequent, non-reproducible errors occurring, caused by a replace("faa", "fna") function corrupting nextflow filenames that contain the substring "faa" in their hash id. I prepared a fix for BUSCO (https://gitlab.com/ezlab/busco/-/issues/305). Waiting currently. We need a new BUSCO release before preparing pipeline release. |
Hi @skrakau . Are you referring to this type of error? I am getting a lot of these with BUSCO version 3.0.2when running mag with revision: 8586c49 [dev]
|
Hi @ropolomx, to be precise, the problem I described was for BUSCO 4 versions, which prevented us from updating BUSCO to solve some issues. But in BUSCO 3.0.2 there was a related problem, which can cause such errors as you described above (BUSCO itself did not return an error, but since an output file is missing, the downstream |
Solved in #103 :) |
BUSCO v3.0.2 is currently failing on some datasets, among others on the
test
profile whenscratch = false
. Somehow the results and thrown errors differ betweenscratch = true
andscratch = false
(on CFC), which we cannot not explain currently. In the past such errors were ignored, which was changed now (see #68 and #72).Moreover it seems that if no tblastn hits are found, this causes an error and is not handled properly.
To achieve more control, update BUSCO to v4.0.6 and handle the case of no tblastn hits. Use a parameter to pass over the path to an already downloaded db (test if this works with
--offline
), the name of the db for automatic download or the auto-lineage parameterHowever, currently there is an issue with the download of the BUSCO databases: https://busco.ezlab.org/frames/bact.htm.
See also https://gitlab.com/ezlab/busco/-/issues/293. So I need to wait until this works again, to test this.
Additionally, for offline use, one can also download the whole dataset https://busco-data.ezlab.org/v4/data/ and add the path to the custom config file. I think this should work both with
--lineage_dataset bacteria_odb10
and--auto-lineage
The text was updated successfully, but these errors were encountered: