-
Notifications
You must be signed in to change notification settings - Fork 45
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Step target example broken #983
Comments
In this example, you have
and then
so are you depending on a subworkflow? |
Yes. I wonder if this is okay, because I believe it used to work. |
I never knew it worked, so I would say it was an undefined behavior and we need to formalize now. It is like triggering a subworkflow from dependents. I mean, if you can do
instead of
why would not we allow formal workflow here?
|
Sorry I'm a bit confused. Is this a proposal? It seems reasonable, if not difficult ... right? |
I do not know how difficult it is. I am just saying that the behavior was undefined before and triggers an error now, and if we are to support it, we are essentially allowing the triggering of an entire workflow, not just |
I see. I'm okay with the name But do we want to support it? It seems an natural extension and feature; but if it is not a robust feature and can cause potential problems we then would rather not have it? Currently we do not have an immediate need for it -- in the RNA-seq DE example I wanted to demonstrate the feature. Though I can also try to use file targets, the logic seems more intuitive with this dependency on the mini-workflow. What do you think? |
In our manuscript, I specifically said something like: the trunk of the workflow can be outcome based where process-oriented subworkflows are triggered ... So conceptually speaking this is what we want to do, and we do not have an explicit syntax for it. |
Indeed ... I guess we should formalize it. I think you've make it clear about the behavior, but we need new syntax like |
Yes, |
|
In your case
would not work now, but
would work. I am wondering if we can allow something like
It is conceptually not a workflow, but multiple |
I agree with not messing too much with the code at this moment until there is a real need. For mini-workflow, one can always do
right? then it should be good to claim handling mini-workflows. But I think the |
There is a minor difference here: we claim the "connectedness" of workflow steps so in So in the end there might be a difference between
and
although the latter might just work in a majority of the cases. |
Okay, but I guess if |
ok, I can ensure order so The problem with the implementation is that we are using separate DAGs for |
Hmm now I guess my nested dependency has made things more complicated:
But |
Unfortunately,
and it throws |
Okay, now I get this error:
|
Trying to figure out how to trigger this bug from the test case
|
It was caused by the |
At this point I was able to test and verify separately the 2 commands:
but
by itself still fails. See this thread for details: #983 (comment) |
I see
I suppose this is because the Large file stuff is outside of the current working directory, right? Note that we do not mount home by default now. |
@BoPeng yes. and I already updated the notebook to reflect the change, and added a note. Please pull the latest version. currently the |
On my machine, I got the error for 2G ram, then also for 11G, and I am increaging to 20G. The error message is clear enough though.
|
Yes, I have made a note that it requires 30g+ memory. My computer is 64G. This example is meant to be reproduced so I didn't make it external task. The other RNA seq example is external task with 96G memory requirement not meant to be reproduced. |
Come on, I only ordered 32G of RAM for my new mac and thought that it would be enough even for virtual machines... Now I can not run the star step. |
Well I guess that is why many people want to use tophat instead. But there is a sparse mode in STAR that will half the memory. I believe I have implemented its interface in this notebook too. I assume users can reserve an interactive cluster node for the full version though? I will make a note |
BTW there should be no need to use VM for this one because I think most tools are dockerized. Maybe it fits if you don't use VM? My desktop is a $2600 Linux machine of 40 threads 64G memory ... |
should be fixed. |
Separated from a private correspondence: the RNA-seq DE example workflow now is broken. To reproduce:
Error message:
This used to work, though. I guess maybe it is similar to issue in #981 introduced due to recent changes of target dependency codes.
The text was updated successfully, but these errors were encountered: