-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Saving pseudodata of replicas whit multiple fits #2138
base: master
Are you sure you want to change the base?
Conversation
Do you actually need this? We disabled this for multiple replicas due to issues with reproducibility. And the (main) issue is not that easy to solve, namely that in parallel replicas datasets with a single point must all enter training or validation. So if you generate the data in parallel then you cannot reproduce it with the vp functions to do so. If you need it we need to find a way to tag the data as having been generated in parallel. |
Thanks for starting this. I actually meant doing something along the lines of I suspect this will save the pseudodata all in the same folder since somehow the output folder is probably the nnfit/replica_1 folder, using the |
This is easy to solve by always including datasets with a single point in the training data. But indeed, we do need it because otherwise doing parameter determinations with CRM will be sketchy and with TCM will be impossible using GPU. |
I would not do that. I'd rather go for the tag. In any case, before continuing, could you check that the sequential and parallel fit (without taking into account the 1 point datasets) produce exactly the same pseudodata and trvl masks? (if that works we can think about the rest) |
Why? If we're serious about using GPU as well as CPU fits, then I'd say they should be as close as possible in behaviour, no? The choice of what to do with those datasets was a bit arbitrary and has been changed over time anyway. |
Then we should go for an actual solution instead of changing the behaviour every time it becomes an inconvenience. But first let's make sure that the rest works the same, then I'll take care of masking the 1 point datasets in the same way in both modes, eventually. |
When using parallel models, pseudodata are not saved for each replica, resulting in the following error when running multiple fits with n3fit.
This branch ensures that pseudodata for each replica are saved, even when
parallel_models=true
is set.