-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add example configs #166
Add example configs #166
Conversation
55f0969
to
e91fe62
Compare
e91fe62
to
2512562
Compare
examples/cli_workflow/README.md
Outdated
|
||
### data | ||
|
||
Configure the `DistributedAnnDataCollection`. Here we validate `obs` columns that are used by the transforms and the model (`total_mrna_umis`): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Say a word about what it means to "validate` an obs column.
> attr: model.var_names_g | ||
> convert_fn: numpy.ndarray.tolist | ||
``` | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe add a note below that cellarium-ml does not perform any validation on the content of data loaded from the checkpoints, or whether it is consistent with the rest of the configuration. For example, the mean and std were calculate in a prior stop from data that was subject to normalize total and log1p transform. If the user inadvertently forgets to perform the same transforms here before z-scoring, the workflow will run w/o any error though will produce wrong results.
> n_components: 50 | ||
> perform_mean_correction: true | ||
``` | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Again, add a sticky note that since we have z-scored the data, mean correction is not strictly necessary but its presence may help mitigate roundoff errors. It is a good exercise for the user to ascertain that the learned mean by IPCA is close to 0/
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I setperform_mean_correction: true
by mistake. It should be false
in this case because of the z-score!
examples/cli_workflow/README.md
Outdated
|
||
### train | ||
|
||
Change the number of devices, change strategy to `ddp_find_unused_parameters_true` (because trained PCA model contains parameters that are fixed during training), set the number of epochs, and set the path for logs and weights: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
perhaps give a reference to PL doc related to this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nicely done, just a few small suggestions.
8546bd4
to
9685908
Compare
Resolves #114