Request of clarifications about processed data in cpg0019
#360
In particular, perhaps this is related?
Oops, I realized that the training data is composed of subsets of those datasets, as stated in the paper:
Still, could you please suggest possible ways to acquire the full processed datasets? If they are not readily available, could you please point me towards the script/notebook that would generate the processed images from the raw ones? Thanks!
Hi @jasperhyp, the datasets were processed with the compression pipeline of DeepProfiler (full images), and single cells were then extracted from those compressed images. The full datasets can be found in the corresponding CPG/BBBC entries. We did not share this intermediate data.
Hi @Arkkienkeli, thank you for the clarification! Currently, I am primarily interested in the BBBC036 dataset. If I understand correctly, there are no single-cell crops for the full BBBC036 dataset, and here are the processing steps to convert original images to processed single-cell crops as in
Could you confirm whether the above understanding is correct?
Edit: I am now considering using random crops instead of single-cell crops, as that requires less preprocessing effort. In that case, only Step 1 (compression + illumination correction) needs to be run. Could you kindly point me towards the correct JSON configuration to use for
Thank you very much for your time and effort in sharing all these!
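For readers weighing the random-crop option, here is a minimal sketch of sampling crop windows that lie fully inside an image. It is not DeepProfiler's own cropping code. The function name `sample_crop_boxes`, the image size, and the crop size are all hypothetical choices made for illustration.

```python
import random


def sample_crop_boxes(height, width, crop_size, n_crops, seed=None):
    """Return n_crops boxes (top, left, bottom, right) fully inside the image.

    Hypothetical helper for illustration; not part of DeepProfiler.
    """
    if crop_size > height or crop_size > width:
        raise ValueError("crop_size must not exceed the image dimensions")
    rng = random.Random(seed)  # local generator so results are reproducible
    boxes = []
    for _ in range(n_crops):
        # randint is inclusive on both ends, so the crop can touch the border
        top = rng.randint(0, height - crop_size)
        left = rng.randint(0, width - crop_size)
        boxes.append((top, left, top + crop_size, left + crop_size))
    return boxes


# Assumed image and crop dimensions, chosen only for the example.
boxes = sample_crop_boxes(height=1080, width=1080, crop_size=128, n_crops=4, seed=0)
for box in boxes:
    print(box)
```

Each box can then be used to slice the compressed, illumination-corrected image array, for example `image[top:bottom, left:right]`.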
It would be great if RunDeepProfiler could be made functional so that it runs in the same pipeline as CellProfiler.
Hi! Thanks for creating this great resource. I was aware that the processed datasets used in this study have been uploaded to CPG as indicated here -- thanks for sharing!
I was wondering if you could kindly clarify whether the processed dataset only contains the training split (but not the validation split, so that it is not the full processed dataset, e.g. cpg0012), since the parent folder is named training_images (e.g. cpg0019-moshkov-deepprofiler/broad/training_images/BBBC036/). (See third comment.) I would also appreciate it if you could suggest possible ways to acquire the full processed datasets. If that's not readily available, could you please kindly point me towards the script/notebook that would generate the processed images from raw ones?
Also, it seems that there are far fewer folders in the processed BBBC036 compared with the original cpg0012 images as in here. For example, 24277 is not in the processed version of BBBC036/CDRP, and even in 24278, there are many subfolders missing in the processed dataset compared with the raw dataset. Could you please clarify?
Thank you very much in advance! Happy New Year!
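One way to make the folder comparison above concrete is a set difference over the two listings. This is a minimal sketch, assuming both trees have been listed or downloaded locally. The helper names `list_subfolders` and `missing_entries` are hypothetical, and the two example listings contain only the plate IDs mentioned in this issue, not a real directory listing.

```python
from pathlib import Path


def list_subfolders(root):
    """Return the sorted names of the immediate subfolders of a local directory."""
    return sorted(p.name for p in Path(root).iterdir() if p.is_dir())


def missing_entries(raw_names, processed_names):
    """Return the sorted names present in the raw listing but absent from the processed one."""
    return sorted(set(raw_names) - set(processed_names))


# Illustrative listings built only from the plate IDs named in this issue.
raw_plates = ["24277", "24278"]
processed_plates = ["24278"]

missing = missing_entries(raw_plates, processed_plates)
print(missing)  # ['24277']
```

With local copies of both trees, `list_subfolders` could supply the two listings, and the same comparison can be repeated one level down to find the subfolders missing within a plate such as 24278.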