-
Notifications
You must be signed in to change notification settings - Fork 2
Usage decisions #1
Comments
My current thinking for the command line interface:
$ cytominer-transport . experiment.parquet \
--compartment foo.csv \
--compartment bar.csv \
--compartment baz.csv \
--experiment experiment.csv \
--images images.csv If
the command can be shortened to:
|
I like it! tagging @bethac07 |
looks good to me. In cytominer-database, we had an Also, if we have noncanonical compartments, would your compartment-naked option be:
|
Yes. The user can specify any filename they'd like for |
parquet is 100% compatible with pandas, so no need IMO |
I'm very much in favor of scrapping SQLite
I agree |
@gwaygenomics I was thinking something like: to_pandas(experiment: Path, image: Path, compartments: List[Path]) -> pandas.DataFrame |
@gwaygenomics and @shntnu Would you still like a public |
Oh – and this is a big one because
|
Yeah, I think so. I think our lab will mostly use this package via command line (assay devs final step before handoff), but I can imagine a scenario in which someone would want to run an image-based profiling pipeline end-to-end in python |
That's wise to have a public API. One use case I can think of: future profiling recipes can use it directly. |
ditto |
Enabling this will likely result in a 5x performance improvement off the bat because our current approach is to mount the bucket using using |
This comment has been minimized.
This comment has been minimized.
@shntnu Good to know. Where does your n (i.e., 24) come from? It can be arbitrary, but ideally, it would correspond to some structural detail of the experiment. Would you mind putting this in a separate issue for tracking purposes? |
So is the idea that you would call this on a folder of data, or on one subfolder at a time? AKA if our structure is the below, is
|
(I'm fine with either behavior, as long as we're all on the same page as to what it is; in cytominer-database, we'd be calling it on |
Not sure if the q is for @0x00b1 but I think we should call it on a folder of data; I have a related implementation comment here #2 (comment) |
How shall users interact with the codebase? Let's track our thoughts and decide here
The text was updated successfully, but these errors were encountered: