-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add clientpath to Filesets #12
Conversation
|
After running for nearly 8 hours, we have reached 14 plates into @joshmoore I wonder if we could teach the Then, for each row in the array, e.g.
we'd need to be able to generate the clientpath within the Is that possible within |
Still running...
This is taking 3 minutes per Fileset just now.... |
|
I'm not sure I fully understand but in general you can do anything with SQL if slightly more verbosely. I like your idea of templating the output, but there would still need to be checks for the existence of the files, no? |
This reverts commit edf7be3.
Having experimented with trying this in This also means that we don't need the complex logic to resolve clientpath from e.g.
|
Tested at IDR/idr-utils#56 (comment)
Re: @joshmoore "checks for the existence of the files" - I'm not sure what you mean, but in that example the |
Since existing
FilesetEntry.clientpath
values are set tounknown
formkngff
Filesets, and we also don't have any reference to the original source of the data, we can set this value to something more useful.This PR adds a
--clientpath
option which is a path or URL to the Fileset e.g.https://s3-server/bucket/data.zarr
that corresponds to the mounted s3 Fileset/dir/path/to/data.zarr
.This enables the creation of a
clientpath
for every file found under the mounted Fileset.E.g.
This creates
sql
output with a 4thclientpath
item in each sql ROW. If the--clientpath
option is not used as above then the placeholderunknown
is added to each ROW in thesql
, which results in the same outcome as before.Tested at IDR/idr-utils#56 (comment)