Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a generic 'dataset' / 'collection' specification, and use that to replace 'root catalog' concept. #194

Closed
cholmes opened this issue Aug 23, 2018 · 10 comments
Assignees
Labels
major This will take substantial work / thinking to do. prio: must-have required for release associated with
Milestone

Comments

@cholmes
Copy link
Contributor

cholmes commented Aug 23, 2018

There was lots of good discussion at STAC sprint 3 and continuing in github after on collection / dataset level metadata.

Not everything is 100% sorted out, but we're homing in on a some good consensus.

The 'dataset' specification will be a more formal definition of the descriptive fields of the catalog (defined by the static catalog in the first sprint, and not really revisited much). This will be its own mini-spec, in its own folder, and other projects like GEE and OpenEO will be able to reuse it. The bulk of this work is being done right now in #164

This will likely replace the 'root catalog' concept in the static catalog, which was the idea that an item would link to the catalog that had meaningful information about it, instead of just the catalog organization that's immediately above it.

This will likely also obviate the need for the collection extension, so #174 may just result in removing it, if dataset covers all the needs of it.

Related issues + PR's
#81 #164 #35 #152 #137

@hgs-msmith
Copy link
Contributor

I've attached a UML diagram as an attempt to summarize the different metadata model approaches we have discussed. We can use this during today's (9/24/18) conference call.

STAC-model.pdf

@hgs-msmith
Copy link
Contributor

Here is an updated model - my attempt to incorporate the discussion during the 9/24/18 conference call.
STAC-model-060.pdf

@m-mohr
Copy link
Collaborator

m-mohr commented Oct 2, 2018

Thanks.
Why is contact, provider etc still in the catalog? That's not the current state AFAIK. The catalog should just hold name, description and links (and properties?!). All the additional fields in catalog belong to the dataset, at least in my proposal.

@hgs-msmith
Copy link
Contributor

Updated version, reflecting changes as of 11:00am EDT 10/2/18.
STAC-model-060.pdf

@m-mohr
Copy link
Collaborator

m-mohr commented Oct 5, 2018

"Migrated" from the chat: The model from @hgs-msmith is up-to-date with two minor exceptions:

  • license was moved from catalog to dataset
  • provider and license are removed from the properties.

m-mohr added a commit that referenced this issue Oct 5, 2018
@m-mohr
Copy link
Collaborator

m-mohr commented Oct 5, 2018

By the way, we may need/want to update the extents object later, see opengeospatial/ogcapi-features#168

@hgs-msmith
Copy link
Contributor

Here is an updated diagram with the changes noted by @m-mohr:
STAC-model-060.pdf

and +1 on pressing the extents updates with WFS.

@hgs-msmith
Copy link
Contributor

Finally, when we are settled on the final 0.6.0 model, let's find a better place to this diagram.

@m-mohr
Copy link
Collaborator

m-mohr commented Oct 8, 2018

Agreed - and this diagram will need some more work for the final 0.6.0 release (stac_version added to dataset, type added for provider, ...)

@cholmes
Copy link
Contributor Author

cholmes commented Oct 8, 2018

Closing, as I think this is all in.

@cholmes cholmes closed this as completed Oct 8, 2018
@cholmes cholmes changed the title Create a generic 'dataset' specification, and use that to replace 'root catalog' concept. Create a generic 'dataset' / 'collection' specification, and use that to replace 'root catalog' concept. Nov 6, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
major This will take substantial work / thinking to do. prio: must-have required for release associated with
Projects
None yet
Development

No branches or pull requests

3 participants