Create ARGO templates #16
These are notes at this stage, and incomplete. Mark as draft until we have a complete and valid example.
Thanks, changed to Draft.
Can you please identify the mandatory fields in https://github.com/iodepo/odis-in/blob/master/dataGraphs/thematics/dataset/graphs/datasetTemplate.json? Many of them don't make sense for Argovis; I need a clearer picture of what's required versus optional in order to produce an MVP.
Some questions on specific properties in the dataset template:
Thanks for your help on this. I think the biggest sticking point is that we're more of a data service layer than a static blob of data, which makes some of these keys an awkward fit. Once we can resolve that in a way that respects and represents Argovis' intended usage, the rest will be pretty easy.
@bkatiemills Just to confirm, in your messages here you are pointing to the general dataset template, but the ARGO dataset template that we created together lives at datasetTemplate-ARGO.json.
@pbuttigieg pinging you here so that you notice @bkatiemills's questions above.
@jmckenna thanks for the ping
The 'url' property is intended for something like a landing page for the dataset, or any Web resource that's dedicated to that dataset. If you don't produce these, you can omit this property. The suggestion in the meeting was more for a Service type (rather than Dataset): there, the url would point to the service's landing page.
You can use any semantic resource you think is appropriate, or just strings. There's some documentation here, but there are a few updates pending, summarised below.
"keywords": [
"string",
{
"@type": "DefinedTerm",
"inDefinedTermSet": "http://purl.org/dc/terms/DCMIType",
"termCode": "Image",
"name": "Image",
"identifier": "http://purl.org/dc/dcmitype/Image"
}
]
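A minimal Python sketch of how such a mixed keywords list could be assembled, assuming the record is built from plain dicts and serialised with the standard json module. The helper name defined_term is hypothetical; the values are copied from the snippet above.

```python
import json

def defined_term(name, term_set, identifier, term_code=None):
    """Build a schema.org DefinedTerm dict (hypothetical helper, not part of any ODIS tooling)."""
    return {
        "@type": "DefinedTerm",
        "inDefinedTermSet": term_set,
        # Assumption: fall back to the name when no separate term code is given.
        "termCode": term_code if term_code is not None else name,
        "name": name,
        "identifier": identifier,
    }

# Plain strings and DefinedTerm objects can sit side by side in the same list.
keywords = [
    "string",
    defined_term(
        "Image",
        "http://purl.org/dc/terms/DCMIType",
        "http://purl.org/dc/dcmitype/Image",
    ),
]

print(json.dumps({"keywords": keywords}, indent=2))
```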
Perhaps we should discuss this in a call. The idea of using the Dataset type is that users can find units of data that the node (your system) wants to highlight. In this model, I would create an individual dataset record for every chunk of Argo data you'd like others to see as an output of your service. An API call to retrieve that chunk is a valid value for contentUrl. If you'd prefer not to share dataset-level records, that's fine too: you can just share one Service or WebAPI record for ArgoVis. This will reduce discoverability, but, as you say, may be more appropriate to guide users to the experience you want them to have.
I'd use the box. @jmckenna you're the FOSS4G dude, better advice? WKT or GeoJSON preferred?
@bkatiemills @pbuttigieg regarding the spatialCoverage, there are some very important points to be aware of:
"spatialCoverage": {
"@type": "Place",
"geo": {
"@type": "GeoShape",
"box": "-90 -180 90 180"
},
"additionalProperty": {
"@type": "PropertyValue",
"propertyID": "http://dbpedia.org/resource/Spatial_reference_system",
"value": "http://www.w3.org/2003/01/geo/wgs84_pos"
}
},
I'm not sure if I answered your question, but keep that in mind anyway.
Thanks @jmckenna - wasn't there an issue with the WGS84 link pointing explicitly to lat lon? Perhaps we should remove that suffix.
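Since a box value is just a string, a small check before publishing can catch slips in it. The following is a rough Python sketch, assuming the "miny minx maxy maxx" (latitude first) ordering noted later in this thread; the function names parse_box and is_degenerate are hypothetical. A box whose two corners share a latitude or a longitude, such as "-90 -180 90 -180", collapses to a line and is flagged.

```python
def parse_box(box):
    """Parse a GeoShape box string of the form 'miny minx maxy maxx' (assumed order)."""
    parts = box.split()
    if len(parts) != 4:
        raise ValueError(f"expected 4 numbers, got {len(parts)}: {box!r}")
    miny, minx, maxy, maxx = (float(p) for p in parts)
    if not (-90 <= miny <= maxy <= 90):
        raise ValueError(f"latitudes out of range or out of order: {box!r}")
    # Longitudes are only range-checked: a box crossing the antimeridian
    # may legitimately have minx > maxx.
    if not (-180 <= minx <= 180 and -180 <= maxx <= 180):
        raise ValueError(f"longitudes out of range: {box!r}")
    return miny, minx, maxy, maxx

def is_degenerate(box):
    """Return True when the box has zero height or zero width."""
    miny, minx, maxy, maxx = parse_box(box)
    return miny == maxy or minx == maxx

print(is_degenerate("-90 -180 90 -180"))  # True: both corners sit at longitude -180
print(is_degenerate("-90 -180 90 180"))   # False: spans the whole globe
```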
Thanks for your feedback here, folks - please see https://gist.github.com/bkatiemills/75efe5e9d6e67d8aa7f5add617e6591c for a schematic of where we're at. Can you read this over and make sure we're not going wildly off the rails here? Also we need input on the Once this schematic is looking correct, I can write some scripts to fill in the things that need nightly updating and provide you with a URL to fetch. |
@bkatiemills today both teams reviewed your Gist together and made some changes, below:
{
"@context": {
"@vocab": "https://schema.org/"
},
"@type": "Dataset",
"@id": "https://registry.org/permanentUrlToThisJsonDoc",
"name": "Argovis' representation of the Argo dataset",
"description": "Argovis provides a representation of the profiles collected over the lifetime of the Argo program. This representation is intended to present an interpretation of Argo data that is lightly simplified from the original product, but still appropriate for a large majority of scientific and educational use cases. Simplifications include presenting delayed (better corrected and QCed) mode data where available; presenting interpolated biogeochemical data only; and merging core and biogeochemical data collected in parallel into unified oceanic profiles.",
"url": "https://github.com/argovis/demo_notebooks/blob/main/Intro_to_Argovis.ipynb",
"license": "MIT", // should be more complete, the full name of the license or the link to it
"citation": [
"Tucker, T., D. Giglio, M. Scanderbeg, and S.S.P. Shen: Argovis: A Web Application for Fast Delivery, Visualization, and Analysis of Argo Data. J. Atmos. Oceanic Technol., 37, 401–416, https://doi.org/10.1175/JTECH-D-19-0041.1",
"Wong, A. P. S., et al. (2020), Argo Data 1999–2019: Two Million Temperature-Salinity Profiles and Subsurface Velocity Observations From a Global Array of Profiling Floats, Frontiers in Marine Science, 7(700), doi: https://doi.org/10.3389/fmars.2020.00700",
"Argo (2000). Argo float data and metadata from Global Data Assembly Centre (Argo GDAC). SEANOE. https://doi.org/10.17882/42182"
],
"creator": "", // can be an array, with Person or Organisation types
"version": "<timestamp to be updated on db write>",
"keywords": [
"Argo",
"ocean profiles",
"temperature",
"salinity",
"pressure",
"ocean biogeochemistry"
],
"measurementTechnique": "http://www.argodatamgt.org/Documentation",
"variableMeasured": [
{
"@type": "PropertyValue",
"name": "<name from data_info[0]>",
"url": "Perhaps a link to the ADMT docs that explain their variables?",
"description": "<long name from data_info[2]>",
"unitCode": "<units from data_info[2]>"
},
// ... to be enumerated for all variables
],
"includedInDataCatalog": {
"@type": "DataCatalog",
"url": "https://argovis.colorado.edu/citations"
},
"temporalCoverage": "<min year>/<max year>", // we can consider using "to present" (is it "now"? to check how to do this) or (more accurate?) just update this to exact ISO timestamp every day
"distribution": {
"@type": "DataDownload",
"url": "https://argovis.colorado.edu/argourlhelper",
"description": "Argovis provides no direct download of the dataset described in this record as it is too large to download in one click; however, please visit https://argovis.colorado.edu/argourlhelper to dynamically access your own subset of data"
},
"spatialCoverage": {
"@type": "Place",
"geo": {
"@type": "GeoShape",
"box": "-90 -180 90 180" // miny minx maxy maxx
},
"additionalProperty": {
"@type": "PropertyValue",
"propertyID": "http://dbpedia.org/resource/Spatial_reference_system",
"value": "http://www.w3.org/2003/01/geo/wgs84_pos"
}
},
"provider": [
{
"@type": "Organization",
"legalName": "University of Colorado Boulder",
"name": "Department of Atmospheric and Ocean Science",
"url": "https://www.colorado.edu/atoc/"
}
]
}
@jmckenna thanks for your feedback! The gist is updated to reflect it. I have no outstanding questions here; the only remaining blanks are things to be filled in by the nightly update scripts (variables present, temporospatial extents). I'll try and find some time to implement this soon, and provide you with a URL you can scrape and tell me if the finished product is as expected.
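For illustration only, a nightly fill-in step along these lines could look like the Python sketch below. It is not the actual Argovis script. It assumes data_info is a three-element list of [variable names, column labels, one row of values per variable], with "units" and "long_name" among the column labels; the function name fill_template and the sample values are hypothetical.

```python
import copy
from datetime import datetime, timezone

def fill_template(template, data_info, min_year, max_year, now=None):
    """Return a copy of the JSON-LD template with the nightly fields filled in."""
    names, columns, rows = data_info  # assumed layout, see the note above
    units_i = columns.index("units")
    long_i = columns.index("long_name")

    record = copy.deepcopy(template)  # leave the template itself untouched
    now = now or datetime.now(timezone.utc)
    record["version"] = now.strftime("%Y-%m-%dT%H:%M:%SZ")
    record["temporalCoverage"] = f"{min_year}/{max_year}"
    record["variableMeasured"] = [
        {
            "@type": "PropertyValue",
            "name": name,
            "description": row[long_i],
            "unitCode": row[units_i],
        }
        for name, row in zip(names, rows)
    ]
    return record

# Illustrative values only, not real Argovis output.
template = {"@type": "Dataset", "version": "<timestamp to be updated on db write>"}
data_info = [
    ["temperature", "pressure"],
    ["units", "long_name"],
    [["degree_Celsius", "Sea temperature"], ["decibar", "Sea pressure"]],
]
filled = fill_template(template, data_info, 1997, 2023)
print(filled["temporalCoverage"], len(filled["variableMeasured"]))
```

Returning a copy rather than editing in place keeps the placeholder template reusable from one nightly run to the next.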
Ok team, here's a first production attempt at a blob of jsonld for the argo collection, lmkwyt: https://argovis-api.colorado.edu/summary?id=argo_jsonld&key=jsonld |
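One quick sanity check on a generated record like this is to look for template placeholders that never got filled in. A minimal Python sketch, assuming placeholders follow the "<...>" convention used in the template earlier in this thread; the function name find_placeholders is hypothetical, and the sample record is illustrative rather than the real API response.

```python
import re

# Matches strings that start with "<" and end with ">", e.g. "<min year>/<max year>".
PLACEHOLDER = re.compile(r"^<.*>$")

def find_placeholders(node, path=""):
    """Recursively collect the paths of string values that still look like placeholders."""
    found = []
    if isinstance(node, dict):
        for key, value in node.items():
            found.extend(find_placeholders(value, f"{path}/{key}"))
    elif isinstance(node, list):
        for i, value in enumerate(node):
            found.extend(find_placeholders(value, f"{path}/{i}"))
    elif isinstance(node, str) and PLACEHOLDER.match(node.strip()):
        found.append(path)
    return found

# Illustrative record; in practice this would be the parsed JSON body of the response.
record = {
    "name": "Argovis' representation of the Argo dataset",
    "version": "<timestamp to be updated on db write>",
    "temporalCoverage": "<min year>/<max year>",
    "variableMeasured": [{"name": "<name from data_info[0]>", "unitCode": "decibar"}],
}
print(find_placeholders(record))
```

An empty list means no obvious placeholders remain; it says nothing about whether the filled-in values are correct.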
@bkatiemills thanks, looks good. I think for the ODIS front-end search, 2 very useful parameters missing are |
Thanks @jmckenna - do you think we can just change How does |
I recommend changing the
(in the ODIS front-end search results creditText is displayed literally as "Recommended Citation") Yes I would just change to using only the |
(link to |
@jmckenna sounds good, those suggestions will be reflected in tonight's update. We've also made the sitemap and cat entry as discussed; please let us know any further steps needed.
Hi team - please let us know when our Argo record appears in your datasets list so we can confirm we hit all the requirements correctly; if there's something missing, also please let me know.
Hi folks - we still don't see argovis appearing at https://oceaninfohub.org/results/Dataset?search_text=argovis&page=0 - is something wrong on our end we can address? Am I looking in the wrong place?
templates generated through the meeting:
other notes from meeting:
additionalProperty for other fields so that they are visible
spatialCoverage type, instead of polygon you can alternatively use box (bounding box coordinates)
source metadata snippet for a Float:
related to iodepo/odis-arch#404