Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ISO 19115 metadata is converted to lossy ISO 19139 format when ingested by CKAN #769

Closed
adborden opened this issue May 2, 2019 · 9 comments
Assignees
Labels
bug Software defect or bug component/catalog Related to catalog component playbooks/roles xml-transform

Comments

@adborden
Copy link
Contributor

adborden commented May 2, 2019

Geospatial datasets in ISO 19115 lose metadata when CKAN harvests them and converts them to ISO 19139.

How to reproduce

Expected behavior

Actual behavior

@adborden
Copy link
Contributor Author

adborden commented May 2, 2019

Would be good to grab a specific example and list out the metadata fields that are lost.

@adborden adborden added the component/catalog Related to catalog component playbooks/roles label May 2, 2019
@adborden
Copy link
Contributor Author

Some background from @JJediny

Thread on FGDC call https://github.com/ckan/ckanext-spatial/tree/master/ckanext/spatial/validation/xml

johnjediny 3 hours ago
current ckan support for xml harvesting metadata

johnjediny 3 hours ago
think the current issue is how we’re handling our harvest sources transformations into ckan tables that that common schema we conform to should be updated to 19115-3 as a new planned common core that would be compatible with pyCSW without having to risk losing fields in our current transformation.

johnjediny 3 hours ago
im not so up to speed on the best ways to go about approaching that re flask/spatial-ext/scheming-ext

johnjediny 3 hours ago
if they want us to temporarily update the xlst(s) we current install with the current java transformation that might be the least-work fix

johnjediny 3 hours ago
Here is the 19115 v3 schema project https://github.com/ISO-TC211
ISO/TC 211
Repositories
27
@ISO-TC211 | May 26th, 2014 | Added by GitHub

johnjediny 3 hours ago
artic metadata project re existing mapping work https://github.com/adiwg/mdJson-schemas
adiwg/mdJson-schemas
JSON schemas, examples, and templates for ADIwg metadata standards
Website
http://www.adiwg.org/projects/
Stars
7
adiwg/mdJson-schemas | Feb 18th, 2014 | Added by GitHub

johnjediny 3 hours ago
This was created by Micah.Wengren@noaa.gov https://docs.google.com/spreadsheets/u/1/d/19L89sgSijpB9nvaWjgGTSo3m3r06zuXKq9C5QwcbgRs/edit

@JJediny
Copy link
Member

JJediny commented Jul 25, 2019

Best long term solution may be to convert to using 19115-3 as CKAN's core schema. Which could be accomplished using https://github.com/ckan/ckanext-scheming w/ JSON schemas that have already been prepared for the 19115-3 FGDC profile - https://github.com/adiwg/mdJson-schemas/tree/master/schema as there is a good amount of work done mapping DCAT to ISO 19115 it might be easier to map data.json into 19115 - https://www.w3.org/2015/spatial/wiki/ISO_19115_-_DCAT_-_Schema.org_mapping

@adborden adborden added the bug Software defect or bug label Feb 8, 2020
@hkdctol hkdctol moved this to Icebox in data.gov team board May 18, 2022
@nickumia-reisys
Copy link
Contributor

Is this still relevant? @GSA/data-gov-team (specifically @FuhuXia and @jbrown-xentity)

@nickumia-reisys nickumia-reisys moved this to 👀 Needs Review [2] in data.gov team board Aug 11, 2023
@jbrown-xentity
Copy link
Contributor

This is still relevant. Related to the transformation from CSDGM (FGDC standard) into ISO, done in ckanext-geodatagov. Hopefully this will be addressed with the https://github.com/adiwg/mdTranslator tool, but still TBD.

@hkdctol
Copy link
Contributor

hkdctol commented Aug 17, 2023

@jbrown-xentity @nickumia-reisys I will put this back in icebox then?

@nickumia-reisys
Copy link
Contributor

I had brought this back up because of the harvesting requirement derivation/documentation. I wasn't sure if this was still a consideration we needed to address. Since it is, I'll pull the relevant aspects and we can move it back to the icebox.

@nickumia-reisys nickumia-reisys moved this from 👀 Needs Review [2] to 🧊 Icebox in data.gov team board Aug 21, 2023
@gujral-rei
Copy link

gujral-rei commented May 28, 2024

@jbrown-xentity will we address this with #4639 and #4564 ?

@jbrown-xentity
Copy link
Contributor

@jbrown-xentity will we address this with #4639 and #4564 ?

That is the hope. However, I always believed this "error" was actually just a bad UI representation of the data; all of the source data was accessible "as is" without transform, as well as the CKAN view.

@github-project-automation github-project-automation bot moved this from 🧊 Icebox to ✔ Done in data.gov team board May 29, 2024
@gujral-rei gujral-rei moved this from ✔ Done to 🗄 Closed in data.gov team board Jun 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Software defect or bug component/catalog Related to catalog component playbooks/roles xml-transform
Projects
Archived in project
Development

No branches or pull requests

7 participants