Add dct identifier to publisher #301

hcvdwerf · 2024-09-09T11:30:14Z

The system previously lacked support for storing and serializing publisher identifiers, such as ROR IDs, ORCIDs, or other unique identifiers for publishers. While other metadata fields like publisher name, email, and URL were being correctly parsed and serialized, the identifier field (e.g., dct:identifier) was not being included in the publisher’s metadata. This omission resulted in incomplete RDF metadata and affected datasets that needed to capture a unique reference to the publisher.

kburger · 2024-09-09T12:59:54Z

ckanext/dcat/converters.py

    elif isinstance(dcat_publisher, dict) and dcat_publisher.get('name'):
-        package_dict['extras'].append({'key': 'dcat_publisher_name', 'value': dcat_publisher.get('name')})
-        package_dict['extras'].append({'key': 'dcat_publisher_email', 'value': dcat_publisher.get('mbox')})
+        if dcat_publisher.get('name'):


There's a double check for `.get('name')` which can be removed.

I removed the second one
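
For illustration, the branch with the nested check removed could look roughly like the sketch below. `publisher_extras` is a hypothetical helper, not a function in `converters.py`, and the `dcat_publisher_identifier` key is an assumed name for the new field; only `dcat_publisher_name` and `dcat_publisher_email` appear in the diff above.

```python
def publisher_extras(dcat_publisher):
    """Build CKAN extras for a publisher given as a dict (hypothetical helper)."""
    extras = []
    # 'name' is checked once here, so no nested check is needed below
    if isinstance(dcat_publisher, dict) and dcat_publisher.get('name'):
        extras.append({'key': 'dcat_publisher_name',
                       'value': dcat_publisher.get('name')})
        extras.append({'key': 'dcat_publisher_email',
                       'value': dcat_publisher.get('mbox')})
        # Assumption: the identifier is stored under this key when present
        if dcat_publisher.get('identifier'):
            extras.append({'key': 'dcat_publisher_identifier',
                           'value': dcat_publisher.get('identifier')})
    return extras
```

A publisher without a name, or one that is not a dict, yields no extras in this sketch.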

amercader · 2024-09-10T10:11:20Z

ckanext/dcat/schemas/dcat_ap_recommended.yaml

@@ -66,6 +66,11 @@ dataset_fields:

    - field_name: type
      label: Type
+
+    - field_name: identifier


Can you add it to the dcat_ap_full.yaml file as well?

amercader · 2024-09-10T10:23:18Z

ckanext/dcat/profiles/euro_dcat_ap_base.py

@@ -123,7 +123,7 @@ def _parse_dataset_base(self, dataset_dict, dataset_ref):

        # Publisher
        publisher = self._publisher(dataset_ref, DCT.publisher)
-        for key in ("uri", "name", "email", "url", "type"):
+        for key in ("uri", "name", "email", "url", "type", "identifier"):


This will take care of the DCAT -> CKAN mapping, but you also need to add the other way around, CKAN -> DCAT. This is done a bit further down in the file here and here.

Additionally, this will cover the legacy way of defining publisher fields based on namespaced publisher_* extra fields, but going forward we want to support scheming objects ("publisher": {"id": xx, "name": yy, "url": zz}). This is handled in the euro_dcat_ap_scheming profile here (for CKAN -> DCAT, the opposite should already be handled by the base profile). Do you mind adding support for the new field there as well? And if you extend this test to cover the new field, even better.
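
To make the two representations concrete, here is a minimal sketch of reading the publisher fields from either form. `publisher_fields` is a hypothetical helper and not part of any profile; the dict shape follows the example in this comment and the extras follow the namespaced `publisher_*` convention, so the real profiles may handle more cases.

```python
PUBLISHER_KEYS = ("uri", "name", "email", "url", "type", "identifier")


def publisher_fields(dataset_dict):
    """Collect publisher fields from a dataset dict (hypothetical helper)."""
    publisher = dataset_dict.get("publisher")
    # Scheming-style object: "publisher": {"name": ..., "identifier": ...}
    if isinstance(publisher, dict):
        return {k: publisher[k] for k in PUBLISHER_KEYS if publisher.get(k)}

    # Legacy style: namespaced extras such as "publisher_identifier"
    extras = {e["key"]: e["value"] for e in dataset_dict.get("extras", [])}
    fields = {}
    for key in PUBLISHER_KEYS:
        value = extras.get(f"publisher_{key}")
        if value:
            fields[key] = value
    return fields
```

Both forms return the same mapping in this sketch, so serialization code written against it would not need to know which one the dataset used.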

@hcvdwerf Let me know if all this makes sense

Thanks for the comments, very helpful @amercader. I just committed the changes.

Markus92

Small fix for failing e2e test.

ckanext/dcat/tests/profiles/dcat_ap_2/test_scheming_support.py

Co-authored-by: Mark <markusjanse@gmail.com>

amercader · 2024-09-12T11:42:19Z

Thanks @hcvdwerf !

feat: add support for dct:identifier in publisher details

aefa22c

hcvdwerf force-pushed the add-dct-identifier-to-publisher branch from 83a1ddf to aefa22c Compare September 9, 2024 12:12

kburger reviewed Sep 9, 2024

missing label

87dd3c0

amercader reviewed Sep 10, 2024

- Add CKAN -> DCAT support publisher identifier

e0becf6

hcvdwerf force-pushed the add-dct-identifier-to-publisher branch from 64ace0a to e0becf6 Compare September 11, 2024 09:18

hcvdwerf requested review from amercader and kburger September 11, 2024 09:19

Markus92 suggested changes Sep 11, 2024

ckanext/dcat/tests/profiles/dcat_ap_2/test_scheming_support.py (outdated)

Update ckanext/dcat/tests/profiles/dcat_ap_2/test_scheming_support.py

6bd994a

Co-authored-by: Mark <markusjanse@gmail.com>

amercader merged commit d4dfab8 into ckan:master Sep 12, 2024
4 checks passed

mjanez mentioned this pull request Sep 16, 2024

Update from ckan/ckanext-dcat mjanez/ckanext-dcat#31

Merged
