-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Drop the range of dcat:keyword #1585
Comments
Do you suggest to have a mix of literals and resources using <dataset> dcat:keyword "Keyword literal"@en , <http://eurovoc.europa.eu/100141> . If so, I do not think this will improve interoperability.
I think the current state is fine and we should not change that. |
No, I only suggest to drop the range (in fact I would suggest to drop almost all ranges and leave that to application profiles). |
Well, dropping the range effectively means supporting the case above, which in my opinion lowers interoperability. |
The distinction between
has been in place since DCAT v1. Bad habits developed in projects can't be fixed by modifying DCAT for everyone. |
@dr-shorthair I'm aware of the distinction being from v1. The intention of raising this issue was to improve DCAT, not to make it suitable for a particular case. And speaking of bad habits, over-axiomatazing ontologies is definitely a bad habit in RDFS and OWL modelling in general, and not reserved for DCAT. But there is hope. A handy recent example is the range of |
I support the reaction from @jakubklimek. In this case the usage situation is clear and clean, and not restrictive. In short:
In the last case, dcat:theme is a special subproperty: namely the theme to which the Dataset is associated in the Catalogue. In this special case there is hopefully also not the discussion whether that could be a Literal. And note that for one profile the theme of another profile can be considered another categorisation. So instead calling this a bad practice, in this case the range Literal versus Concept is corresponding to a business need. Both nicely address two distinct levels of harmonisation in the area of associating term to datasets to make them easiers findable in a catalogue by freetext search or facetted browsing. By mixing, as illustrated by Jakub, DCAT states that the implementations must accept and being able to process both at the same time. It will create more implementation friction than gain. Lifting the distinction between data property and object property must be done care. And in this case it will not create added value, but more confusion. Maybe you stumble over that the subproperty of dct:subject is not named 'keyword' when you use it in an implementation just as a keyword: that is a different discussion. |
We intent to open the Range for dcat-ap-ch to also allow skos:Concept and schema:DefinedTerm. While we understand, that there is dcat:theme for clearly closed vocabualries which allow skos:Concept, in the use case of Switzerland, with its multilinguality, we see the need to have translated keywords for search purposes, without having a fixed set of (CV) of Terms. (By simply allowing language tags for keywords, we can't distinguish the matching once we have multiple keywords regarding the language.) We see the added potential friction for implementations, but we value the proper description of the datasets in a multilingual context here stronger. We would be pleased to get in the loop regarding this topic. |
@l00mi I think you are considering to overengineer the keywords. If this is a fixed list, well established it is better to stay in the category area (with a codelist in which you provide controlled translations). (subproperties of dct:subject). If it are keywords (i.e. random words that publishers like to attach to their catalogued resources) then an autotranslation service for that purpose is very usefull. |
@bertvannuffelen Thank you for your valuable feedback. I understand your concern, regarding the high editorial efforts, and I understand the connection to dct:theme for clear cut CVs with a closed set of categories. The goal we like to achieve is to have an open set, of multilingual tags. In the swiss context, we foresee to use https://termdat.bk.admin.ch for this (e.g. https://www.termdat.bk.admin.ch/entry/64371), and in addition links to Wikidata where the entries are missing (far from a "fixed" list). The back of the coin, in a multilingual context, is the editorial effort to properly translate Keywords in different languages (where we have a legal obligation to). This is especially in the context of a loose set of Keywords, not easy to translate properly with an auto translation. Therefore we much rather like to use the know-how of the publishers to help here. For your reference, we have a first draft of a usage note for our dcat-ap-ch here: https://github.com/opendata-swiss/eCH-0285-Use-of-controlled-vocabularies/blob/main/document/05_keywords.md I hope this helps to understand our goal, and to also distinguish from dcat:themes/dcat:subject. |
My personal opion is that mixing literals and structured values is generally a bad practice and should be avoided as much as possible. Considerations to take into account:
In the example below I put all possible values that could be technically provided when opening up the range. I have seen them all on the same property.
In your guidelines you make a lot of assumptions but leave the door still open for any of the above cases. Which of the cases you do not want to support? My point of concern is that dcat:keyword is about an "uncontrolled use of range of values". That is best captured by a string/langstring approach. This is the most simple and naive method to tag datasets to increase their findability. We should not give up simple methodes if there already valid approaches in the specification that could support a more structured approach. Observe that any (semi)controlled approach is mappable into this representation. 2 relative simple approaches to meet your requirements: Because you mention that the termdat system has a legal basis for use, I am even more in favor for the second case. In that you can straightforward make a distinction between the ID use https://register.ld.admin.ch/termdat/215878 and the equivalent literal use "hochwasser"@"de". (They are from a skos perspective highly equivalent as SKOS imposes that each prefLabel is uniquely identifying a concept). |
Since the range of
dcat:keyword
isrdfs:Literal
, this makes application profile designers use alternatives such asdcterms:subject
which reduces interoperability with catalogues usingdcat:keyword
A common SHACL shape in EU is:
It would be nicer to use the dedicated
dcat:keyword
.The text was updated successfully, but these errors were encountered: