-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Should we specialize dcterms:hasPart for dcat:DatasetSeries? #1307
Comments
Can we just use (of course, I would like to see the inverse property as well :-) |
My suggestion:
|
@dr-shorthair @riannella From the point of view of an application developer, let me again state that I would strongly prefer to standardize only one of the two mutually inverse properties. Otherwise, compliant applications and queries need to always account for the possibility that in one data catalog, the datasets are connected to the series using one property, in another using the second, which makes them more complex with each inverse property pair. |
I would suggest keeping only the inSeries property, so that one needn't worry about updating metadata for the series every time a new dataset is added to that series. |
Agree with @jakubklimek and @agreiner . Only one needed and |
...but they are inverse properties...so you only need to define one, the other can be asserted or inferred. |
Yes. Agree. |
I'm fine with defining only |
Sorry @andrea-perego - when I said "you only need to define one" - I really meant you only need to "assert" one. |
instead of
? I don't see why we would switch the logic for dataseries. Until now the "bigger" class states which "smaller" objects are part of them. Catalogs include Datasets, Datasets include Distributions. |
The rational for the dcat:inSeries approach is that for series that are derived from e.g. continuous sensor feeds, there are new members of the series being added at periodic intervals. the dcat:inSeries approach allows the metadata for new members of the series to declare their membership without having to update the series metadata. |
I agree with @smrgeoinfo that the more important relationship is from the part to the series - either This also impacts #1272. |
In our sector, for example, a Regulator would say the more important relationship was from the series to the parts. |
Regarding the The bottom line is that I think that the range of We can already see this in the Czech catalog:
|
@jakubklimek because of the subclass axiom
isn't the issue you describe already taken care of implicilty? |
@dr-shorthair ahh, ok, of course. I lost track of the state of the discussion about whether DatasetSeries should be a subclass of dataset/catalog/resource. I guess my use case is an argument for it being a subclass of dataset. |
Some discussions about the names of these properties have taken place in PR #1328. Originally posted by @andrea-perego in #1328 (comment)
Originally posted by @riccardoAlbertoni in #1328 (comment)
Originally posted by @andrea-perego in #1328 (comment)
Originally posted by @riccardoAlbertoni in #1328 (comment)
|
I find parts and parent/child to ok for generic relationships. I think we can be more specific and use property names that are aligned to the domain of data catalog publishing. @dr-shorthair proposal above with I would then go further and propose (in the same way) for having |
Thanks for the comment, @riannella:
I am afraid that changing the name of |
Hm, I'm getting a little concerned about the direction here. This thread shows at least three people who agreed that it would be beneficial to only define the inSeries property and leave out hasSeriesMember or whatever, because it makes it difficult to determine membership in a series programmatically, since one doesn't know in advance which approach is taken in the metadata. Yet we have people saying they see no reason not to include both, the draft includes both, and there is ongoing discussion of what to call the second term. Next, I expect someone will say that we should just go with it this way and see if anyone outside the group comments, nobody will comment, and we will be stuck with something that doesn't reflect what most of us want, or at least a process that doesn't address the issue raised. |
In tonight's call, @andrea-perego, @riannella, @dr-shorthair: How |
I think that's okay if people really need to mark that up in metadata, but honestly, I don't see why anyone would use that if they can just use the catalog itself. If it's intended to be used for a series, I don't think most people will even think that it has anything to do with a series. |
Thanks for your comment, @agreiner. I understand your concern. As for your previous specific comment,
I suspect the metadata of the dataset series needs to be updated anyway. Especially if we keep the upstream inheritance explained in https://raw.githack.com/w3c/dxwg/dcat-dataseries-issue1272/dcat/index.html#dataset-series where it is said: "The update date (dcterms:modified) of the dataset series should correspond to the latest publication or update date of the child datasets." Also, we are having some parallel discussion of whether supporting inverse properties with a "lightweight approach" adopted by PROV-O (https://www.w3.org/TR/prov-o/#inverse-names) ( we have discussed this in tonight call, see meeting minutes) I think the difficulty here is that there are different intertwined aspects for deciding whether or not to have both inSeries and hasSeriesMember, and which one of the two. |
OK |
@riccardoAlbertoni , if I correctly recall the last meeting, the discussion was about the relationship between a series and its members, with a proposal of calling it To be consistent, the relationship between a |
My understanding of W3C's process is that we take the best guess of the working group and publish that for input from the community, not that we take what feels the most inclusive and put that forth. By typing +1 in a meeting to publish, one says in effect that they are supportive of the text as it stands and okay with the assumption by the rest of the world that you stand by it. Notes can be helpful for expressing alternatives, so I would expect additional options be shown in notes, thereby remaining inclusive of a minority opinion if there isn’t agreement in the group. The current note, to me, doesn’t clearly state the concern raised at all. I’d prefer to have a note that says some members of the group feel there should be an inverse to inSeries and ask if anyone feels it is important to do so. Given the strong reason not to do so, I think this approach is justified. Re update dates, I think any property that assumes people will make updates to a published dataset’s metadata is flawed. Even the most diligent publishers of data will never be able to update copies of datasets that find their way to secondary sites and to users. Update dates only make sense to me when they reflect the state when the dataset (or series) was published. |
Looking at this thread, I think there's a general agreement on Therefore, I propose we revise PR #1292 by provisionally removing the inverse of For the inverse of |
Yes, I agree with the direction you have suggested. I am going to update the PR and open a new issue accordingly. |
Thanks, @andrea-perego , you are right. I switched series with catalog for some mysterious reasons, but series was the one I meant. My apologies ... . |
If we do not need to open other issues besides #1335, this issue can be closed as soon as we merge the changes in PR #1292. |
Thanks, @riccardoAlbertoni . I wonder whether we should also open a separate issue about the general approach to be taken to deal with inverse properties. BTW, part of the discussion in this thread fits into #1273 . I posted there the relevant comments. |
opened a new issue #1336 |
Dataset series v2 (issues #1307 and l#1272.)
For data catalog,
dcterms:hasPart
is specialized intodcat:dataset
,dcat:service
, etc. Should we have some sort of specialization also fordcterms:hasPart
used indcat:DatasetSeries
?In particular, to distinguish more between datasets that are 'just a bag of files' (see examples in Loosely structured catalog) from actual datasets/distributions part of the dataset series.
The text was updated successfully, but these errors were encountered: