-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Migrate FoLiA Set Definition scheme to RDF #14
Comments
…-based set definitions (just started, skeleton, not functional yet), issue #19 and issue proycon/folia#14
… least, untested still), issue proycon/folia#14
See if we can derive our set definition model from SKOS, facilitating interoperability |
Summary of the current solution, the aim was to use as much of SKOS as possible, with as few as possible non-SKOS solutions:
Minor custom-made extensions:
Set definitions are completely agnostic about concept schemes. Relating concepts to external resources can be done through usual SKOS mechanisms, or other vocabularies; FoLiA set definition implementations don't use this information yet. A constraint mechanism for which subsets can be used together given what classes has not been defined nor implemented yet either. FoLiA Set definitions have to be complete and publicly retrievable from the web, a set definition should consist of one and only one SKOS collection that acts as the primary set (i.e. it is not a subset). Furthermore, all classes and subsets need to be defined as stated above, referring to foreign resources using e.g. Example sets:
More details and examples are in the FoLiA documentation. |
I personally quite like the XML format for its simplicity, but understand the motivations for the move to RDF and I think you have done a great job (again ;-)). It would be good to add to your bullet list above (which I suppose will end up in the documentation) how things used to be done in XML to provide guidance for developers who would want to migrate. One thing I noticed when working with setdefinitions is that linking to blob on a GitHub repository is not a good idea as those blobs seem to be cached (and if you push a newer version a rerun the validation the old blob is still fetched). Have you experienced the same or is this just me? :-) |
Thanks for the feedback! The legacy XML set definition format will remain supported for backward compatibility in any case (at least in the python library, which is the only library implementing FoLiA Set Definition support anyway). Legacy XML is translated on the fly to RDF. The The cache issue is an interesting point indeed, but I must say I haven't really encountered many problems with this in my tests yet. In practise it will probably not be much of an issue and only cause a small update delay perhaps. |
For the sake of clarification and possible discussion, I'm adding an example screenshots of how complex FoLiA set definitions (with features/subsets) are used by FLAT (https://github.com/proycon/flat) to populate fields in the annotation editor: |
And an example of deep validator invocation and output:
|
…-based set definitions (just started, skeleton, not functional yet), issue #19 and issue proycon/folia#14
… least, untested still), issue proycon/folia#14
The role of FoLiA Set Definitions is:
Using set definitions a FoLiA document can be validated on a deep level, i.e.
the validity of the used classes can be tested. Set definitions provide
semantics to the FoLiA documents that use them and are an integral part of FoLiA.
Set definitions are not in widespread use yet, most people simply don't bother
or care for such a level of abstraction and formality. One tool, FLAT,
does rely heavily on set definitions to populate options in selection fields.
Set definitions are currently described in a simple XML format, distinct from
FoLiA itself. The format is limited and not strongly established.
Considering the highly semantic nature of set definitions, the binding role
they play between the FoLiA document on one hand and data category registries
on the other hand, and the advent of linked open data, I propose describing
the set definitions themselves in RDF in future versions of FoLiA. I'm working
on a scheme for this.
The current set definitions will remain supported for backwards compatibility
of course, and may also act as an intermediate step in producing the RDF data.
The text was updated successfully, but these errors were encountered: