How can/should one introduce Context into FAIR data? #16

hrzepa · 2017-07-03T10:02:33Z

I have heard criticism of FAIR as representing only four of the five essential attributes of data. The missing component is “context”. Without the back story associated with the data, it is impoverished.

Arguably, the FAIR metadata can provide link(s) to such back stories, but is this sufficient and should over mechanisms such as perhaps EventData be promoted as well?

dr-shorthair · 2017-07-12T07:29:49Z

Yes. Context could also be generalized as 'linked' or 'connected'. This is a weakness or gap in the current FAIR gamut.

micheldumontier · 2017-07-12T14:36:13Z

The FAIR principles indicate that reuse is enabled with detailed provenance (R1.2), and this emcompasses context.

CaroleGoble · 2017-07-31T14:27:27Z

Is context just provenance? IMHO, no.

As soon as one is working with the datasets and OTHER assets arising from a range of studies and experiments you are not just talking about provenance, nor are you just talking about data as one data set. Or even data at all. In the FAIRDOM Systems Biology asset management platform we link together data, models, SOPs, workflows, samples, publications etc all around the ISA model. the entire compound "Research Object is FAIR as well as the individual components within.
See http://www.fair-dom.org, and

Wolstencroft K, Krebs O, Snoep JL, Stanford NJ, Bacall F, Golebiewski M, Kuzyakiv R, Nguyen Q, Owen S, Soiland-Reyes S, Straszewski J, van Niekerk DD, Williams AR, Malmström L, Rinn B, Müller W, Goble C FAIRDOMHub: a repository and collaboration environment for sharing systems biology research. Nucleic Acids Res, 45(D1): D404-D407. DOI: 10.1093/nar/gkw1032 (2016)

The Research Object (http://www.researchobject.org) approach is all about metadata manifests that retain context and relate components that are potentially scattered in external resources as well as contained in containers like docker or even zip files. By having FAIR Research Objects, rather than just "data" we get context

Belhajjame K, Zhao J, Garijo D, Gamble M, Hettne K, Palma R, Mina E, Corcho O, Gómez-Pérez JM, Bechhofer S, Klyne G, Goble C Using a suite of ontologies for preserving workflow-centric research objects, J. Web Sem. 32: 16-42, doi:10.1016/j.websem.2015.01.003. (2015)
and
Chard K, D' Arcy M, Heavner B, Foster I, Kesselman C, Madduri R, Rodriguez A, Soiland-Reyes S, Goble C, Clark K, Deutsch EW, Dinov I, Price N, Toga A I'll Take That to Go: Big Data Bags and Minimal Identifiers for Exchange of Large, Complex Datasets IEEE Intl Conf on Big Data doi:10.1109/BigData.2016.7840618 (2016)

CaroleGoble · 2017-07-31T14:36:57Z

Another comment - in Systems approaches in particular we are crossing the boundaries of different types of data - as is the case in, say, polyomic studies. Thus we very much need to retain the context of how data are related to each other in a study. In project data management workflows too often this linkage is broken when the sub-datasets are disbanded into type specific, siloed, public deposition archives. FAIRDOM (above) tackles this from the start for poly-asset projects through a FAIR metadata layer. BioStudies, kind of does this too. The DTL FAIRification platform (DataFAIRPoints and Fairifier) attempts to recover this retrospectively.

micheldumontier · 2017-07-31T14:48:15Z

The research object approach is perfectly fine way to bundle things together and provide the metadata that you need to understand what those objects are, and, as you say, the context for those objects. While we might disagree that the provenance of a digital object does not fully encompass the context from which it was produced, we should agree that context is covered by R1. meta(data) have a plurality of accurate and relevant attributes.

dr-shorthair · 2017-07-31T16:07:23Z

we should agree that context is covered by R1. meta(data) have a plurality of accurate and relevant attributes.

I agree that you could shoehorn context here, but it would be helpful to have it made more explicit.

I'm looking for a bridge from FAIR to the 5th star of the W3C's Linked Open Data principles here - e.g. http://5stardata.info/en/
If your data is linked into a bigger 'graph' then it is more useful. This requires cross-references and (hyper-)links, not just 'a plurality of attributes'.

micheldumontier · 2017-07-31T21:05:15Z

I3 is responsible for the connectivity part of the data/metadata. See https://www.dtls.nl/fair-data/i3-metadata-include-qualified-references-metadata/ m.

…

On Mon, Jul 31, 2017 at 6:07 PM, Simon Cox ***@***.***> wrote: we should agree that context is covered by R1. meta(data) have a plurality of accurate and relevant attributes. I would agree that you *could* shoehorn context here, but it would be helful to have it made more explicit. I'm looking for a bridge from FAIR to the 5th star of the W3C's Linked Open Data principles here - e.g. http://5stardata.info/en/ If your data is linked into a bigger 'graph' then it is more useful. This requires cross-references and (hyper-)links, not just 'a plurality of attributes'. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#16 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AA8qPO4cYlEWhVTxNfPoFSR6e18AKDcOks5sTfu8gaJpZM4OMFLe> .

-- Michel Dumontier Distinguished Professor of Data Science Maastricht University http://dumontierlab.com

dr-shorthair · 2017-08-01T05:00:42Z

Yes. I had temporarily overlooked that (though had already slotted it into our rating framework - https://confluence.csiro.au/display/OZNOME/Data+ratings ).

Unfortunately I find the groupings in FAIR to be less than ideal, so in some cases the chief concern is smeared over more than one FAIR principle - for example, 'findable' overlaps with R1, F2, F3, and 'useable' with I2 and R1.3.

Maybe its because our focus is on data, rather than metadata?

Daniel-Mietchen added FAIR extensions FAIRness metadata labels Jul 3, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How can/should one introduce Context into FAIR data? #16

How can/should one introduce Context into FAIR data? #16

hrzepa commented Jul 3, 2017

dr-shorthair commented Jul 12, 2017 •

edited

Loading

micheldumontier commented Jul 12, 2017

CaroleGoble commented Jul 31, 2017

CaroleGoble commented Jul 31, 2017

micheldumontier commented Jul 31, 2017

dr-shorthair commented Jul 31, 2017 •

edited

Loading

micheldumontier commented Jul 31, 2017 via email

dr-shorthair commented Aug 1, 2017 •

edited

Loading

How can/should one introduce Context into FAIR data? #16

How can/should one introduce Context into FAIR data? #16

Comments

hrzepa commented Jul 3, 2017

dr-shorthair commented Jul 12, 2017 • edited Loading

micheldumontier commented Jul 12, 2017

CaroleGoble commented Jul 31, 2017

CaroleGoble commented Jul 31, 2017

micheldumontier commented Jul 31, 2017

dr-shorthair commented Jul 31, 2017 • edited Loading

micheldumontier commented Jul 31, 2017 via email

dr-shorthair commented Aug 1, 2017 • edited Loading

dr-shorthair commented Jul 12, 2017 •

edited

Loading

dr-shorthair commented Jul 31, 2017 •

edited

Loading

dr-shorthair commented Aug 1, 2017 •

edited

Loading