Skip to content

Commit

Permalink
Merge pull request #50 from nfdi4plants/db-datamodel
Browse files Browse the repository at this point in the history
suggestions data model
  • Loading branch information
Freymaurer authored Sep 20, 2024
2 parents 4553ebc + af8a029 commit 91448f9
Show file tree
Hide file tree
Showing 2 changed files with 10 additions and 18 deletions.
26 changes: 8 additions & 18 deletions src/pages/details/arc-data-model.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,34 +2,24 @@
layout: ../../layouts/MarkdownLayout.astro
title: 'ARC data model'
pubDate: 2024-09-13
description: 'A short description of the ARC data mode.'
description: 'A short description of the ARC data model.'
author: 'Timo Mühlhaus'
image:
url: 'https://docs.astro.build/assets/rose.webp'
alt: 'The Astro logo on a dark background with a pink glow.'
tags: ["RO-Crate", "FAIR digital object","JSON-LD"]
tags: ["RO-Crate", "FAIR Digital Object", "JSON-LD"]
---

ARC is an implementation of a FAIR Digital Object (FDO), utilizing RO-Crate with Schema.org and Bioschemas objects as its foundation, and further enhancing it with additional metadata and structure. An RO-Crate serves as a research object, composed of a collection of research elements and data, enabling detailed descriptions of these collections. Research elements in this context include samples, measurement data, and other research outputs.
ARC is an implementation of a FAIR Digital Object (FDO). ARC utilizes RO-Crate with Schema.org and Bioschemas objects as its foundation, and further enhancing it with additional metadata and structure. An RO-Crate serves as a research object, composed of a collection of research elements and data, enabling detailed descriptions of these collections. Research elements in this context include samples, measurement data, and other research outputs.

![ARC RO-Crate](/arc-website/arc-ro-crate.png)
![ARC RO Crate](/arc-website/arc-ro-crate.png)

ARC extends the basic RO-Crate concept by incorporating detailed descriptions of the processes that lead to the generation of data. This enhancement allows the data model to represent a complete process graph, encompassing experimental procedures, simulations, analyses, and the interconnections and provenance among them.
In this model, research elements are the nodes of the process graph, while the connections between them, defined as lab processes, are represented by edges.

![ARC RO-Crate-Process](/arc-website/ARC-isa-cwl-decorations.png)

To allow for unambiguous inclusion of data entities into the process graph, Data Fragment Selectors, defined by W3, can be used in.
In this model, research elements are the nodes of the process graph, while the connections between them, defined as lab processes, are represented by edges. To allow for unambiguous inclusion of data entities into the process graph, Data Fragment Selectors, defined by [W3C]({{W3C}}), can be used.
Each process can be further specified and annotated with explanatory and descriptive metadata using lists of PropertyValues, enhancing its clarity and traceability.

![Fragment Selector](/arc-website/data-fragment-selector.png)

The collection of data fragment selectors for a given dataset is called the datamap. In addition to the selectors, the datamap contains additional information about how to read and interpret the content of the data fragments. Besides this, it carries and annotates data resources which do not originate in the process sequence.

![Datamap](/arc-website/datamap.png)
![ARC ISA CWL](/arc-website/ARC-isa-cwl-decorations.png)

To specialize ARC for biological data, the widely recognized ISA model (Investigation, Study, Assay) is employed, alongside the abstract Common Workflow Language (CWL) for workflows. The ISA model and CWL provide additional layers of metadata and structure, allowing for more precise definitions of processes and data. Dataset objects within ARC can implement either ISA or CWL interface object definitions, ensuring compatibility and standardization across various biological datasets and workflows.

![ARC RO Crate](/arc-website/arc-ro-crate-profiles.png)
![ARC RO Crate Profiles](/arc-website/arc-ro-crate-profiles.png)

This approach elevates ARC from merely documenting research objects to providing a full, interconnected representation of the research process, from data generation to analysis, making it a powerful tool for tracing and reproducing scientific investigations.

2 changes: 2 additions & 0 deletions src/statics.ts
Original file line number Diff line number Diff line change
Expand Up @@ -31,5 +31,7 @@ export enum URLS {
S3_DATAHUB_STATS = "https://frct-dataplant-static.s3.bwsfs.uni-freiburg.de/stats.json",

DATAPLANT_SWATE_ALPHA = "https://swate-alpha.nfdi4plants.org",

W3C = "https://www.w3.org/",

}

0 comments on commit 91448f9

Please sign in to comment.