add subfields to authors in dataset_description.json #602

CPernet · 2020-09-09T15:41:42Z

following a suggestion by @debruine in psych-DS we could have in modality agnostic file an update of the dataset_description.json for authors, removing ambiguities. (not sure what this entail for the validator? as we should allow both types of entries)

{
  "Name": "The mother of all experiments",
  "BIDSVersion": "1.4.0",
  "DatasetType": "raw",
  "License": "CC0",
  "Authors": [
  { "@type": "Person",
    "givenName": "Paul",
    "familyName": "Broca",
    "identifier": "https://orcid.org/0000-0000-0000-0001"},
  { "@type": "Person",
    "givenName": "Carl",
    "familyName": "Wernicke",
    "identifier": "https://orcid.org/0000-0000-0000-0002"}
],
  "Acknowledgements": "Special thanks to Korbinian Brodmann for help in formatting this dataset in BIDS. We thank Alan Lloyd Hodgkin and Andrew Huxley for helpful comments and discussions about the experiment and manuscript; Hermann Ludwig Helmholtz for administrative support; and Claudius Galenus for providing data for the medial-to-lateral index analysis.",
  "HowToAcknowledge": "Please cite this paper: https://www.ncbi.nlm.nih.gov/pubmed/001012092119281",
  "Funding": [
    "National Institute of Neuroscience Grant F378236MFH1",
    "National Institute of Neuroscience Grant 5RMZ0023106"
  ],
  "EthicsApprovals": [
    "Army Human Research Protections Office (Protocol ARL-20098-10051, ARL 12-040, and ARL 12-041)"
  ],
  "ReferencesAndLinks": [
    "https://www.ncbi.nlm.nih.gov/pubmed/001012092119281",
    "Alzheimer A., & Kraepelin, E. (2015). Neural correlates of presenile dementia in humans. Journal of Neuroscientific Data, 2, 234001. http://doi.org/1920.8/jndata.2015.7"
  ],
  "DatasetDOI": "10.0.2.3/dfjj.10",
  "HEDVersion": "7.1.1"
}

tagging @effigies @sappelhoff @robertoostenveld

sappelhoff · 2020-09-09T16:16:11Z

I assume that the proposal only entails modifying the Author field? I didn't spot anything other that was different from what we currently support in BIDS.

re: the Author field --> that looks like an interesting direction to me

Currently, we say the following according to this part of the spec:

Authors: OPTIONAL. List of individuals who contributed to the creation/curation of the dataset.

Note that this is a little ambiguous and we could be a lot clearer in terms of what exact datatypes are expected. See this issue, where we want to start improving this state: #533

Looking at the validator schema however, we see that an "array of strings" is expected as input: see link to validator code

Now for the present proposal:

we would HAVE to keep allowing "array of strings" for backward compatibility
but we could add a second way to specify authors: "array of objects"
1. where each "object" MUST have (and only have) the fields X, Y, Z (to be specified)

that wouldn't be a technical problem.

Overall I think this looks cool but it'd need a bit more tweaking (specify what other @type value you want to allow ... and why the @ symbol is needed, then also what kind of "identifiers" would be permissible, etc.)

Let's hear what others have to say.

PS: using @ + type also made me accidentally tag https://github.com/type ... sorry 🙂

effigies · 2020-09-09T17:12:34Z

cc @nellh It would be good to have an OpenNeuro perspective on this.

satra · 2020-09-09T17:28:36Z

if anyone is interested here is the current version of the dataset contributor model we are using in DANDI.

https://github.com/dandi/dandi-cli/blob/c20d7888391c9abe6fdffbc53ea4e20f054bbde2/dandi/models.py#L510

which adds/overwrites the common model.
https://github.com/dandi/dandi-cli/blob/c20d7888391c9abe6fdffbc53ea4e20f054bbde2/dandi/models.py#L444

specifically this uses a field called contributor which can accept either a Person or an Organization as an object with specific roles assigned to these people.

this is not BIDS compatible, but should be compatible/translatable with datacite, which i believe is what openneuro uses for DOIs. although i don't know what pieces of metadata are transformed into the datacite model.

at present it seems that openneuro doi's provide some basic mapping to creator for all authors:

$ curl https://ez.datacite.org/id/doi:10.18112/openneuro.ds003105.v1.0.1
success: doi:10.18112/openneuro.ds003105.v1.0.1
_target: https://openneuro.org/datasets/ds003105/versions/1.0.1
datacite: <?xml version="1.0" encoding="UTF-8"?>%0A<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4/metadata.xsd">%0A  <identifier identifierType="DOI">10.18112/OPENNEURO.DS003105.V1.0.1</identifier>%0A  <creators>%0A    <creator>%0A      <creatorName>Kelly Payette</creatorName>%0A    </creator>%0A    <creator>%0A      <creatorName>Andras Jakab</creatorName>%0A    </creator>%0A  </creators>%0A  <titles>%0A    <title xml:lang="en-us">Fetal Tissue Annotation Challenge FeTA Dataset</title>%0A  </titles>%0A  <publisher>Openneuro</publisher>%0A  <publicationYear>2020</publicationYear>%0A  <resourceType resourceTypeGeneral="Dataset">fMRI</resourceType>%0A</resource>
_profile: datacite
_datacenter: SUL.OPENNEURO
_export: yes
_created: 1598878722
_updated: 1598878724
_status: public

ericearl · 2023-06-20T14:29:32Z

I know this is an old thread, but I was re-introduced to it today by @agt24. I think it would be great to NOT change the dataset_description.json's "Authors" field, but instead accept EITHER:

The "Authors" field as-is in the dataset_description.json ; OR
A CITATION.cff file at the root level of the data set in lieu of a dataset_description.json's "Authors" field since CITATION.cff is an accepted widespread standard of documenting details about authors.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add subfields to authors in dataset_description.json #602

add subfields to authors in dataset_description.json #602

CPernet commented Sep 9, 2020

sappelhoff commented Sep 9, 2020 •

edited by effigies

Loading

effigies commented Sep 9, 2020

satra commented Sep 9, 2020

ericearl commented Jun 20, 2023

ericearl commented Jun 20, 2023

CPernet commented Jun 20, 2023 •

edited

Loading

effigies commented Jun 20, 2023

Remi-Gau commented Jun 20, 2023

ericearl commented Jun 20, 2023

effigies commented Jun 20, 2023

Remi-Gau commented Jun 20, 2023

effigies commented Jun 20, 2023

add subfields to authors in dataset_description.json #602

add subfields to authors in dataset_description.json #602

Comments

CPernet commented Sep 9, 2020

sappelhoff commented Sep 9, 2020 • edited by effigies Loading

effigies commented Sep 9, 2020

satra commented Sep 9, 2020

ericearl commented Jun 20, 2023

ericearl commented Jun 20, 2023

CPernet commented Jun 20, 2023 • edited Loading

effigies commented Jun 20, 2023

Remi-Gau commented Jun 20, 2023

ericearl commented Jun 20, 2023

effigies commented Jun 20, 2023

Remi-Gau commented Jun 20, 2023

effigies commented Jun 20, 2023

sappelhoff commented Sep 9, 2020 •

edited by effigies

Loading

CPernet commented Jun 20, 2023 •

edited

Loading