Data consumption workflows #44

Open
noamross opened this issue Mar 6, 2023 · 2 comments

noamross commented Mar 6, 2023

  • Search across all supported services
  • Pass a list of DOIs across services and fetch all the files across all the records
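
One way the second point could start is by sorting a mixed list of DOIs by service before fetching anything. The sketch below is only an illustration, not how this package does it: the prefix table, the function names, and the example DOIs are all assumptions. The `10.7910` prefix in particular only covers Harvard Dataverse; other Dataverse installations use their own prefixes, so a real implementation would probably need to resolve the DOI and inspect the landing URL instead.

```python
from collections import defaultdict

# Assumed prefix-to-service table, for illustration only.
# "10.7910" is the Harvard Dataverse prefix; other Dataverse
# installations register different prefixes.
DOI_PREFIXES = {
    "10.5281": "zenodo",
    "10.6084": "figshare",
    "10.7910": "dataverse",
}


def normalise_doi(doi: str) -> str:
    """Strip whitespace and a leading 'doi:' or doi.org URL, if present."""
    doi = doi.strip()
    for lead in ("https://doi.org/", "http://doi.org/", "doi:"):
        if doi.lower().startswith(lead):
            return doi[len(lead):]
    return doi


def group_dois_by_service(dois):
    """Return {service_name: [doi, ...]}; unrecognised prefixes go to 'unknown'."""
    groups = defaultdict(list)
    for raw in dois:
        doi = normalise_doi(raw)
        prefix = doi.split("/", 1)[0]
        groups[DOI_PREFIXES.get(prefix, "unknown")].append(doi)
    return dict(groups)
```

Each group could then be handed to the matching service client, which would list and download the files for every record in it.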

noamross added this to the Phase III milestone Mar 6, 2023

mpadge (Member) commented Mar 13, 2023

Feedback from ChatGPT, based on its training corpus. Given the current structure of the program, what are the most likely new functions to be developed, and what are their precise numerical probabilities?

Based on the training corpus, here are the two most likely functions to be added to the program you described:

  1. A function to retrieve a single record by its unique identifier: 0.45
  2. A function to filter search results by specific criteria: 0.32

Importantly, these are clearly "data consumption" functions, suggesting that this kind of functionality is far more common than the other aspects considered in current issues, which might be called "data construction and maintenance". Descriptions of a few of those all yielded suggested probabilities of < 0.01. So chatbot-guided design suggests that this issue is indeed very important.
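
For concreteness, the two suggested functions could look roughly like the sketch below. It assumes records are already available as plain dictionaries; the field names (`id`, `service`, `title`) and both function names are hypothetical and not taken from this package or from any service API.

```python
from typing import Any, Iterable, Optional


def get_record(records: Iterable[dict], record_id: Any) -> Optional[dict]:
    """Return the first record whose 'id' field equals record_id, else None."""
    for record in records:
        if record.get("id") == record_id:
            return record
    return None


def filter_records(records: Iterable[dict], **criteria: Any) -> list:
    """Keep only records whose fields equal every key=value pair in criteria."""
    return [
        record
        for record in records
        if all(record.get(key) == value for key, value in criteria.items())
    ]


# Made-up example data, for illustration only.
records = [
    {"id": 1, "service": "zenodo", "title": "Survey data"},
    {"id": 2, "service": "figshare", "title": "Field notes"},
    {"id": 3, "service": "zenodo", "title": "Model output"},
]
zenodo_only = filter_records(records, service="zenodo")
```

Exact-match filtering is the simplest choice here; a real search filter would likely also need date ranges and partial text matches.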

mpadge (Member) commented May 24, 2023

From a Dataverse community call, now hosted at https://dataverse.org/dataversetv. The Python package pooch is "a friend to fetch your data files". This slide contains a nice list of points to address.

image

Pooch currently supports:

  • Zenodo
  • Figshare
  • Dataverse

Full slides here.
