-
Notifications
You must be signed in to change notification settings - Fork 492
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Files: Persistent Identifiers for Files (DOIs for files) #2438
Comments
I am entering via: https://groups.google.com/forum/#!msg/dataverse-community/gtz2npccWjU/i7_EVs2LBgAJ ... I think persistent identifiers should not be derived from the local file id. An organization may want to migrate from their repository solution to dataverse and be able to import their PIDs if they already have them. And then rebind them to the dataverse dataset and future file pages. That would be possible with a String type, but not with a number. |
Sebastian brought this up on the community call 8/16 and asked if it would be included in 4.6. There are currently no plans to work on this for 4.6. |
Closing this issue as a duplicate of Files: Need Persistent Identifiers/URL's for Data Files #2700. |
A "DOIs for files" thread was just started at https://groups.google.com/d/msg/dataverse-community/JX2GLqPy_yE/6dzgXGVcCAAJ which reminds me that @djbrooke and I discussed this issue as well as #2700 last week. In short, #2700 was a bit more about a specific need for putting into print specific instructions about how to download files from an installation of Dataverse. A combination of a DOI and a file name was sufficient so I'll close that issue. This issue was originally closed as a duplicate of #2700 but this issue is actually the "DOIs for files" issue so I'll reopen it. Note that the title is "persistent identifiers for files" to be more generic than DOIs, to include Handles or whatever other schemes make sense. |
Two recent discussions come up recently where we would benefit from persistent IDs for files:
|
Adding relevant info about the current state of the branch;
To me this looks very wrong. And I don't feel this really is about the right way of handling an exception - why are we getting that exception in the first place? Specifically, a database connection error? Are we somehow trying to register these file DOIs for the 500+ files at the same time - instead of doing int sequentially in the background?
|
I also added two specific change requests in the review. |
…n the dataset once it's locked, before the asynchronous part of the publishing process is initiated. (#2438)
…atasetPublicationCommand update the dataset object; to avoid the condition when the first command is still holding onto the same object as it initiate the execution of the second command, asynchronously - which can result in concurrency and optimistic lock issues. Also, removed the redundant/unnecessary merge() calls. (#2438)
A quick summary, of the latest developments:
|
Tested the above checklist by leonid and all works as described. Also nearly completed test checklist: https://docs.google.com/document/d/1VOlDJFRy7zoS4LVDerH5TgMYsr3DKkm9ZRNcqajitxY/edit?usp=sharing |
Since we are moving towards individual pages for files, we also need to think about what the persistent identifier will be for them.
The text was updated successfully, but these errors were encountered: