File Download: Allow user to download all files from a dataverse at once. #639

eaquigley · 2014-07-09T15:39:56Z

Author Name: Kevin Condon (@kcondon)
Original Redmine Issue: 4086, https://redmine.hmdc.harvard.edu/issues/4086
Original Date: 2014-06-06
Original Assignee: Gustavo Durand

This was requested by a dv admin, Janina:

Allow a user to download all files from a dataverse at once, see RT#179817

do you know if there is a way to download all of the files in our Dataverse at once?

scolapasta · 2014-10-16T15:45:06Z

Moved to 4.1 to decide if we actually want something like this (what if a dataverse has GBs and GBs of files?)

kcondon · 2015-04-20T17:27:56Z

Under the Download button, the UI currently says, All files from this dataset, but it is grayed out.
A user was asking whether this was broken, see RT #196683, also Liz added a ticket to allow selecting all files at once for download: #1988

raprasad · 2015-07-27T17:11:07Z

Based on # of files + sizes, can there be a quick estimate of whether:

This can be done fairly fast (in seconds) for that user OR
Request into a queue
- User sees message saying to expect an email + dv notification for when the download is ready.

sbarbosadataverse · 2016-03-16T20:33:39Z

@scolapasta
This was requested by Harvard GSD just today and had to be passed to Kevin.

kcondon · 2016-05-09T16:37:21Z

Will bring up at GIRT to see whether it is realistic to pursue this.

donsizemore · 2017-04-06T18:33:00Z

Feature request (which might better belong in a separate issue?):

Thu-Mai often needs to download all files from a dataset in their original format. I'm looking into scripting this through the Native and Data Access APIs, but a GUI option for archivists would be extremely helpful.

pdurbin · 2017-04-06T19:39:32Z

@donsizemore yeah, a separate issue might be nice. What you and Thu-Main want is actually a smaller "user story" since it's limited to a single dataset. This issue is about a whole dataverse, which might contain sub-dataverses (which might contain sub-dataverses).

pdurbin · 2017-05-15T20:28:37Z

@donsizemore this issue came up today. Weren't you saying in IRC that you cooked up a script? Want us to take a look? 😄

pdurbin · 2017-05-15T20:32:56Z

"all mine does is download all files in a given dataset in original format writing out the original filename, but it could be smartly rewritten and extended" -- @donsizemore at http://irclog.iq.harvard.edu/dataverse/2017-04-07#i_51359

pdurbin · 2018-10-04T01:39:38Z

When we tried to estimate #4529 about downloading all file based on a persistent ID (DOI or Handle) we decided against implementing that feature due to concerns over performance problems: #4529 (comment)

This issue represents even more load on the server so by the same logic we wouldn't implement this either.

djbrooke · 2019-08-16T14:18:47Z

I'm going to close this for now. For the performance concerns, we could possibly revisit after implementing Lambda functions (#6093) that would take zipping datasets off the application server, or if we decide to pre-zip and store content in support of #6085. We'd need to take non-S3 installations into consideration.

mankoff · 2020-06-04T00:14:09Z

Hello. I'm interested in this feature (and commented recently on a related issue). I have a question after reading this thread:

Why is zipping required?

Based on my (limited, ancient) webserver admin experience, if the dataset is exposed as a folder, the individual files could be downloaded with wget. Compression can happen on-the-fly (and only by certain filetype?) by the webserver (e.g. Apache), or no compression at all, and the Dataverse does not need to bulk-zip everything before the download begins.

pdurbin · 2020-06-04T21:02:43Z

@mankoff I appreciate your out of the box thinking! Thanks for commenting on #4529 and #6505 as well! Let's move the conversation to one of those issues since they're still open. Alternatively, you're welcome to open a dedicated issue about this idea.

eaquigley assigned scolapasta Jul 9, 2014

raprasad modified the milestone: Dataverse 4.0: In Review Jul 9, 2014

scolapasta modified the milestones: Beta 7 - Dataverse 4.0, In Review - Dataverse 4.0 Jul 15, 2014

scolapasta modified the milestones: Beta 7 (Permissions & Auth Branch) - Dataverse 4.0, Beta 8 - Dataverse 4.0, 4.1 Oct 10, 2014

kcondon mentioned this issue Apr 20, 2015

Download Files: Select All Checkbox #1988

Closed

scolapasta modified the milestones: In Review - Long Term, In Review - Short Term May 8, 2015

eaquigley removed the Status: Design label Jun 25, 2015

This was referenced Jun 25, 2015

Requesting Access to (multiple) restricted files: allow to request access to all files simultaneously #2205

Closed

Hook up download All button #2026

Closed

scolapasta removed their assignment Jan 27, 2016

mheppler added the Feature: File Upload & Handling label Jan 28, 2016

scolapasta added Status: Triaged and removed Status: Dev labels Jan 28, 2016

scolapasta removed this from the Not Assigned to a Release milestone Jan 28, 2016

pdurbin added User Role: Guest Anyone using the system, even without an account and removed zTriaged labels Jun 30, 2017

pdurbin mentioned this issue Dec 19, 2017

Persistent Identifiers for Dataverse #4390

Closed

pdurbin added the Feature: API Guide label Aug 10, 2019

djbrooke closed this as completed Aug 16, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

File Download: Allow user to download all files from a dataverse at once. #639

File Download: Allow user to download all files from a dataverse at once. #639

eaquigley commented Jul 9, 2014

scolapasta commented Oct 16, 2014

kcondon commented Apr 20, 2015

raprasad commented Jul 27, 2015

sbarbosadataverse commented Mar 16, 2016

kcondon commented May 9, 2016

donsizemore commented Apr 6, 2017

pdurbin commented Apr 6, 2017

pdurbin commented May 15, 2017

pdurbin commented May 15, 2017

pdurbin commented Oct 4, 2018

djbrooke commented Aug 16, 2019

mankoff commented Jun 4, 2020

pdurbin commented Jun 4, 2020

File Download: Allow user to download all files from a dataverse at once. #639

File Download: Allow user to download all files from a dataverse at once. #639

Comments

eaquigley commented Jul 9, 2014

scolapasta commented Oct 16, 2014

kcondon commented Apr 20, 2015

raprasad commented Jul 27, 2015

sbarbosadataverse commented Mar 16, 2016

kcondon commented May 9, 2016

donsizemore commented Apr 6, 2017

pdurbin commented Apr 6, 2017

pdurbin commented May 15, 2017

pdurbin commented May 15, 2017

pdurbin commented Oct 4, 2018

djbrooke commented Aug 16, 2019

mankoff commented Jun 4, 2020

pdurbin commented Jun 4, 2020