Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Download Manager related help text #2960

Closed
posixeleni opened this issue Feb 18, 2016 · 13 comments
Closed

Download Manager related help text #2960

posixeleni opened this issue Feb 18, 2016 · 13 comments

Comments

@posixeleni
Copy link
Contributor

A user recently reported (here https://help.hmdc.harvard.edu/Ticket/Display.html?id=232704) that they have had several people report to them that they were unable to download the files from their dataset. The file would start to download ok and then hang up, resulting in them having to re-start the whole process which for a larger file can take a lot of time.

I spoke with @landreev about this yesterday and we should look into a way to support the ability to resume a download which got interrupted.

@pdurbin
Copy link
Member

pdurbin commented Feb 22, 2016

@pameyer and I talked about this a bit at http://irclog.iq.harvard.edu/dataverse/2016-01-08#i_29469 and it's definitely something that's of interest for @sbgrid . Their in-house system uses rsync for this. I'm not sure the best way to implement this in Dataverse. This feature also seems strongly related to the Data IO Component BRD @raprasad is working on so I left a comment on it to check if it's in scope or not.

@pdurbin
Copy link
Member

pdurbin commented Feb 24, 2016

@bmckinney I'm giving you this issue to think about and discuss with @pameyer with regard to the needs of @sbgrid. Definitely open to suggestions. 😄

@bmckinney
Copy link
Contributor

Thanks, I definitely have some thoughts on this. I'll plan to discuss with @pameyer and share a google doc.

@bmckinney
Copy link
Contributor

@pdurbin
Copy link
Member

pdurbin commented Feb 25, 2016

@bmckinney nice! Please see also the diagram by @michbarsinai at #2728 (comment) and the BRD: Data IO Component doc @raprasad has been working on.

@pdurbin
Copy link
Member

pdurbin commented Feb 25, 2016

@bmckinney I just mentioned your desktop app idea to @landreev and a standup and he agrees that you need something on the client side to coordinate resumption of uploads and downloads. Also, I'm reminded of a demo @astrofrog gave to me, @mcrosas (and others?) in May 2015. I just looked at http://irclog.iq.harvard.edu/search.pl?channel=dataverse&q=trobitaille but I can't find a reference to any code anywhere but it was written in Python. I do see a mention of Dataverse at https://github.com/glue-viz/glue/blob/16bc539736cb580ac4d42ccf5f6741ada553fde1/glue/main.py#L257 and there was a little discussion at dvn/swordpoc#3 . Anyway, if we do decide to pursue some sort of desktop client for Dataverse, let's create a new GitHub issue to track it.

@bmckinney
Copy link
Contributor

before i forget, this project/protocol is somewhat relevant: http://tus.io/ caveats: only deals with uploads, currently no java server implementation. it could be a nice option for "somewhat large" file uploads

@pdurbin
Copy link
Member

pdurbin commented Jun 11, 2016

"Resumable upload supports uploading videos that are up to 1.75GB and 45 minutes long." https://developers.facebook.com/docs/graph-api/video-uploads#resumable via http://socialwg.indiewebcamp.com/irc/social/2016-06-10/line/1465619059043

Too small for our requirements but potentially interesting.

@pdurbin
Copy link
Member

pdurbin commented Feb 2, 2017

This issue is highly related since rsync supports the ability to resume uploads and downloads: Data Capture Module (rsync support) #3145

Truth be told, this issue is a little more about the download side of things and the DCM is about uploads.

@pameyer
Copy link
Contributor

pameyer commented Feb 2, 2017

DCM is about uploads - but integration of components for rsync downloads are in other issues.

wget-able links will help here as well (aka - users will be able to do resumable downloads without other changes to dataverse, within the restrictions of files/datasets that will display a download link.

@pdurbin pdurbin added the User Role: Guest Anyone using the system, even without an account label Jul 4, 2017
pdurbin referenced this issue Aug 11, 2017
A rough draft of detailed rsync upload instructions. Needs more loving once we know more details about this feature.
@pdurbin
Copy link
Member

pdurbin commented Oct 5, 2018

@pameyer good point about wget. I chronicled my experience with wget --continue at #5092 and it sounds like we're going to promote the use of non-browser tools like wget in #4949.

pdurbin referenced this issue Nov 30, 2018
Added the section "Downloading a Dataverse Package via URL". This should cover this feature for the User Guide.
@mheppler
Copy link
Contributor

mheppler commented Dec 7, 2018

Stumbled across this download issue, and saw the wget conversation. Reminded me that we added a bit of help text to the Download Package File from S3 #4949 issue.

We currently display a Download URL on the file pg, which provides download access via the API.

screen shot 2018-12-07 at 12 48 25 pm

I suggest that we could resolve this issue by displaying similar help text above this Download URL field that explains the wget and download manager methods, as well as a link to the User Guide.

@pdurbin
Copy link
Member

pdurbin commented Dec 7, 2018

@mheppler fine with me.

@mheppler mheppler self-assigned this Dec 10, 2018
@mheppler mheppler removed Type: Suggestion an idea User Role: Guest Anyone using the system, even without an account labels Dec 10, 2018
@djbrooke djbrooke changed the title Support The Ability To Resume Disrupted File Downloads Download Manager related help text Dec 12, 2018
@djbrooke djbrooke assigned dlmurphy and unassigned dlmurphy Dec 12, 2018
@mheppler mheppler self-assigned this Dec 18, 2018
dlmurphy added a commit that referenced this issue Dec 18, 2018
Moved around and added some new info.
@mheppler mheppler removed their assignment Dec 18, 2018
@kcondon kcondon self-assigned this Dec 19, 2018
@kcondon kcondon closed this as completed Dec 19, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

8 participants