-
-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[deviantart][bug] PDF files no longer being downloaded, just their preview images #3781
Comments
This was already reported in #3561. |
Hmm. Trouble is, that isn't just one or two artists deliberately disabling the "download" option. It's the default behavior for PDF files. Failing the current ability of gallery-dl to download PDFs, is there something I could do to take note of all PDF files it can't download now so I may perhaps download them in the future when that should become possible again? I.e., "If artwork is PDF: write artwork URL to a file" (or at least do so with "artwork ID"). Then I could, in the future when this may be fixed, assemble a list of commands à la "gallery-dl_SomeFutureVersion.exe --config myConfig.conf https://www.deviantart.com/view/{PDF_file's_artwork_ID_here}" and run that. That way I don't miss anything now and don't have to fully re-parse every single gallery in my download queue later. Any suggestions for how to accomplish that? Obviously it should only do this for artworks that are PDF files - or at least not for those that are obviously normal images. |
I don't think so.
The problem is precisely that it's not possible to determine if a deviation is PDF without making a request to the webpage (as opposed to the API), which is very inefficient. |
@mikf This might be fixable by detecting the presence of This could also be used to improve Related: #3322 |
As of sometime between April 17th and May 2nd, deviantArt seems to have fixed this themselves, so pdf files get downloaded again. Tested in both 1.25.2 and 1.25.3. I don't know if anything other than PDFs was affected, should the issue be closed? |
I'm not sure if this is still an issue with your pdfs, but I'm trying to download some and I'm only getting the cover image. Maybe it's because the pdf is text-based? running gallery-dl on it only gives me the cover image. Testing on your provided PDF downloaded the entire pdf so I'm not sure why gallery-dl's being selective. |
Hmm. Tried your link and it worked for me, using the new 1.26.0 release. Maybe it was fixed in that, or it was a temporary hick up on deviantArt's end, they did just make some API updates around ... well, okay, probably closer to September 19th - 21st or so, not September 11th when you commented, but still. |
Maybe it's because it's a "mature" pdf? Hmm, judging by your github join date I'm guessing you're new to gallery-dl? Do you have your own config file for gallery-dl, and did you set it up to connect to your deviantArt account? I think for mature content (and to greatly reduce the amount of "429 Too Many Request" errors) you need to connect that, the setup is explained here: https://github.com/mikf/gallery-dl/blob/master/docs/configuration.rst#api-tokens--ids I know "register an application" sounds a bit daunting, but you don't even need to upload nor submit anything, just fill out a short form and hit "save" is enough. Then run "gallery-dl.exe --config MY_CONFIG.conf oauth:deviantart" and it'll pop up a browser window with some text you should copy (though gallery-dl apparently also saves this automatically). As for how to set up your config file... example config files: Also my own config looks something like this, feel free to include / exclude the things you (don't) need:
This also enables description download for deviantArt ( "postprocessors": [{ ... ), enables downloading embedded deviantArt images from descriptions and journals ( "extra": true ), and also does some extra stuff for twitter too like enable text-downloads ( "text-tweets": true,... ) and to separate retweets ( "directory": { ... / "postprocessors": [ ... ) from the account's own tweets. Also try a few more times than 5 retries since my internet is spotty sometimes. :P Ask away if you have any questions. :) |
No dice. I already have a Deviantart Config set up (yoinked it off the examples page) that works fine for most NSFW. It's only this specific format that it seems to be outputting just the covers for. config:
I added the cookies portion by using This extension on chrome, opening up DA, and just grabbing the cookies as netscape format. IDK if that's the right way to get them. Adding that in and trying to redownload has not helped, it still just gives me the jpg for the cover image. |
"Get cookies.txt locally" is the same extension I'm using too. Hmm. Well I guess it's not as easy as I thought. Still works for me though, even though deviantArt has had another site change just earlier today apparently. Okay, well... couple shots in the dark, maybe we'll hit something:
...Not that I can pretend to understand the full intricacies of gallery-dl's inner workings (I don't even speak Python), but perhaps something can be gleamed from comparing our verbose outputs? |
woops, did not mean to close this |
Don't think this has anything to do with gallery-dl's inner workings, it's simply the dA API behavior, caused by the changes they are making right in the moment, apparently.. |
I mean we're running this at the exact same time i'd assume. Could it be a mac vs pc issue? I don't think that'd be the case but I don't want to rule anything out (why would mac vs pc affect a website?) here's my --verbose log
I tried moving the cookies, didn't change anything. |
"[gallery-dl][debug] Version 1.25.8" That's the wrong version. Current version is 1.26.0, it was released on October 3rd. I just tested 1.25.8, that one indeed only gets the preview image. 1.26.0 gets the pdf file. Although yes, as Hrxn pointed out, right now, as of literally earlier today (or yesterday depending on your time zone), deviantArt did change something again, making this not quite the best time to do a mass download until that problem is fixed. -_- ( It would've literally only taken one more day to finish my 6 day download marathon... grumble grumble ) |
DeviantArt must've changed something a few months ago, because gallery-dl no longer downloads PDF files, only their preview images. Last PDF-download I have is from October 13th, 2022.
Example:
gallery-dl_1.25.0.exe --config myConfig.conf https://www.deviantart.com/timsplosion/art/R-R2-March-of-the-Penguins-Storyboard-Part-Two-324286646
Tested with versions 1.25.0 (as well as older versions 1.23.4, 1.24.4, 1.24.5 a few months back)
Don't know if the same applies to other non-artwork file formats, though Flash-files and *.zip were fine last I checked on v1.24.5.
My config:
The text was updated successfully, but these errors were encountered: