Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sync upstream #1

Merged
merged 119 commits into from
May 26, 2020
Merged

sync upstream #1

merged 119 commits into from
May 26, 2020

Conversation

mo-han
Copy link
Owner

@mo-han mo-han commented May 26, 2020

No description provided.

mo-han and others added 30 commits March 31, 2020 21:59
Extract all sta.sh items in a single extractor run.
Don't spawn a new StashExtractor for each individual sta.sh item to
preserve the current requests.Session and its opened TCP connections.
'/extended_fetch' as well as Deviation webpages now again contain
Deviation UUIDs needed to grab Deviation info through the OAuth API,
meaning cookies are no longer necessary to grab original files.

The only instance were cookies are still needed are scraps marked as
"mature", since those entries are hidden for public users.

(#655, #657, #660)
- add a 2 second wait time between requests to deviantart.com
- catch 403 "Request blocked" errors and wait for 3 minutes until
  retrying
"Request blocked" can also happen on sta.sh and for *any* HTTP
request directed at deviantart.com
- allow 'until' to be a datetime object
- do "time calculations" with UTC timestamps
- set a default 'reason'
Calls to config.clear() from other tests are removing the API
credentials set when importing mastodon.py for the first time.
i.e. /g4/data.php?id=…

- get filename & extension from Content-Disposition header
- handle all downloadable file types (docx, swf, etc)
'q_\d+' would sometimes also replace something in the 'token' query
parameter, invalidating the URL.
mikf and others added 29 commits May 12, 2020 20:17
There are currently no situations where forwarding gallery-dl's cookies
to youtube-dl is necessary, and it only causes problems when forcing
youtube-dl for Twitter video downloads while logged in.
- move code into its own function
- add enumeration index to filenames
- dump responses regardless of status code
* [imagechest] Add new extractor for ImageChest

* [imagechest] Fix flake8 compliance issues
The webtoons extractor can extract episode and entire comic (all
episodes) from webtoons.com.

All the logic of the extractors should be trivial except for a couple
of kludges needed:

 - `ageGatePass' cookie is always set to avoid possible redirect and stop of
    extraction, especially in the comic extractor
 - The image URLs returned by the episode extractor could not be fetched
   directly and the `Referer:' HTTP header needs to be passed to fetch them

Close #593.
- support changing values for 'k'
- use XML parser to get request parameters
  (some input fields are now embedded in an HTML comment)
use 'pool:<pool id>' as search tag to get pool posts
Wrap all loggers used by job, extractor, downloader, and postprocessor
objects into a (custom) LoggerAdapter that provides access to the
underlying job, extractor, pathfmt, and kwdict objects and their
properties.

__init__() signatures for all downloader and postprocessor classes have
been changed to take the current Job object as their first argument,
instead of the current extractor or pathfmt.

(#574, #575)
- fix episode listings for french comics
- allow input URLs without explicit scheme
- add 'lang'/'language' metadata
- use str.format() instead of '+' to assemble URLs
Downloading the pre-rendered versions should be a better default
than .zip files with individual frames.
Allow its value to be a JSON object / Python dict that specifies
a mapping from invalid/unwanted input characters to specific
output characters.

For example {"/": "-", "*": "+"} will transform
"foo / ***bar***" into "foo - +++bar+++"

(closes #662, #755)
- include 'igneous' and 'hath_perks' in Exhentai cookies
- add an example of how to write DeviantArt description to file
- add a 'path-restrict' mapping from invalid characters in Windows
  paths to Unicode alternatives (taken from #662)
@mo-han mo-han merged commit f4f57db into mo-han:master May 26, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants