Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Site Request] 4chan Archives #4012

Closed
cheese529 opened this issue May 5, 2023 · 7 comments
Closed

[Site Request] 4chan Archives #4012

cheese529 opened this issue May 5, 2023 · 7 comments

Comments

@cheese529
Copy link

Imgur will be purging all NSFW content on May 15th which is in less than 10 days and all of the original 4chan archives are hosted using Imgur so it would be nice if we could get a chance to download everything before it gets purged. Here's the website https://4chanarchives.com/ and here's an example thread https://4chanarchives.com/board/hr/thread/2693240

@cheese529 cheese529 changed the title Site:Support 4chan Archives [Site Request] 4chan Archives May 5, 2023
@mikf
Copy link
Owner

mikf commented May 5, 2023

Duplicate of #1262 and #2418

nvm

@cheese529
Copy link
Author

cheese529 commented May 6, 2023

do you think it would be possible to get this site added asap? We only have until May 15th before everything on here is purged thx to imgur. I even tried to use chat gpt to write a python script in efforts of doing this but I had zero luck :/

@mikf
Copy link
Owner

mikf commented May 6, 2023

Well, here you go: 1406f71

The board extractor supports starting from a given page, e.g. https://4chanarchives.com/board/c/100

@mikf
Copy link
Owner

mikf commented May 6, 2023

Use this to write metadata for all posts

    "postprocessors": [
        {
            "name": "metadata",
            "event": "post",
            "filename": "{no}.json"
        }
    ]

to only write the posted text, add "format": "{com}"

@cheese529
Copy link
Author

Thank you so much for this! Do I just copy that python code and paste it into my gallery-dl folder where the rest of the extractors are in order for it to work?

@mikf
Copy link
Owner

mikf commented May 7, 2023

That wouldn't be enough. What you can do is put this file in a separate directory, potentially fix any gallery_dl imports, and use -X path/to/directory to load additional extractors from it.

Or you grab the new release.

@mikf mikf closed this as completed May 7, 2023
@cheese529
Copy link
Author

thank you very much for doing this on such quick notice, it works perfectly!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants