
[Site Support Request] Wikipedia and Wikimedia #1443

Open
paulolimac opened this issue Apr 8, 2021 · 6 comments

Comments

@paulolimac

Is there any way to download from Wikipedia and Wikimedia domains? My commands were unsuccessful:

$ gallery-dl https://commons.wikimedia.org/wiki/Category:1st_Horseman_of_the_Apocalypse
[gallery-dl][error] No suitable extractor found for 'https://commons.wikimedia.org/wiki/Category:1st_Horseman_of_the_Apocalypse'

$ gallery-dl https://en.wikipedia.org/wiki/Gustave_Dor%C3%A9
[gallery-dl][error] No suitable extractor found for 'https://en.wikipedia.org/wiki/Gustave_Dor%C3%A9'
@mikf
Owner

mikf commented Apr 8, 2021

Not at the moment.

@paulolimac
Author

OK then, thanks for the reply :)

@mikf mikf reopened this Apr 10, 2021
@mikf mikf changed the title [question] how to download images from wikipedia and wikimedia? [Site Support Request] Wikipedia and Wikimedia Apr 10, 2021
@Ailothaen
Contributor

After looking into it a bit: Wikipedia (and any MediaWiki site, in general) has an API that can be used to retrieve the images from an article (and presumably from other pages as well).

An example:

  1. https://en.wikipedia.org/w/api.php?action=parse&page=Pet_door&prop=images&format=json to retrieve all image names from an article
  2. https://en.wikipedia.org/w/api.php?action=query&titles=File:Gatera_de_ademuz.jpg&prop=imageinfo&iiprop=url to retrieve the full URL for an image name (since the exact path can change depending on the language version)

I guess I could try to implement an extractor if I someday find the time for it 0:)
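The two-step lookup above can be sketched with the standard library alone. This is a rough illustration based on the two URLs in the comment, not gallery-dl's actual extractor code; error handling, pagination, and API continuation are omitted.

```python
import json
import urllib.parse
import urllib.request

API = "https://en.wikipedia.org/w/api.php"

def api_get(params):
    """Perform a GET request against the MediaWiki API and decode the JSON."""
    params = dict(params, format="json")
    url = API + "?" + urllib.parse.urlencode(params)
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)

def article_image_names(page):
    """Step 1: list the image file names used by an article."""
    data = api_get({"action": "parse", "page": page, "prop": "images"})
    return data["parse"]["images"]

def image_url(name):
    """Step 2: resolve one file name to its full upload URL
    (the exact path varies between language versions)."""
    data = api_get({"action": "query", "titles": "File:" + name,
                    "prop": "imageinfo", "iiprop": "url"})
    pages = data["query"]["pages"]
    return next(iter(pages.values()))["imageinfo"][0]["url"]

if __name__ == "__main__":
    for name in article_image_names("Pet_door"):
        print(image_url(name))
```

The same two calls work against any MediaWiki instance by swapping the `API` base URL.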

@rautamiekka
Contributor

rautamiekka commented May 11, 2021

I wonder if there is any public documentation of the MediaWiki URL syntax outside the source code ... I couldn't find any with a very quick search, and I don't feel like digging through the source code.

At first I was thinking "match everything after /wiki/ up to a question mark", since I knew MediaWiki supports sub-articles, which show up as /wiki/ORIGINAL_ARTICLE/SUB_ARTICLE (with the /SUB_ARTICLE part possibly repeating), but then I started thinking that matching only up to a question mark might exclude some articles.
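The matching rule discussed above could look something like this hypothetical pattern: capture everything after `/wiki/`, including any `/SUB_ARTICLE` segments, stopping at an optional query string or fragment.

```python
import re

# Hypothetical pattern for the rule discussed above: the title is
# everything after /wiki/ up to a "?" (query string) or "#" (fragment).
WIKI_PATH = re.compile(r"/wiki/([^?#]+)")

m = WIKI_PATH.search(
    "https://en.wikipedia.org/wiki/Article/Sub_article?action=history")
print(m.group(1))  # Article/Sub_article
```

Whether this excludes any legitimate article titles (e.g. titles containing a literal "?") is exactly the open question raised in the comment.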

@Ailothaen
Contributor

Random question for @mikf (it is slightly related to this issue, but I do not see a better place to post it): is there any documentation that specifies how to write an extractor? By that, I mean how to use the Extractor class and which methods should be used in which context.

mikf added a commit that referenced this issue Jan 18, 2024
- support mediawiki.org
- support mariowiki.com (#3660)

- combine code into a single extractor
  (use prefix as subcategory)
- handle non-wiki instances
- unescape titles
mikf added a commit that referenced this issue Jan 20, 2024
Wikis hosted on fandom.com are just wikimedia instances
and support its API.
@GrimPixel

I have implemented this in my own repository: download.py. It may serve as inspiration.

bradenhilton pushed a commit to bradenhilton/gallery-dl that referenced this issue Feb 5, 2024
- support mediawiki.org
- support mariowiki.com (mikf#3660)

- combine code into a single extractor
  (use prefix as subcategory)
- handle non-wiki instances
- unescape titles
bradenhilton pushed a commit to bradenhilton/gallery-dl that referenced this issue Feb 5, 2024
Wikis hosted on fandom.com are just wikimedia instances
and support its API.
mikf added a commit that referenced this issue Feb 10, 2024
add wikidata.org and wikivoyage.org