Releases: openzim/zimit
2.1.6
Changed
Upgrade to browsertrix crawler 1.3.5 (#426)
2.1.5
Changed
Upgrade to browsertrix crawler 1.3.4 and warc2zim 2.1.3 (#424)
2.1.4
Changed
Upgrade to browsertrix crawler 1.3.3 (#411)
2.1.3
Changed
Upgrade to browsertrix crawler 1.3.2, warc2zim 2.1.2 and other dependencies (#406)
Fixed
2.1.2
Changed
Upgrade to browsertrix crawler 1.3.0-beta.1 (#387); fixes "Ziming a website with huge assets (e.g. PDFs) is failing to proceed" (#380)
2.1.1
Added
Add support for uncompressed tar archives in --warcs (#369)
Changed
Upgrade to browsertrix crawler 1.3.0-beta.0 (#379), including upgrade to Ubuntu Noble (#307)
Fixed
Stream file downloads to avoid exhausting memory (#373)
Fix documentation on --diskUtilization setting (#375)
2.1.0
Added
Add --custom-behaviors argument to support path/HTTP(S) URL custom behaviors to pass to the crawler (#313)
Add daily automated end-to-end tests of a page with YouTube player (#330)
Add --warcs option to directly process WARC files (#301)
Changed
Make it clear that --profile argument can be an HTTP(S) URL (and not only a path) (#288)
Fix README imprecisions + add back warc2zim availability in docker image (#314)
Enhance integration test to assert final content of the ZIM (#287)
Stop fetching and passing browsertrix crawler version as scraperSuffix to warc2zim (#354)
Do not log number of WARC files found (#357)
Upgrade dependencies (warc2zim 2.1.0)
Fixed
Sort WARC directories found by modification time (#366)
2.0.6
Changed
Upgraded Browsertrix Crawler to 1.2.6
2.0.5
Changed
Upgraded Browsertrix Crawler to 1.2.5
Upgraded warc2zim to 2.0.3
2.0.4
Changed
Upgraded Browsertrix Crawler to 1.2.4 (fixes "retrieve automatically the assets present in a data-xxx tag", #316)