Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Syncing up code moves from IA #6

Merged
merged 29 commits into from
Dec 12, 2013
Merged

Syncing up code moves from IA #6

merged 29 commits into from
Dec 12, 2013

Conversation

anjackson
Copy link
Member

This pull request contains the latest code from the internetarchive/ia-web-commons fork, which moves the Heritrix3 code required by OpenWayback into the commons so that H3 need no longer be a dependency. I pulled in their code and then patched it up slightly to match the org.netpreserve names etc.

ikreymer and others added 29 commits October 3, 2013 16:21
…ired (handled later), add a locCacheMaxDuration to cache only if below threshold
…ization for zipnum clusters to be loaded sequentially when only looking for last line
… print connectedUrl on error, skip cached url on fallback to location list
…ators

FIX ZIPNUM: Flush cache, if any, when reloading locations
…tion

* use connectedUrl in all exceptions messages
* attemptBlockLoad: use SEVERE only on last retry of required cluster which will lead to a 503, use WARNING otherwise
ZIPNUM: track 2nd attempt to load correctly
Switch to use ThreadLocalHttpConnectionManager as default for ApacheSLR!
extractrule, check for empty string
…at wayback doesn't have to depend on heritrix-commons
Conflicts:
	src/main/resources/effective_tld_names.dat
anjackson added a commit that referenced this pull request Dec 12, 2013
Syncing up code moves from IA
@anjackson anjackson merged commit 3579cea into iipc:master Dec 12, 2013
nlevitt pushed a commit to nlevitt/webarchive-commons that referenced this pull request Feb 1, 2014
get rid of RecyclingFastBufferedOutputStream, which was supposed to avoi...
sebastian-nagel pushed a commit to sebastian-nagel/webarchive-commons that referenced this pull request Dec 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants