Fix for issue #116 #215
Add "org.apache.httpcomponents.httpclient" dependency.
Add a bugfix at the end of "Bug Fixes" list of "OpenWayback 2.1.0 Release" section.
Add a RemoteLiveWebCache2-related comment.
Include myself in developers list.
Having a classname like If Otherwise, if
In this scenario (where both classes are retained), I think we should retain the old one as the default option in
On reflection, it is probably better to describe this class as an alternate implementation. The new class is a modified version of the old one that works with a standard proxy server (e.g., Squid) rather than with the ArcRecordingProxy. We went that route because our experience with the ArcRecordingProxy was not good: we kept running into connection issues that made the Wayback unusable. I could suggest renaming the old and new classes to the following, respectively: ArcRemoteLiveWebCache I realize the new class needs a good deal of cleaning up, but it works so far, and it is what we have in production at the BA right now. Thank you for any feedback.
That's starting to make sense to me. Perhaps start by adding JavaDoc descriptions of the classes' functionality?
OK, I will work on it over the next few days. Should I push more commits to the same branch or create a new branch with the modified contents?
You can push the commits to the same branch. They will then become part of this pull request.
Extends ArcRemoteLiveWebCache instead of RemoteLiveWebCache
I have finished adding JavaDoc descriptions to both classes (StdRemoteLiveWebCache.java and ArcRemoteLiveWebCache.java) and updating LiveWeb.xml to match the changes. As shown above, Travis CI failed to build my first commits. This is because LiveRobotsNoCache.java extends ArcRemoteLiveWebCache.java, so I should have committed it first. Could you rebuild 'Update ArcRemoteLiveWebCache.java' (e980049) and 'Update LiveWeb.xml' (4815ab1) instead of having me commit them again? Thank you.
Please don't merge StdRemoteLiveWebCache: after some testing, I noticed it is not working properly. I have added two features to the current version of StdRemoteLiveWebCache. Lines 122-126 set the connection timeout to 10 seconds, which improves OpenWayback's responsiveness. This part works correctly. As you know, OpenWayback gets robots.txt from the live web. If a domain is down or not responding (for example, its HTTP status code is 404), OpenWayback fails to get robots.txt and throws a connection timeout exception ("unable to get the robots.txt document to display this page"). Lines 133-142 should fix this issue, but they are not working as expected. OpenWayback is reasonably stable if lines 133-142 are excluded from StdRemoteLiveWebCache, but it then will not display the contents of any down domain (HTTP status 404), which defeats the purpose of web archiving. I will keep working on it and commit again once it works reliably.
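For readers following along, here is a minimal sketch of the 10-second timeout idea when fetching through a standard proxy such as Squid. It is not the code from this pull request: it uses the JDK's `java.net.HttpURLConnection` rather than the HttpClient dependency added here, and the class name `ProxyTimeoutSketch`, the method `open`, and the proxy host and port are all hypothetical. Nothing is sent over the network until the caller actually reads from the connection.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.InetSocketAddress;
import java.net.Proxy;
import java.net.URI;
import java.net.URL;

// Hypothetical illustration only; not part of OpenWayback.
class ProxyTimeoutSketch {
    // 10 seconds, matching the timeout described in the comment above.
    static final int TIMEOUT_MS = 10_000;

    // Prepares (but does not open) a connection that goes through an HTTP proxy
    // and gives up after TIMEOUT_MS when connecting or when waiting for data.
    static HttpURLConnection open(URL url, String proxyHost, int proxyPort) throws IOException {
        Proxy proxy = new Proxy(Proxy.Type.HTTP,
                InetSocketAddress.createUnresolved(proxyHost, proxyPort));
        HttpURLConnection conn = (HttpURLConnection) url.openConnection(proxy);
        conn.setConnectTimeout(TIMEOUT_MS); // limit on establishing the TCP connection
        conn.setReadTimeout(TIMEOUT_MS);    // limit on each blocking read of the response
        return conn;
    }

    public static void main(String[] args) throws IOException {
        // Assumed values: a local Squid listening on its default port 3128.
        URL robots = URI.create("http://example.org/robots.txt").toURL();
        HttpURLConnection conn = open(robots, "localhost", 3128);
        System.out.println("connect timeout ms: " + conn.getConnectTimeout());
        System.out.println("read timeout ms: " + conn.getReadTimeout());
    }
}
```

With both timeouts set, a later `conn.getResponseCode()` call throws `java.net.SocketTimeoutException` if the proxy or the origin does not answer in time, so a caller can catch that exception and decide how to treat the missing robots.txt, instead of blocking indefinitely.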
Closing in favor of PR #251