-
-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove A/ prefix in paths #104
Comments
It is still necessary to have two entries per resources: one with the payload, one with the headers. We use We could remove raw=true was awaiting kiwix/libkiwix#671 We can remove it now. index manipulation is not a scraper's responsibility. All we can do is specify the FRONT_ARTICLE hints, which we do. |
Thanks for the clarification. Agree.
Would be great if not too complicated.
Default (newest) libzim behaviour should allow to do this properly based indeed on the indication if we deal with a front-article or not. But AFAIK this is not done so far in warc2zim, because at the time we made the warc2zim it was not possible. |
As mentioned, it's done already. We are specifying FRONT_ARTICLE=False for our static files and for the Headers Entries. For the rest of the Entries (everything), we don't specify hints expecting the default behavior to apply (if HTML > is_front) but there's probably a bug around that as even non-html are returned in random. |
We had a long discussion of the issue of removing these pseudo-namespaces or prefixes in #99. At the time, it was decided to keep both I'm happy for the I'll be very glad to see the end of |
@Jaifroid we've decided we don't want to get into this at the moment. One important piece of code actually depends on it and it would be really fragile not to. The following piece of JS is injected before warc2zim/src/warc2zim/templates/sw_check.html Lines 1 to 23 in f8a8ddb
Also, yes, we are dropping the |
OK, thanks for the update. Maybe, in due course, we can think of other solutions than injecting that script into every article. Thanks for the information about |
Yes, that's part of the “where should the SW be?” discussion |
@rgaudin @mgautierfr Can we update this ticket please considering the Zimit2. I kind of think this should be done for Zimit2 but I don't have all the rationals. |
I'd say we close this as wontfix since we are using different paths for zimit2 which makes it less relevant to this ticket's history which was SW related |
This is not necessary anymore since libzim7 and complexify without need the code.
This implies as well to remove the hack with mime-type "text/html;raw=true" and properly put things in:
Maybe something should be fixed in Webac as well
The text was updated successfully, but these errors were encountered: