Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Workaround for over-encoded URLs in zimit2 #1230

Merged
merged 8 commits into from
May 1, 2024

Conversation

Jaifroid
Copy link
Member

@Jaifroid Jaifroid commented Mar 14, 2024

The first commit is a workaround for over-encoded URLs in zimit2.

However, we could also usefully check to see that we are handling clicks on URLs in Wikipedia articles where an encoded question mark that is not a querystring needs to be decoded. For example, an HTML URL like What_About_Me%3F_(Kenny_Rogers_album) needs to be decoded to A/What_About_Me?_(Kenny_Rogers_album) in order to find it in the ZIM and the material after the ? must not be removed.

Should fix #1229. Should only be merged pending the outcome of openzim/libzim#865.

@Jaifroid Jaifroid added bug backend zimit Code relating to the support of Zimit-style archives labels Mar 14, 2024
@Jaifroid Jaifroid added this to the v4.1 milestone Mar 14, 2024
@Jaifroid Jaifroid self-assigned this Mar 14, 2024
@Jaifroid Jaifroid marked this pull request as ready for review March 27, 2024 10:50
@Jaifroid
Copy link
Member Author

Jaifroid commented May 1, 2024

Since further test ZIMs are not pending, we should merge and deal with any subsequent issues in a separate PR.

@Jaifroid Jaifroid merged commit 85a9d82 into main May 1, 2024
9 checks passed
@Jaifroid Jaifroid deleted the Workaround-for-over-encoded-querystrings branch May 1, 2024 09:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backend bug zimit Code relating to the support of Zimit-style archives
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Work around the workaround: deal with querystrings in zimit2 ZIMs where the URL separators have been encoded
1 participant