-
Notifications
You must be signed in to change notification settings - Fork 276
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
output xml didn't follow spec in non ASCII character URI #346
Comments
Anyone has the idea? |
Hi @lisbethw1130 I think you're right. When I wrote this gem years ago it wasn't internationalized to handle UTF-8 and that wasn't as prevalent as it is today. It would be great if you could add that functionality, with tests :) |
Here's some obstacle I bumped in and solving:
Any idea is welcome ;) |
Awesome that the change was released in Ruby, @lisbethw1130! 🚀 |
As sitemap spec mentioned, the xml itself should do a xml entity escape, which the gem already have.
But the url should first do the RFC-3986 standard for URIs or the RFC-3987 standard for IRIs, and xml entity escape at last. sitemap generator seems didn't follow RFC-3986 now.
can someone help me to check if my conclusion is right since I'm just a junior programmer and not sure it's right.
If everything is OK, a PR for this issue will be sent later.
Best Regards,
Lisbeth
The text was updated successfully, but these errors were encountered: