Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sanitize characters not suitable for use with markup #106

Closed
gmile opened this issue Jun 10, 2014 · 1 comment
Closed

Sanitize characters not suitable for use with markup #106

gmile opened this issue Jun 10, 2014 · 1 comment

Comments

@gmile
Copy link

gmile commented Jun 10, 2014

A feature request, rather than an issue.

Can we have a sanitization of symbols, that are discouraged for use by the XML committee?

According to this document, specifically the "3. Characters not Suitable for use With Markup" section, there's a list of characters we could sanitize.

The need for this have risen after our users started to input control characters, randomly. A similar issue was found on StackOverflow.

@rgrove
Copy link
Owner

rgrove commented Jun 10, 2014

Seems like a good idea. This would probably be best implemented as a pre-parsing step.

I'm actually curious whether the Gumbo parser used in the dev-3.0.0 branch might already do some of this for us. Worth testing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants