Everything is an Event::Start with utf-16 files #563

Closed
bbergeron0 opened this issue Feb 24, 2023 · 1 comment
Labels: duplicate, encoding (issues related to support of various encodings of the XML documents)

Comments

@bbergeron0

I have several UTF-16 files which don't parse properly.

When looping with read_event_into, most stuff gets interpreted as a tag start, including comments, self-closing tags, CDATA, and closing tags.
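One plausible reading of this symptom (an assumption, not something confirmed in this thread) is that a byte-oriented scanner classifies markup by the byte that follows `<`, and in UTF-16LE that byte is always `0x00` rather than `/`, `!`, or `?`. The sketch below only illustrates the byte layout; `utf16le_bytes` and `byte_after_lt` are hypothetical helpers written for this illustration and are not part of quick-xml.

```rust
/// Encode a string as UTF-16LE bytes (no BOM), the way it would sit in a file.
fn utf16le_bytes(s: &str) -> Vec<u8> {
    s.encode_utf16().flat_map(|unit| unit.to_le_bytes()).collect()
}

/// Hypothetical helper: return the byte that a byte-oriented scanner
/// would see immediately after the first `<` byte in the input.
fn byte_after_lt(bytes: &[u8]) -> Option<u8> {
    let pos = bytes.iter().position(|&b| b == b'<')?;
    bytes.get(pos + 1).copied()
}
```

With UTF-8 input, `byte_after_lt(b"</a>")` yields `b'/'`, which is what marks a closing tag. With `utf16le_bytes("</a>")` it yields `0x00`, so a closing tag, a comment, and CDATA would all look the same to such a scanner.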

OS: Gentoo Linux
quick-xml version: "0.27", no additional feature flags
Input XML: These files are Skyrim "FOMod" files used for modding. This archive contains a UTF-16 encoded XML file named "EnhancedLightsandFX 3.06/fomod/ModuleConfig.xml" that doesn't parse properly with quick-xml.

@dralley
Collaborator

dralley commented Feb 24, 2023

Unfortunately, only single-byte encodings are supported at the moment: https://github.com/tafia/quick-xml/blob/master/src/encoding.rs#L31

The work to eventually enable it is tracked in #158, so I will close this issue as a duplicate.

I have a PR where I got it working in limited scenarios, however, I was roadblocked by the difficulty of async support and my free time mostly drying up for a few months. I'll pick it back up once I have time again.
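Until multi-byte encodings are supported, one possible workaround is to transcode the file to UTF-8 before handing it to the parser. The following is a minimal standard-library sketch, not the approach taken in the PR mentioned above. `decode_utf16_xml` is a hypothetical helper name, and the sketch assumes the file starts with a byte order mark, rejecting input that has none.

```rust
/// Decode UTF-16 bytes into a UTF-8 `String`.
/// Assumption: the input starts with a BOM (FF FE = little-endian,
/// FE FF = big-endian). Input without a BOM is rejected.
fn decode_utf16_xml(bytes: &[u8]) -> Result<String, String> {
    // Pick the byte order from the BOM and strip it from the body.
    let (big_endian, body) = match bytes {
        [0xFF, 0xFE, rest @ ..] => (false, rest),
        [0xFE, 0xFF, rest @ ..] => (true, rest),
        _ => return Err("missing UTF-16 byte order mark".to_string()),
    };
    if body.len() % 2 != 0 {
        return Err("odd number of bytes in UTF-16 input".to_string());
    }
    // Reassemble 16-bit code units in the detected byte order.
    let units: Vec<u16> = body
        .chunks_exact(2)
        .map(|pair| {
            let pair = [pair[0], pair[1]];
            if big_endian {
                u16::from_be_bytes(pair)
            } else {
                u16::from_le_bytes(pair)
            }
        })
        .collect();
    // Fails on unpaired surrogates instead of silently replacing them.
    String::from_utf16(&units).map_err(|e| e.to_string())
}
```

The resulting `String` could then be passed to a reader built from a string slice, for example quick-xml's `Reader::from_str`. The XML declaration inside the text will still name UTF-16 after transcoding.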

@dralley closed this as not planned (duplicate) Feb 24, 2023
@dralley added the duplicate and encoding labels Feb 24, 2023