Math not parsed in some docx files #3365

Closed
jgm opened this issue Jan 19, 2017 · 0 comments

jgm commented Jan 19, 2017

See this discussion on pandoc-discuss.

The docx reader currently seems to expect all namespace prefixes to be declared on the root document element, but in some docx files they are not. For example, this docx has a tag

<m:oMath xmlns:m="http://schemas.openxmlformats.org/officeDocument/2006/math">

where 'm' is not declared on the document element.
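To make the failure mode concrete, here is a minimal Python sketch (not pandoc's Haskell reader code). The `SAMPLE` document is a hypothetical, heavily trimmed stand-in for `word/document.xml`, and the helper names `root_prefixes` and `math_texts` are made up for illustration. It shows that a lookup restricted to the root element's declarations never sees `m`, while a namespace-aware parse that honours declarations on any ancestor still finds the math element.

```python
import io
import xml.etree.ElementTree as ET

WORD_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
MATH_NS = "http://schemas.openxmlformats.org/officeDocument/2006/math"

# Hypothetical sample: 'w' is declared on the root, 'm' only on the math element.
SAMPLE = (
    f'<w:document xmlns:w="{WORD_NS}">'
    "<w:body><w:p>"
    f'<m:oMath xmlns:m="{MATH_NS}"><m:r><m:t>x</m:t></m:r></m:oMath>'
    "</w:p></w:body></w:document>"
)


def root_prefixes(xml_text):
    """Collect only the prefix declarations found on the root element."""
    prefixes = {}
    stream = io.BytesIO(xml_text.encode("utf-8"))
    # 'start-ns' events for an element are reported before its 'start' event,
    # so everything seen before the first 'start' belongs to the root.
    for event, payload in ET.iterparse(stream, events=("start-ns", "start")):
        if event == "start":
            break
        prefix, uri = payload
        prefixes[prefix] = uri
    return prefixes


def math_texts(xml_text):
    """Find oMath elements by namespace URI, wherever the prefix was declared."""
    root = ET.fromstring(xml_text)
    return ["".join(el.itertext()) for el in root.iter(f"{{{MATH_NS}}}oMath")]


if __name__ == "__main__":
    print(root_prefixes(SAMPLE))  # contains 'w' but not 'm'
    print(math_texts(SAMPLE))     # the math element is still found
```

The point of the sketch is only that prefix bindings are scoped to the element that declares them and its descendants, so a reader has to track declarations as it descends rather than reading them once from the root.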

We need to be able to recognize math when it is inserted this way.

Side note: the docx reader also seems to expect that m will always be the prefix for math, etc. If we're making this assumption, why do we even bother looking up namespace declarations? Perhaps we shouldn't make the assumption (though it seems solid for every Word file I've seen). Alternatively we could work with the URIs themselves, but then we might run into the problem of different versions (e.g., 2006 vs 2011).
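As a rough sketch of the "work with the URIs themselves" option, the Python below (again, not pandoc code) matches math elements on their resolved namespace URI and ignores the prefix entirely. The helper names `split_tag`, `is_math`, and `count_math` are hypothetical, and the set of accepted URIs is an assumption: only the 2006 URI quoted in this issue is listed, and any other version would have to be added explicitly.

```python
import xml.etree.ElementTree as ET

MATH_2006 = "http://schemas.openxmlformats.org/officeDocument/2006/math"

# Assumption: only the 2006 URI is known here; extend this set for other versions.
DEFAULT_MATH_URIS = frozenset({MATH_2006})


def split_tag(tag):
    """Split ElementTree's '{uri}local' tag form into (uri, local)."""
    if tag.startswith("{"):
        uri, local = tag[1:].split("}", 1)
        return uri, local
    return "", tag


def is_math(elem, accepted=DEFAULT_MATH_URIS):
    """True if elem is an oMath element in one of the accepted namespaces."""
    uri, local = split_tag(elem.tag)
    return local == "oMath" and uri in accepted


def count_math(xml_text, accepted=DEFAULT_MATH_URIS):
    """Count oMath elements, regardless of which prefix the file uses."""
    return sum(1 for e in ET.fromstring(xml_text).iter() if is_math(e, accepted))
```

For example, `count_math` gives the same result for `<r xmlns:m="..."><m:oMath/></r>` and `<r xmlns:x="..."><x:oMath/></r>` when both prefixes are bound to the 2006 URI. The trade-off is the one noted above: a file using a different namespace version is missed unless its URI is added to the accepted set.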

@jkr can you have a look and see what you think should be done?
