Latex reader: cannot use macro in image path? #3236

sgdjs · 2016-11-16T22:15:39Z

For the same Latex document to come out in different "flavors", I put a macro in the image path, like this:

\documentclass{report}

\usepackage{graphicx}
\newcommand{\mycolor}{red}

\begin{document}

\includegraphics[width=17cm]{\mycolor /header}
Magnificent \mycolor{} header.

\end{document}

So with images named the same in both red and blue folders, changing the value of mycolor once changes all the images of the file.

Pandoc conversion to HTML expands only the second macro correctly: pandoc source.tex -o index.html --default-image-extension=png gives

<p><img src="\mycolor /header.png" alt="image" width="642" /></p>
<p>Magnificent <span>red</span> header.</p>

How can I convert such a LaTeX document to HTML with images? Thanks.

cagix · 2016-11-17T07:31:21Z

IMHO the arguments of LaTeX macros (and environments) are passed to the writer as "raw tex". This works fine for the LaTeX writer, since LaTeX can expand your macro inside the \includegraphics. However, the HTML writer does not understand the TeX code inside the \includegraphics (i.e. \mycolor /header in the given example), so the argument is processed neither by pandoc nor by the HTML writer (it is "raw tex").

You could try some kind of pandoc filter, like

from pandocfilters import toJSONFilter, Str, Image
import re

# Capture the argument of \includegraphics[...]{...}.
image = re.compile(r'\\includegraphics.*?\{(.*)\}$')

def textohtml(key, value, format, meta):
    # Only raw TeX inlines are candidates for conversion.
    if key == 'RawInline':
        fmt, s = value
        if fmt == "tex":
            m = image.match(s)
            if m:
                # Two-argument form of Image (pandocfilters < 1.3.0).
                return Image([Str("description")], [m.group(1), ""])

if __name__ == "__main__":
    toJSONFilter(textohtml)

(Not tested. It should work for pandocfilters < 1.3.0; in pandocfilters >= 1.3.0, Image takes additional parameters.)

sgdjs · 2016-11-17T23:06:59Z

Hello, thanks for the answer. I have not used filters before but I'll test your solution when I can. I suppose this issue can be closed if it's the best way.
Thanks again

cagix · 2016-11-18T08:00:50Z

Hmmm, it seems, I had an "out of coffee exception" without proper exception handler in place ;(

The proposed solution will not solve your problem completely, since the argument of \includegraphics is still not processed, i.e. you would still end up in the html with <img src="\mycolor /header.png">. The filter needs to process the m.group(1) before returning the image ...
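As a rough illustration of the missing step, the sketch below expands argument-less macros inside an image path. It rests on several assumptions: the macro table MACROS is hypothetical and has to be kept in sync with the \newcommand definitions by hand, since the filter cannot see them; the names expand_macros and expand_image_path are made up for this example; and it assumes pandoc delivers the \includegraphics as an Image element whose last field is the [url, title] pair, as the HTML output in the original report suggests.

```python
import re

# Hypothetical macro table; it has to mirror the \newcommand definitions
# of the LaTeX source by hand, because the filter cannot see them.
MACROS = {"mycolor": "red"}

# A control word (backslash + letters), followed either by an empty group
# or by the spaces that TeX skips after a control word.
CONTROL_WORD = re.compile(r"\\([A-Za-z]+)(?:\{\}|[ \t]*)")


def expand_macros(text, macros=MACROS):
    """Replace known argument-less macros in text; leave unknown ones alone."""
    def substitute(match):
        name = match.group(1)
        return macros[name] if name in macros else match.group(0)
    return CONTROL_WORD.sub(substitute, text)


def expand_image_path(key, value, format, meta):
    """Filter action: expand macros in the target URL of Image elements."""
    if key == "Image":
        # The target [url, title] is the last field in both the two-field
        # and the three-field (with attributes) layout of Image.
        url, title = value[-1]
        return {"t": "Image", "c": value[:-1] + [[expand_macros(url), title]]}
```

To try it as an actual filter, the action could be handed to toJSONFilter from pandocfilters, as in the script above, and run with pandoc's --filter option. With the table shown, the path \mycolor /header.png becomes red/header.png.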

This rewrite is primarily motivated by the need to get macros working properly (#982, #934, #3779, #3236, #1390, #2888, #2118). A side benefit is that the reader is significantly faster (27s -> 19s in one benchmark, and there is a lot of room for further optimization). We now tokenize the input text, then parse the token stream. Macros modify the token stream, so they should now be effective in any context, including math. Thus, we no longer need the clunky macro processing capacities of texmath. A custom state LaTeXState is used instead of ParserState. This, plus the tokenization, will require some rewriting of the exported functions rawLaTeXInline, inlineCommand, rawLaTeXBlock.

* Added Text.Pandoc.Readers.LaTeX.Types (new exported module). Exports Macro, Tok, TokType, Line, Column. [API change]
* Text.Pandoc.Parsing: adjusted type of `insertIncludedFile` so it can be used with token parser.
* Removed old texmath macro stuff from Parsing. Use Macro from Text.Pandoc.Readers.LaTeX.Types instead.
* Removed texmath macro material from Markdown reader.
* Changed types for Text.Pandoc.Readers.LaTeX's rawLaTeXInline and rawLaTeXBlock. (Both now return a String, and they are polymorphic in state.)
* Added orgMacros field to OrgState. [API change]
* Removed readerApplyMacros from ReaderOptions. Now we just check the `latex_macros` reader extension.
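The idea of expanding macros on the token stream, rather than on parsed elements, can be sketched with a toy model. This is not pandoc's Haskell implementation: the names TOKEN, tokenize and expand are invented for this example, the macro table is supplied by hand instead of being collected from \newcommand, and macros with arguments are not handled.

```python
import re

# Toy tokenizer: a control word (swallowing the spaces after it, as TeX
# does), a control symbol, or any other single character.
TOKEN = re.compile(r"\\[A-Za-z]+[ \t]*|\\.|[^\\]", re.DOTALL)


def tokenize(source):
    """Split LaTeX source into a flat list of token strings."""
    return TOKEN.findall(source)


def expand(tokens, macros, limit=1000):
    """Expand argument-less macros by pushing their bodies back onto the input."""
    pending = list(reversed(tokens))  # stack: the next token is at the end
    output = []
    expansions = 0
    while pending:
        tok = pending.pop()
        name = tok.strip()[1:] if tok.startswith("\\") else None
        if name in macros:
            expansions += 1
            if expansions > limit:
                raise RecursionError("macro expansion limit exceeded")
            # The body is re-tokenized and read again, so nested macros
            # are expanded as well.
            pending.extend(reversed(tokenize(macros[name])))
        else:
            output.append(tok)
    return output
```

Because the substitution happens before any parsing, the context of the macro does not matter. Joining the result of expand(tokenize(r"\includegraphics[width=17cm]{\mycolor /header}"), {"mycolor": "red"}) gives \includegraphics[width=17cm]{red/header}, and a macro between math delimiters is replaced in the same way.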

jgm added format:LaTeX reader labels Dec 7, 2016

jgm mentioned this issue Mar 9, 2017

LaTeX Reader does not handle environment delimiters or block commands defined with a macro #982

Closed

jgm added a commit that referenced this issue Jul 7, 2017

Added test cases for #1390, #2118, #3236, #3779, #934, #982.

aa55995

Not all passing yet.

jgm closed this as completed in 0feb750 Jul 7, 2017
