I have a tool that converts an XML format into Markdown so that I can use Pandoc to turn it into EPUB for reading on an e-ink device.
It doesn't trim whitespace everywhere, so given input like
<doc>
<abstract>
Lorem ipsum
</abstract>
</doc>
it ended up injecting "\n Lorem ipsum\n " into the YAML metadata. This somehow resulted in the metadata block and the first section being treated as a table.
After trimming the whitespace in the abstract, it produces the expected result.
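For reference, the trimming workaround could look roughly like the sketch below. It assumes the converter is written in Python and parses the input with `xml.etree.ElementTree`; the function name `abstract_text` and the `SAMPLE` constant are hypothetical and not part of my actual tool.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample mirroring the <doc>/<abstract> input shown above.
SAMPLE = "<doc>\n<abstract>\n  Lorem ipsum\n</abstract>\n</doc>"


def abstract_text(xml_source: str) -> str:
    """Return the abstract with leading, trailing, and repeated whitespace collapsed."""
    root = ET.fromstring(xml_source)
    node = root.find("abstract")
    if node is None:
        return ""
    # itertext() also picks up text inside nested elements;
    # split() + join() drops newlines and runs of spaces.
    return " ".join("".join(node.itertext()).split())
```

With this in place, the value written to the YAML metadata no longer starts or ends with a newline.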
Here's a reduced example that produces the bug:
---
author: John Doe <jdoe@example.com>
title: 'ABC-1234: Lorem ipsum'
abstract: "\n Nullam blandit imperdiet venenatis. Sed efficitur euismod nisi ut varius malesuada.\n"
...
Introduction {#intro}
============
Lorem ipsum dolor sit amet, consectetur adipiscing elit.
Some subsection {#subsection}
---------------
Fusce eget massa risus. Sed dolor risus, posuere vel est eget, pellentesque sagittis risus.
I did a bisect using the release Debian packages and found that it works in 2.8 but not in 2.8.0.1.
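A check like the following could be used to classify each version as good or bad during such a bisect. This is only a sketch: the helper names `metadata_survived` and `run_pandoc` are hypothetical, the blank lines in `REDUCED_EXAMPLE` are an assumption, and the "first line is a bare `---`" test is a heuristic based on the two outputs shown in this report, not an official way to detect the problem.

```python
import subprocess

# Reduced input from this report. The blank lines between blocks are an
# assumption; adjust them to match the real file.
REDUCED_EXAMPLE = r"""---
author: John Doe <jdoe@example.com>
title: 'ABC-1234: Lorem ipsum'
abstract: "\n Nullam blandit imperdiet venenatis. Sed efficitur euismod nisi ut varius malesuada.\n"
...

Introduction {#intro}
============

Lorem ipsum dolor sit amet, consectetur adipiscing elit.

Some subsection {#subsection}
---------------

Fusce eget massa risus. Sed dolor risus, posuere vel est eget, pellentesque sagittis risus.
"""


def metadata_survived(output: str) -> bool:
    """Heuristic: good output opens with a bare '---' line, bad output with a long table rule."""
    for line in output.splitlines():
        if line.strip():
            return line.strip() == "---"
    return False


def run_pandoc(source: str, pandoc: str = "pandoc") -> str:
    """Feed `source` to `pandoc -t markdown -s` on stdin and return its stdout."""
    result = subprocess.run(
        [pandoc, "-t", "markdown", "-s"],
        input=source,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Calling `metadata_survived(run_pandoc(REDUCED_EXAMPLE))` after installing each package should then tell the two cases apart, assuming the outputs look like the ones in this report.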
pandoc 2.8
Installed pandoc-2.8-1-amd64.deb from the releases on Debian 10.
pandoc -t markdown -s output:
---
abstract: Nullam blandit imperdiet venenatis. Sed efficitur euismod nisi
  ut varius malesuada.
author: 'John Doe <jdoe@example.com>'
title: 'ABC-1234: Lorem ipsum'
---
Introduction {#intro}
============
Lorem ipsum dolor sit amet, consectetur adipiscing elit.
Some subsection {#subsection}
---------------
Fusce eget massa risus. Sed dolor risus, posuere vel est eget,
pellentesque sagittis risus.
pandoc 2.8.0.1
Installed pandoc-2.8.0.1-1-amd64.deb from the releases on Debian 10.
Note how the YAML metadata and the first section become a single-cell table.
---------------------------
author: John Doe
<jdoe@example.com> title:
'ABC-1234: Lorem ipsum'
abstract:
"`\n `{=tex}Nullam
blandit imperdiet
venenatis. Sed efficitur
euismod nisi ut varius
malesuada.`\n `{=tex}"
...
Introduction {\#intro}
============
Lorem ipsum dolor sit amet,
consectetur adipiscing
elit.
Some subsection
{\#subsection}
---------------------------
Fusce eget massa risus. Sed dolor risus, posuere vel est eget,
pellentesque sagittis risus.
The pandoc 2.8 output is what I expected. The pandoc 2.8.0.1 output is not what I expected.