parsing bug in raw html from markdown to html #3989
HTML usually is supposed to have the
Pandoc can parse the HTML correctly, as you'll see if you run
`pandoc -f html`. The problem is that the Markdown reader
tries to guess where the HTML tag ends before passing it
off to the HTML reader for parsing. I think we can fix this
easily by passing in the whole input. TagSoup's parser is
lazy, so I don't think this will affect performance, but we'd
have to check.
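To illustrate the kind of mismatch described above, here is a minimal Python sketch. It is not pandoc's code (pandoc is written in Haskell and uses TagSoup); the function names `naive_tag_end` and `quote_aware_tag_end` are hypothetical, and the input is a shortened version of the tag from the report. It only shows how guessing the tag end at the first `>` leaves part of a quoted attribute value behind, which is consistent with the duplicated ` 2nd line of text">` in the reported output.

```python
def naive_tag_end(text: str, start: int = 0) -> int:
    """Return the index just past the first '>' after start, ignoring quotes."""
    return text.index(">", start) + 1


def quote_aware_tag_end(text: str, start: int = 0) -> int:
    """Return the index just past the '>' that closes the tag,
    skipping any '>' that appears inside a quoted attribute value."""
    quote = None  # the quote character we are currently inside, if any
    for i in range(start, len(text)):
        ch = text[i]
        if quote is not None:
            if ch == quote:
                quote = None  # closing quote: back to unquoted state
        elif ch in ('"', "'"):
            quote = ch  # opening quote of an attribute value
        elif ch == ">":
            return i + 1
    raise ValueError("unterminated tag")


# Shortened form of the tag from the report (assumption: fewer attributes).
tag = '<span data-html="true" title="1st line of text <br> 2nd line of text">'
source = tag + "\nHover over me\n</span>"

# Text that each strategy believes comes after the opening tag.
naive_rest = source[naive_tag_end(source):]
aware_rest = source[quote_aware_tag_end(source):]

print(repr(naive_rest))
print(repr(aware_rest))
```

With the naive scan, the leftover text starts in the middle of the `title` value, so that fragment would be treated as ordinary content after the tag. The quote-aware scan stops at the real end of the tag. Passing the whole input to the HTML parser, as suggested above, avoids the need for a separate guess altogether.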
+++ Hao Zhu [Oct 23 17 13:22 ]:
… Say I have a markdown file with the following contents
test.md
<span data-toggle="tooltip" data-placement="right" data-html="true"
title="1st line of text <br> 2nd line of text">
Hover over me
</span>
(I'm copying the contents from [1]https://www.bootply.com/83099)
The HTML tags (<>) in the title attribute cause a bug when I try
to convert this markdown file to HTML using
pandoc -f markdown -t html test.md
I got
<p><span data-toggle="tooltip" data-placement="right" data-html="true"
title="1st line of text <br> 2nd line of text"> 2nd line of text">
Hover over me </span></p>
My pandoc version is 1.19.2.1. Thank you!
References
1. https://www.bootply.com/83099
Turns out we didn't completely fix this before.