Incorrectly decoding encoded HTML tags #106

apjones6 · 2015-08-29T08:49:25Z

With a standard Markdown parser, entity-encoding HTML tags lets you include them as readable text. For example (plain HTML included for comparison):

&lt;iframe src="http://www.w3schools.com"&gt;&lt;/iframe&gt;
<iframe src="http://www.w3schools.com"></iframe>

results in the output HTML:

<p>&lt;iframe src="http://www.w3schools.com"&gt;&lt;/iframe&gt;</p>
<iframe src="http://www.w3schools.com"></iframe>

However, if I put this output HTML through to-markdown, the encoded < and > characters are erroneously decoded. This results in the following Markdown, in which both lines are now raw HTML:

<iframe src="http://www.w3schools.com"></iframe>
<iframe src="http://www.w3schools.com"></iframe>

(whitespace lines removed from examples for brevity)

oliverguenther · 2018-11-08T13:34:45Z

This is a real problem when using turndown as a Markdown converter on input that contains escaped HTML elements: they are incorrectly output as HTML tags. A quick hack to fix this is to ensure that < and > are always re-encoded as entities.
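The quick hack described above could look roughly like the sketch below. The helper name `reencodeAngleBrackets` is hypothetical and is not part of turndown's API; the sketch assumes it would be applied only to the decoded text of text nodes, never to elements that are deliberately kept as raw HTML.

```javascript
// Hypothetical helper (not part of turndown's API): re-encode angle brackets
// in the decoded text of a text node before it is written out as Markdown.
function reencodeAngleBrackets(text) {
  return text.replace(/</g, '&lt;').replace(/>/g, '&gt;');
}

// Decoded text content of the <p> element from the original report.
const decodedText = '<iframe src="http://www.w3schools.com"></iframe>';

const markdown = reencodeAngleBrackets(decodedText);
console.log(markdown);
// &lt;iframe src="http://www.w3schools.com"&gt;&lt;/iframe&gt;
```

This is deliberately blunt. It also encodes harmless cases such as `a > b`, it leaves ampersands untouched, and it would be wrong inside code spans and code blocks, where Markdown does not decode entities.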

bjones1 · 2023-06-02T15:25:39Z

Duplicate issue: #261.


apjones6 pushed a commit to apjones6/to-markdown that referenced this issue Sep 15, 2015

naive fix for mixmark-io#106

e92b90f

karelbilek mentioned this issue Dec 4, 2020

🐛 &lt; and &gt; should not be converted to < and > JohannesKaufmann/html-to-markdown#30

Closed

bjones1 added a commit to bjones1/turndown that referenced this issue Jun 2, 2023

Fix: correctly escape text that would otherwise be interpreted as raw HTML and HTML blocks. Closes mixmark-io#106 Closes mixmark-io#261

33c2bbb

bjones1 linked a pull request Jun 2, 2023 that will close this issue

Fix: correctly escape text that would otherwise be interpreted as raw HTML and HTML blocks. #438

Open

bjones1 added a commit to bjones1/turndown that referenced this issue Aug 23, 2024

Fix: correctly escape text that would otherwise be interpreted as raw HTML and HTML blocks. Closes mixmark-io#106 Closes mixmark-io#261

f060c41
