Problem with HTML entity parsing #206

jkphl · 2015-11-16T20:31:55Z

The parser seems to have a problem with HTML entity detection / handling. Example:

$environment = \League\CommonMark\Environment::createCommonMarkEnvironment();
$parser = new \League\CommonMark\DocParser($environment);
$renderer = new \League\CommonMark\HtmlRenderer($environment);

echo $renderer->renderBlock($parser->parse(
    "If you want to write about 'AT&T', you need to write '`AT&amp;T`'."
));

results in

<p>If you want to write about 'AT&amp;T`'.</p>

Half of the sentence is swallowed, probably because the parser interprets the first &T as the start of an (invalid) HTML entity, I guess‽


colinodell · 2015-11-16T22:45:39Z

This is likely a duplicate of #192. Basically the EntityParser would "eat" everything between the first & and the last HTML entity.

Version 0.12 contains a fix for this issue, so please upgrade to this version. (This Babelmark 2 test shows that 0.12 behaves as expected given your input).
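
To illustrate the kind of over-matching described above, here is a minimal sketch in plain PHP. It is not the library's actual EntityParser code: both patterns and all variable names are hypothetical and chosen only to show how a too-permissive entity pattern can span from a stray & to a later, valid entity, while a stricter pattern matches only the real entity.

```php
<?php
// Hypothetical illustration only; NOT the actual league/commonmark implementation.
$input = "write about 'AT&T', you need to write '`AT&amp;T`'.";

// Too permissive (assumed for illustration): anything from an ampersand
// up to the next semicolon counts as an "entity".
$loosePattern = '/&.*?;/';
preg_match($loosePattern, $input, $looseMatch);
// $looseMatch[0] spans from the first & through the end of "&amp;",
// so all the text in between would be consumed.

// Stricter (also an assumption): an ampersand must be followed directly by
// a name made of letters/digits, or by a numeric reference, then a semicolon.
$strictPattern = '/&(?:#[xX][0-9a-fA-F]{1,8}|#[0-9]{1,8}|[a-zA-Z][a-zA-Z0-9]{1,31});/';
preg_match($strictPattern, $input, $strictMatch);
// $strictMatch[0] is just "&amp;"; the "&T" in "AT&T" is left alone.

echo $looseMatch[0], PHP_EOL;  // &T', you need to write '`AT&amp;
echo $strictMatch[0], PHP_EOL; // &amp;
```

With the stricter pattern, the ampersand in AT&T would stay literal text and be escaped on output, which matches the behavior reported for 0.12.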

jkphl · 2015-11-17T08:19:45Z

You're right, updating to 0.12 fixed it — I somehow missed that latest release. Thank you! :)

colinodell closed this as completed Nov 16, 2015
