Markdown: Added support for nested bold and italic tokens #1897

RunDevelopment · 2019-05-18T19:07:51Z

This PR adds support for nested bold and italic tokens.

This also resolves #1849.

Example: before and after screenshots (images not preserved)

ghost

bold and italic share nearly identical regular expressions
bold, italic, and strike share nearly the same content
pandoc's markdown defines ::: as well, for classes
pandoc's markdown supports { } after various elements, also for classes
not immediately obvious why token.content[2] isn't used/defined (line 196)
don't see any tests for ** bold with _ italic inside
don't see any tests for __ bold with * italic inside
is bold/italic/strike a possibility?

RunDevelopment · 2019-06-24T13:51:21Z

@DaveJarvis Thank you for taking the time to review this PR!

bold and italic share nearly identical regular expressions
bold, italic, and strike share nearly the same content

The new createInline function should address both issues.
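As a rough illustration of how a shared pattern factory can remove the duplication between bold, italic, and strike, here is a minimal sketch. It is not the code from this PR: the name `createInlineSketch`, the `<inner>` placeholder, and the simplified `INNER` content pattern are all assumptions made for the example, and the context guards a real highlighter needs (escapes before the delimiter, nesting) are left out.

```javascript
// Hypothetical sketch; not the actual Prism source.
// Content shared by every inline token: an escaped character,
// or any character that is not a backslash or a line break.
const INNER = /(?:\\.|[^\\\n\r])/.source;

/**
 * Builds a RegExp from a template written with `_` delimiters.
 * `<inner>` is replaced by the shared content pattern. If
 * `addStarVariant` is true, a second alternative is appended in
 * which every `_` is swapped for an escaped `*`.
 */
function createInlineSketch(template, addStarVariant) {
  let source = template.replace(/<inner>/g, INNER);
  if (addStarVariant) {
    source = source + '|' + source.replace(/_/g, '\\*');
  }
  return new RegExp('(?:' + source + ')');
}

// One template per token kind instead of near-identical regexes.
const bold = createInlineSketch('__(?:(?!_)<inner>)+__', true);
const italic = createInlineSketch('_(?:(?!_)<inner>)+_', true);
const strike = createInlineSketch('~~(?:(?!~)<inner>)+~~', false);

console.log(bold.test('**bold**'), bold.test('__bold__'));
console.log(italic.test('_italic_'), strike.test('~~gone~~'));
```

The point of the design is that the delimiter is the only thing that differs between the three templates, so the shared content pattern lives in one place.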

pandoc's markdown defines ::: as well, for classes
pandoc's markdown supports { } after various elements, also for classes

I don't think we need to support a parser-specific feature. If it's a commonly used feature, then sure we'll add it.
But you might want to open a new issue for this or add it to #1558 as this isn't within the scope of this PR.

not immediately obvious why token.content[2] isn't used/defined (line 196)

I added a comment explaining the structure of token.content.

don't see any tests for ** bold with _ italic inside
don't see any tests for __ bold with * italic inside

Added.

is bold/italic/strike a possibility?

Yes, you can nest bold, italic, and strike, or is there any issue with it?
Supporting nested bold and italic tokens was difficult because both can use the same character as their delimiter. This makes parsing hard and error-prone (even actual Markdown parsers sometimes get it wrong). Because strike uses a different character, there shouldn't be an issue, or is there?
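To make the shared-delimiter problem concrete, here is a small sketch with a deliberately naive bold pattern. The regex is an assumption for illustration only and is not the pattern used in this PR.

```javascript
// A deliberately naive bold pattern: "**", lazy content, "**".
// This is NOT Prism's pattern; it only illustrates the ambiguity.
const naiveBold = /\*\*(.+?)\*\*/;

// Bold text that ends with a nested italic span, so three
// asterisks appear in a row at the end of the input.
const input = '**bold *italic***';
const match = naiveBold.exec(input);

// The lazy quantifier stops at the first "**" it can find, which
// uses up the italic's closing "*" as part of the bold delimiter.
console.log(match[0]);                        // **bold *italic**
console.log(match[1]);                        // bold *italic
console.log(input.slice(match[0].length));    // *
```

The captured content contains an unclosed `*`, and a stray `*` is left over after the match. With `~~` as the delimiter this cannot happen, because the nested bold or italic delimiters never share a character with it.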

ghost · 2019-06-24T17:34:00Z

If it's a commonly used feature, then sure we'll add it.

Available since Oct 2017 (jgm/pandoc#168), so it's gaining in usage and popularity.

Yes, you can nest bold, italic, and strike, or is there any issue with it?

I didn't see any specific unit tests that go through all the combinations, but maybe they aren't necessary.

RunDevelopment · 2019-06-24T17:49:26Z

Strike uses different delimiters than bold and italic, so it's generally less problematic. I think the current tests should be sufficient.

ghost

The unit test blocks are comprehensive, though a little unwieldy. Not sure if it would be worthwhile to write a recursive function to iterate over an array of inline elements. Something like:

generate_inline_test( [ '**', '__' ] )
generate_inline_test( [ '__', '**' ] )
generate_inline_test( [ '*', '__', '~~' ] )

If so, once the generate_inline_test function is created, it would be possible to call it programmatically to generate all possible combinations and permutations based on a single array of inline elements (e.g., ['**', '__', '~~', '*', '_', ...]).
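A minimal sketch of what such a generator might look like is below. `generateInlineTest` is a hypothetical camelCase stand-in for the `generate_inline_test` named above, and `permutations` is a helper invented for this example. The sketch only builds the nested Markdown input; producing the expected token stream for each case is left out.

```javascript
// Hypothetical sketch of the proposed generator; not part of this PR.

/**
 * Wraps `text` in the given delimiters, outermost first.
 * generateInlineTest(['**', '__']) yields '**__text__**'.
 */
function generateInlineTest(delimiters, text = 'text') {
  // Start from the innermost delimiter and wrap outwards.
  return delimiters.reduceRight(
    (inner, delimiter) => delimiter + inner + delimiter,
    text
  );
}

/** Returns every ordering of `items` (assumed helper). */
function permutations(items) {
  if (items.length <= 1) return [items.slice()];
  return items.flatMap((item, i) =>
    permutations([...items.slice(0, i), ...items.slice(i + 1)])
      .map(rest => [item, ...rest])
  );
}

console.log(generateInlineTest(['**', '__']));      // **__text__**
console.log(generateInlineTest(['*', '__', '~~'])); // *__~~text~~__*

// Generate one input per ordering of a single delimiter array.
const inputs = permutations(['**', '_', '~~']).map(p => generateInlineTest(p));
console.log(inputs.length);                         // 6
```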

RunDevelopment · 2019-06-24T20:40:51Z

Regarding the combinations: yes, we could write such a function, but I don't think we have to. Looking at the regular expression, it's enough to test the nested tokens one level deep. It doesn't make sense to nest, for example, bold inside bold, which makes things simpler. That gives us 6 * 4 combinations, all of which are covered by the bold, italic, and strike tests in this PR.
More extensive tests might be necessary in the future, but for now I think it's fine as is.
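The 6 * 4 figure can be checked by enumeration. The sketch below assumes two delimiters per token kind (including a single `~` for strike); that delimiter set is an assumption chosen to be consistent with the count above and is not taken from the PR itself.

```javascript
// Assumed delimiter set: two delimiters for each of the three kinds.
const delimitersByKind = {
  bold: ['**', '__'],
  italic: ['*', '_'],
  strike: ['~~', '~'],
};

// Collect every (outer, inner) pair whose kinds differ, because
// nesting a kind inside itself (e.g. bold in bold) is not tested.
const pairs = [];
for (const [outerKind, outers] of Object.entries(delimitersByKind)) {
  for (const [innerKind, inners] of Object.entries(delimitersByKind)) {
    if (outerKind === innerKind) continue;
    for (const outer of outers) {
      for (const inner of inners) {
        pairs.push({ outer, inner });
      }
    }
  }
}

// 6 outer delimiters, each with 4 possible inner delimiters.
console.log(pairs.length); // 24
```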

RunDevelopment · 2019-06-24T20:45:11Z

@DaveJarvis Again, thank you for reviewing!

Added support for nested bold and italic tokens

03cf85b

RunDevelopment added the enhancement, language-definitions, and needs review labels May 18, 2019

RunDevelopment mentioned this pull request May 18, 2019

Markdown TODOs #1558

Closed

6 tasks

ghost approved these changes Jun 24, 2019

RunDevelopment added 3 commits June 24, 2019 14:47

Improved tests

235fce7

More strike tests

986c9de

Improved inline pattern creation + more comments

9c75843

Rebuilt Prism

84b9651

ghost approved these changes Jun 24, 2019

RunDevelopment merged commit 1190372 into PrismJS:master Jun 24, 2019

RunDevelopment deleted the markdown_nested_bold_italic branch June 24, 2019 20:44
