Extract each bullet point in Markdown nested bullet lists into its own string #131

intrigeri · 2018-06-17T08:24:26Z

A few months ago I've mentioned "(vague) plans for other improvements in Markdown bullet list handling" I've mentioned a few months ago on #7 (comment). The design discussion there was resolved and #7 was rightfully closed so it's time to follow-up with this.

Single-level Markdown bullet lists are handled properly but as soon as one adds a second, nested level, string extraction starts to get suboptimal. Our translators at Tails have complained about the additional work it causes for them e.g. when strings become fuzzy.

I've pushed a test case to https://github.com/intrigeri/po4a/tree/mdwn-nested-bullet-lists that demonstrates the problem.

This will address github's issue #131.

toddy15 · 2018-06-17T10:09:58Z

Ah, I see. Thanks for the follow-up, I've added a testcase for this.

intrigeri · 2018-07-01T10:13:42Z

@toddy15 AFAICT the test case you added demonstrates that the current code indeed has the bug this ticket is about, i.e. "Nested item 1" and "Nested item 2" are not extracted into their own string. So I'm not surprised it passes. While the test case I've proposed instead fails, in order demonstrate that the code does not behave as it should. So I believe that "This will address github's issue #131" is incorrect: commit 5658ace does not address this issue, it merely confirms it in its own way. Are we on the same page?

toddy15 · 2018-07-01T15:59:20Z

Yes, we are on the same page. The commit message is a bit misleading, I admit. What I meant was that this will address the issue, not that it solved the issue. However, adding a test case that fails would render the Travis CI useless, so I didn't do it.

It might be better to create a test case with the expected output but disable the specific test for now.

intrigeri · 2018-07-01T16:03:40Z

It might be better to create a test case with the expected output but disable the specific test for now.

Agreed :)

toddy15 · 2018-07-01T23:00:14Z

The test is now failing as expected, but the testsuite still passes because it's marked as TODO. Quite a nice feature of Test::More.

bexelbie · 2018-10-02T12:26:18Z

I've recently fixed a similar bug in Asciidoc, #149, that is still pending PR. I wonder if a similar patch would work for markdown.

eighthave · 2020-03-20T14:32:27Z

This seems to be solved in 0.57 at least. I'm seeing these metadata entries in the .po files:

#. type: Bullet: ' - '
#. type: Bullet: ' * '
#. type: Bullet: ' * '
#. type: Bullet: '* '

intrigeri · 2020-03-21T10:47:42Z

Hi, Hans-Christoph Steiner (2020-03-20):

This seems to be solved in 0.57 at least. I'm seeing these metadata entries in the .po files: * `#. type: Bullet: ' - '` * `#. type: Bullet: ' * '` * `#. type: Bullet: ' * '` * `#. type: Bullet: '* '`

Interesting! Unfortunately my results differ: - With 0.57, I still don't see nested bullet lists extracted separately. - On current Git master, if I enable the `MarkDownNestedLists` test in `t/20-text.t`, I still see it fail as expected. Are you sure the success you're seeing is with _nested_ bullet lists?

mquinson · 2020-05-24T21:57:05Z

Here is another regex-based parser of Markdown. We may find some interesting bits in there: https://github.com/microsoft/vscode-markdown-tm-grammar/blob/master/markdown.tmLanguage.base.yaml

Fixes mquinson#131

toddy15 added a commit that referenced this issue Jun 17, 2018

Add testcase for nested markdown lists.

5658ace

This will address github's issue #131.

toddy15 added a commit that referenced this issue Jul 1, 2018

Change .po file to actual expected output, see #131

8cae726

terceiro added a commit to terceiro/po4a that referenced this issue May 31, 2020

Locale::Po4a::Test: supported nested lists

57985b3

Fixes mquinson#131

terceiro mentioned this issue May 31, 2020

Locale::Po4a::Text: supported nested lists #244

Merged

terceiro added a commit to terceiro/po4a that referenced this issue May 31, 2020

Locale::Po4a::Text: supported nested lists

4436122

Fixes mquinson#131

mquinson closed this as completed in #244 May 31, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract each bullet point in Markdown nested bullet lists into its own string #131

Extract each bullet point in Markdown nested bullet lists into its own string #131

intrigeri commented Jun 17, 2018

toddy15 commented Jun 17, 2018

intrigeri commented Jul 1, 2018

toddy15 commented Jul 1, 2018

intrigeri commented Jul 1, 2018 via email

toddy15 commented Jul 1, 2018

bexelbie commented Oct 2, 2018

eighthave commented Mar 20, 2020

intrigeri commented Mar 21, 2020 via email

mquinson commented May 24, 2020

Extract each bullet point in Markdown nested bullet lists into its own string #131

Extract each bullet point in Markdown nested bullet lists into its own string #131

Comments

intrigeri commented Jun 17, 2018

toddy15 commented Jun 17, 2018

intrigeri commented Jul 1, 2018

toddy15 commented Jul 1, 2018

intrigeri commented Jul 1, 2018 via email

toddy15 commented Jul 1, 2018

bexelbie commented Oct 2, 2018

eighthave commented Mar 20, 2020

intrigeri commented Mar 21, 2020 via email

mquinson commented May 24, 2020