misc: share markdown parsing in collect-strings and the report #9514

brendankenny · 2019-08-05T20:03:29Z

duplicated markdown parsing in dom.js for the report and collect-strings.js for adding placeholders for ctc files (which will eventually be reconstituted and then passed through dom.js methods) is a little risky, so the first commit here makes them share a splitting implementation in util.js.

There are no changes to the collected locale strings or the existing dom and collect-strings unit tests.

...but then I couldn't resist making an interface that's easier to understand (no more mysterious empty string for preambleText but undefined for linkText and linkHref). This is isolated to the second commit, so if it seems excessive I can drop it :)

brendankenny · 2019-08-05T20:05:02Z

(a bit easier to see at https://github.com/GoogleChrome/lighthouse/pull/9514/files?w=1, most added lines are the expanded tests)

brendankenny · 2019-08-05T20:06:37Z

lighthouse-core/report/html/renderer/util.js

+    const parts = text.split(/\[([^\]]+?)\]\((https?:\/\/.*?)\)/g);
+    while (parts.length) {
+      // Pop off the same number of elements as there are capture groups.
+      const [preambleText, linkText, linkHref] = parts.splice(0, 3);


I tried rewriting this with i % 3 and have 0 and 1 cases, where the 1 case also pops off parts[i + 1], but it just got harder to follow, so stuck with this internally

brendankenny · 2019-08-05T20:11:41Z

lighthouse-core/test/report/html/renderer/util-test.js

 /* global URL */

 describe('util helpers', () => {
-  let origConsoleWarn;


needed for old displayValue handling (#5099) but no need after #6767 and missed for removal in #7628

brendankenny · 2019-08-05T20:14:05Z

lighthouse-core/scripts/i18n/collect-strings.js

+  if (message.match(/\[.*\] \(.*\)/)) {
+    throw Error(`Bad Link spacing in message "${message}"`);
+  }
+  // * [](empty link text)


@exterkamp and I discussed this empty linkText case and decided while maybe there's some vague possibility of a need for invisible links at some point, it's almost certainly an accident today, so let's alert the author

patrickhulce

LGTM

patrickhulce · 2019-08-05T22:07:32Z

lighthouse-core/report/html/renderer/util.js

+
+    const parts = text.split(/\[([^\]]+?)\]\((https?:\/\/.*?)\)/g);
+    while (parts.length) {
+      // Pop off the same number of elements as there are capture groups.


Suggested change

// Pop off the same number of elements as there are capture groups.

// Shift off the same number of elements as there are capture groups.

;)

largely a pedantic suggestion and pre-existing, so can reject if you'd like

patrickhulce · 2019-08-05T22:09:54Z

lighthouse-core/report/html/renderer/util.js

+      // Pop off the same number of elements as there are capture groups.
+      const [preambleText, linkText, linkHref] = parts.splice(0, 3);
+
+      if (preambleText) { // Empty plain text is an artifact of splitting, not meaningful.


the empty explanation feels a bit weird here, maybe invert it or at least say "We can skip empty b/c ..."?

patrickhulce · 2019-08-05T22:11:43Z

lighthouse-core/test/report/html/renderer/util-test.js

+      ]);
+    });
+
+    it('splits on backticked code at the emd of the string', () => {


Suggested change

it('splits on backticked code at the emd of the string', () => {

it('splits on backticked code at the end of the string', () => {

patrickhulce · 2019-08-05T22:13:13Z

lighthouse-core/test/report/html/renderer/util-test.js

+    });
+  });
+
+  describe('#splitMarkdownLink', () => {


love these ❤️

exterkamp

LGTM. Like the collection of concerns into 1 file and all the new tests!

exterkamp · 2019-08-06T18:29:23Z

lighthouse-core/report/html/renderer/util.js

+   * into segments that were enclosed in backticks (marked as `isCode === true`)
+   * and those that outside the backticks (`isCode === false`).
+   * @param {string} text
+   * @return {Array<{isCode: true, codeText: string}|{isCode: false, plainText: string}>}


It seems weird to me to include a boolean flag and use different text fields. Makes more sense to me to have {isCode: boolean, text: string}? Is this less canonical for js?

We had a massive discussion about this very thing when discussing the shape of the proto and the hardened LHR API.

I 100% agree this feels anti-JS :)

OTOH, it's for internal use only, never goes over any wire, and we never really need to use just the text, so I don't have a strong reason to object yet.

haha everyone lives with "preambleText" (does code typically come with a preamble?) including preambleText that occurs after the content and may not have any text in it, but I try to make self describing property names and everyone loses their minds :P

I think there is value in API safety like this (in this case tsc requires the isCode discriminator to be checked for the consuming code to be able to know if it can use plainText or codeText) but as you say it's internal only, for a minor feature, and I was really just playing around with the interface (not sure I really like it), so I can switch back to text or whatever :)

exterkamp · 2019-08-06T18:33:13Z

lighthouse-core/report/html/renderer/util.js

+   * `isLink === false`), and segments with text content and a URL that did make
+   * up a link (marked as `isLink === true`).
+   * @param {string} text
+   * @return {Array<{isLink: true, linkText: string, linkHref: string}|{isLink: false, plainText: string}>}


Same thing about this. Seems like isLink with text would be sufficient.

exterkamp · 2019-08-06T18:48:41Z

lighthouse-core/test/report/html/renderer/util-test.js

+      ]);
+    });
+
+    it('handles text only within backticks', () => {


I was also curious for backticks separated by only a space if that would maintain the space e.g.

"`first code` `second code`"

...and that works as well. Feel free to add it as a test if that seems significant.

Feel free to add it as a test if that seems significant.

sure, no problem

patrickhulce · 2019-08-07T00:27:23Z

grumble grumble feedback

😆🤣😆🤣😆🤣

brendankenny added 3 commits August 2, 2019 17:53

misc: share markdown parsing in collect-strings and the report

b52ae97

update markdown splitting interface

e2b102e

don't allow empty link text

1860f8c

brendankenny requested a review from paulirish as a code owner August 5, 2019 20:03

googlebot added the cla: yes label Aug 5, 2019

brendankenny commented Aug 5, 2019

View reviewed changes

whoops, delete

7953ee3

patrickhulce approved these changes Aug 5, 2019

View reviewed changes

exterkamp approved these changes Aug 6, 2019

View reviewed changes

grumble grumble feedback

e09a47c

vercel bot deployed to staging August 7, 2019 00:07 View deployment

brendankenny merged commit f72dc06 into master Aug 7, 2019

brendankenny deleted the sharemarkdown2 branch August 7, 2019 22:20

paulirish pushed a commit that referenced this pull request Nov 6, 2019

misc: share markdown parsing in collect-strings and the report (#9514)

ecb729e

snyk-bot mentioned this pull request Mar 21, 2020

[Snyk] Upgrade lighthouse from 5.1.0 to 5.6.0 godaddy/lighthouse4u#13

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

misc: share markdown parsing in collect-strings and the report #9514

misc: share markdown parsing in collect-strings and the report #9514

brendankenny commented Aug 5, 2019 •

edited

Loading

brendankenny commented Aug 5, 2019

brendankenny Aug 5, 2019

brendankenny Aug 5, 2019

brendankenny Aug 5, 2019

patrickhulce left a comment

patrickhulce Aug 5, 2019

patrickhulce Aug 5, 2019

patrickhulce Aug 5, 2019

patrickhulce Aug 5, 2019

exterkamp left a comment

exterkamp Aug 6, 2019

patrickhulce Aug 6, 2019

brendankenny Aug 6, 2019

exterkamp Aug 6, 2019

exterkamp Aug 6, 2019

brendankenny Aug 7, 2019

patrickhulce commented Aug 7, 2019

	// Pop off the same number of elements as there are capture groups.
	// Shift off the same number of elements as there are capture groups.

	it('splits on backticked code at the emd of the string', () => {
	it('splits on backticked code at the end of the string', () => {

misc: share markdown parsing in collect-strings and the report #9514

misc: share markdown parsing in collect-strings and the report #9514

Conversation

brendankenny commented Aug 5, 2019 • edited Loading

brendankenny commented Aug 5, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

patrickhulce left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

exterkamp left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

patrickhulce commented Aug 7, 2019

brendankenny commented Aug 5, 2019 •

edited

Loading