Make assert_dom_equal ignore insignificant whitespace when walking the node tree #84

jduff · 2020-05-20T00:57:51Z

Yet another take on fixing #62

Prior art: #71, #66 and #83

This version ignores whitespace while walking the node tree instead of trying to pre-process or clean the HTML strings. I also included a strict option (default: false) that keeps the assertion sensitive to whitespace.

I'm not a fan of passing strict through the method calls but wasn't sure of a better approach.

Let me know what you think and if there are any changes I should make!
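As a rough illustration of the approach described above, the sketch below walks two small trees and skips whitespace-only text nodes unless strict is set. It is not the PR's code: Node, text_node, element_node, and same_dom? are hypothetical stand-ins for Nokogiri nodes and the library's private helpers, and only extract_children borrows its name from the diff.

```ruby
# Hypothetical stand-in for a parsed DOM node; the real code walks Nokogiri nodes.
Node = Struct.new(:name, :text, :children) do
  def text?
    name == :text
  end
end

# Hypothetical constructors for building small trees by hand.
def text_node(str)
  Node.new(:text, str, [])
end

def element_node(name, *children)
  Node.new(name, nil, children)
end

# Drop whitespace-only text nodes unless a strict comparison was requested.
def extract_children(node, strict)
  return node.children if strict
  node.children.reject { |child| child.text? && child.text.strip.empty? }
end

# Walk both trees in parallel; strict is threaded through every call.
def same_dom?(a, b, strict)
  return false unless a.name == b.name
  if a.text?
    return strict ? a.text == b.text : a.text.strip == b.text.strip
  end

  a_children = extract_children(a, strict)
  b_children = extract_children(b, strict)
  return false unless a_children.size == b_children.size

  a_children.zip(b_children).all? { |x, y| same_dom?(x, y, strict) }
end

compact  = element_node(:div, element_node(:p, text_node("hello")))
indented = element_node(:div,
                        text_node("\n  "),
                        element_node(:p, text_node(" hello ")),
                        text_node("\n"))

puts same_dom?(compact, indented, false) # whitespace-only nodes are skipped
puts same_dom?(compact, indented, true)  # strict mode keeps them
```

Threading strict through each call mirrors the concern raised above; an alternative would be to hold it in an object that performs the walk, at the cost of more structure.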

chancancode · 2020-06-17T19:50:02Z

lib/rails/dom/testing/assertions/dom_assertions.rb

                child.to_s == other_child.to_s
+              else
+                child.to_s.strip == other_child.to_s.strip


Instead of (or in addition to) strip, I think we need some kind of regex that collapses interior whitespace as well. For example:

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.Curabitur pretium tincidunt lacus. Nulla gravida orci a odio. Nullam varius, turpis et commodo pharetra, est eros bibendum elit, nec luctus magna felis sollicitudin mauris. Integer in mauris eu nibh euismod gravida. Duis ac tellus et risus vulputate vehicula. Donec lobortis risus a elit. Etiam tempor. Ut ullamcorper, ligula eu tempor congue, eros est euismod turpis, id tincidunt sapien risus a quam. Maecenas fermentum consequat mi. Donec fermentum. Pellentesque malesuada nulla a mi. Duis sapien sem, aliquet nec, commodo eget, consequat quis, neque. Aliquam faucibus, elit ut dictum aliquet, felis nisl adipiscing sapien, sed malesuada diam lacus eget erat. Cras mollis scelerisque nunc. Nullam arcu. Aliquam consequat. Curabitur augue lorem, dapibus quis, laoreet et, pretium ac, nisi. Aenean magna nisl, mollis quis, molestie eu, feugiat in, orci. In hac habitasse platea dictumst.

vs

 Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum. Curabitur pretium tincidunt lacus. Nulla gravida orci a odio. Nullam varius, turpis et commodo pharetra, est eros bibendum elit, nec luctus magna felis sollicitudin mauris. Integer in mauris eu nibh euismod gravida. Duis ac tellus et risus vulputate vehicula. Donec lobortis risus a elit. Etiam tempor. Ut ullamcorper, ligula eu tempor congue, eros est euismod turpis, id tincidunt sapien risus a quam. Maecenas fermentum consequat mi. Donec fermentum. Pellentesque malesuada nulla a mi. Duis sapien sem, aliquet nec, commodo eget, consequat quis, neque. Aliquam faucibus, elit ut dictum aliquet, felis nisl adipiscing sapien, sed malesuada diam lacus eget erat. Cras mollis scelerisque nunc. Nullam arcu. Aliquam consequat. Curabitur augue lorem, dapibus quis, laoreet et, pretium ac, nisi. Aenean magna nisl, mollis quis, molestie eu, feugiat in, orci. In hac habitasse platea dictumst.

jduff · 2020-06-18 (reply)

I added a test case for this and solved it by using split instead of strip on both strings.
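A minimal sketch of why split helps where strip does not, using plain strings rather than the PR's actual test case. The helper name equivalent_text? is hypothetical.

```ruby
# Hypothetical helper: compare two text-node strings while ignoring leading,
# trailing, and repeated interior whitespace.
def equivalent_text?(a, b)
  # String#split with no arguments splits on runs of whitespace and
  # discards leading and trailing whitespace.
  a.split == b.split
end

one_line  = "Lorem ipsum dolor sit amet"
multiline = "  Lorem ipsum\n    dolor   sit amet\n"

puts one_line.strip == multiline.strip     # strip only trims the ends
puts equivalent_text?(one_line, multiline) # split also collapses interior runs
```

Note that this only ignores differences in the amount of whitespace between words; "a b" and "ab" still compare as different.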

chancancode · 2020-06-17T19:53:30Z

lib/rails/dom/testing/assertions/dom_assertions.rb

+            def compare_doms(expected, actual, strict)
+              expected_children = extract_children(expected, strict)
+              actual_children   = extract_children(actual, strict)
+              return false unless expected_children.size == actual_children.size


This assumes that there can't be adjacent text nodes, which is generally not a correct assumption. I think in our use case it's probably a reasonable one, but I am not familiar enough with the guarantees of the Nokogiri parsing algorithm to know for sure. There may also be ways for us to cause this to happen, though I could not think of an example.

I don't think there is a strong reason to change this absent a failing test case, but I thought I should point this out explicitly.

jduff · 2020-06-18 (reply)

Unless I'm misunderstanding what you're referring to, adjacent text nodes should be fine; there just needs to be the same number of nodes on both sides.

I suppose it would be possible for "one two three" to be split into one to three text nodes, but I doubt that would happen for the same strings in the same process. If we do end up with a different number of text nodes, I think there would have to be differences in the markup to cause it, and failing would be correct.

chancancode · 2020-06-18 (reply)

Yeah, that's what I meant. I'm not sure if we can guarantee that Nokogiri won't split text nodes, or that our code wouldn't cause it to do that. For example, if we decide to ignore HTML comments, then this would not work, because the comment may have split one text node into two. But we can find out and iterate on that another time.
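To make the comment-splitting concern concrete, here is a hedged sketch of one possible follow-up, not something this PR does: after dropping comment nodes, merge any text nodes that became adjacent so the child counts line up again. FakeNode and normalize_children are hypothetical names and do not reflect Nokogiri's API.

```ruby
# Hypothetical minimal node type; not Nokogiri's API.
FakeNode = Struct.new(:kind, :content)

# Drop comment nodes, then merge runs of adjacent text nodes into one node.
def normalize_children(children)
  children.reject { |n| n.kind == :comment }.each_with_object([]) do |node, merged|
    if node.kind == :text && merged.last && merged.last.kind == :text
      merged[-1] = FakeNode.new(:text, merged.last.content + node.content)
    else
      merged << node
    end
  end
end

plain = [FakeNode.new(:text, "one two three")]
with_comment = [
  FakeNode.new(:text, "one "),
  FakeNode.new(:comment, "note"),
  FakeNode.new(:text, "two three")
]

# Simply removing the comment leaves two text nodes against one.
puts with_comment.reject { |n| n.kind == :comment }.size
# Merging adjacent text nodes restores a like-for-like comparison.
puts normalize_children(with_comment) == plain
```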

chancancode · 2020-06-17T19:54:59Z

I think this is a pretty solid attempt and the best one we've had so far. I think once we fix the interior whitespace issue, it seems good to land this and iterate further when there are bug reports. Thanks for working on this!

jduff · 2020-06-18T14:08:32Z

@chancancode thanks for taking a look! I just updated this PR with a test and fix for the issue with interior whitespace you mentioned. If there is anything else let me know.

chancancode · 2020-06-18T16:04:53Z

Urgh, it seems the CI config is super old, and Ruby 1.9 does not support keyword arguments. You can either change the method to take an options hash to avoid the syntax error, or fix the CI config to drop support for EOL'ed Rubies and Rails and add matrix rows for newer Rubies and Rails, in which case we can cut a breaking-change release after landing this.
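For illustration, the two options for accepting strict look roughly like this. The method names are hypothetical and compare plain strings rather than DOM trees; this is not the library's actual signature.

```ruby
# Keyword-argument form: a syntax error on Ruby 1.9, accepted on Ruby 2.0 and later.
def text_equal_kw?(expected, actual, strict: false)
  strict ? expected == actual : expected.split == actual.split
end

# Options-hash form: also parses on Ruby 1.9 when called with hash-rocket syntax.
def text_equal_opts?(expected, actual, options = {})
  strict = options.fetch(:strict, false)
  strict ? expected == actual : expected.split == actual.split
end

puts text_equal_kw?("a  b", "a b")                      # non-strict by default
puts text_equal_kw?("a  b", "a b", strict: true)        # strict via keyword
puts text_equal_opts?("a  b", "a b", :strict => true)   # strict via options hash
```

Both forms can be called the same way on modern Rubies, so switching to keyword arguments later would mostly affect which Ruby versions can load the file.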

jduff · 2020-06-18T18:42:38Z

@chancancode instead of lumping it all together I created another PR updating the test matrix #86

kaspth · 2020-06-18T21:27:58Z

I don't have much overhead for intricate reviews right now. @chancancode if you want to help carry this, I'll happily merge 🙏

chancancode · 2020-06-18T21:29:13Z

I think this is basically good to go if we merge the CI fix and rebase this on top. We will have to make a new breaking release afterwards because we are dropping old Ruby versions.

chancancode · 2020-06-18T21:29:35Z

However, I apparently can’t merge 😛

jduff · 2020-06-18T21:45:35Z

I just pushed up a rebase!

chancancode · 2020-06-18T23:42:48Z

@kaspth This seems good to me. If you can look into getting my commit/publish bit back, I can help with the merge/release as well. I assume it got lost when the GH teams were shuffled, since I still have the rails/rails one, and this seems much lower stakes than that. I also don't mind not having it if you want to do the merge/publish.

@jduff Since we stopped testing Rails 4.2 on CI, we should probably bump this to require the versions we are testing. It doesn't have to be part of this PR, but it does need to be changed before we release.

kaspth · 2020-06-30T19:54:23Z

@chancancode done, appreciate the help 👍 — and thank you for the PR, @jduff 🙌

jduff · 2020-07-08T18:46:01Z

Thanks for helping see this through! I've been using master in one of my projects and it's working great 👌

jduff mentioned this pull request Jun 16, 2020

make assert_dom_equal ignore insignificant whitespace #83

Closed

chancancode reviewed Jun 17, 2020

jduff added 2 commits June 18, 2020 17:44

make assert_dom_equal ignore insignificant whitespace

a26828a

Handle differences in interior whitespace

8b074b9

jduff force-pushed the ignore_whitespace_2 branch from 5f3f093 to 8b074b9 on June 18, 2020 21:45

chancancode approved these changes Jun 19, 2020

chancancode merged commit 47506c4 into rails:master Jul 1, 2020

This was referenced Jul 8, 2020

Make assert_dom_equal ignore insignificant whitespace #71

Closed

Update dom_assertions.rb #66

Closed

jduff mentioned this pull request Jul 21, 2020

Slots return stripped HTML ViewComponent/view_component#414

Merged

jyeharry mentioned this pull request Feb 28, 2025

assert_dom should ignore whitespace just like assert_dom_equal #121

Closed
