Import Sorter to cope with multi-line comments and misplaced imports #112

fvgh · 2017-05-17T04:15:20Z

The change shall enhance the robustness of the java.ImportOrderStep step.
I admit that I basically need it to use it on Groovy code (see #13), but it is also an issue in the java world:

If you comment out imports, the java.ImportOrderStep step will ignore the multi-line comment and use these imports in its sorted list
If you have misplaced imports (still working on code, or using groovy), the java.ImportOrderStep will (delete all code in between)[java.ImportOrderStep].

Two improvements:

The ImportSorter stops looking for import statements, as soon as it finds the beginning of a scope.
The ImportSorter gets a rough idea what multi-line comments are, so that it does not look for tokens ({, import) within multi-line comments.

…laced imports

nedtwigg · 2017-05-17T05:25:20Z

I looked at this briefly. The modifications look good, but the original code is kinda spaghetti so I'm not sure if it introduced a subtle bug or not (maybe even fixed some, who knows!).

For code like this, it earns my trust by simple mileage - run it on a ton of code and make sure nothing breaks. Once this gets merged to master so that it shows up in -SNAPSHOT, then I'll test this on my work codebases to make sure it doesn't introduce any subtle bugs before we release.

@jbduncan tends to have a more careful eye than me, so I'll defer to him :)

fvgh · 2017-05-17T05:37:53Z

I was wondering whether it is time to put the format method into a child class, but since this means a complete refactoring and it would make it even harder to compare the original code with the few modifications. Your choice, let me know...

I ran the new version on junit, for which I need it in the first place.

nedtwigg · 2017-05-17T05:44:53Z

I ran the new version on junit, for which I need it in the first place.

Fantastic! Good enough for me 👍

I was wondering whether it is time to put the format method into a child class

The problem with naive pseudo-parsers such as ImportSorter is that they're easy to get working for most cases, but they suck up a lot of time with obscure bugs later in their lifecycle. I absolutely don't think it's worth a full rewrite - it's worked great so far, and you've tested it well.

I think you made the right call re: refactoring.

fvgh · 2017-05-17T05:58:50Z

That is also the point why I did not open a new child-class. You can of course implement more and more features to come closer to a real syntax aware implementation. But this cannot be the goal.
As I stated in my code comments, the multi-line detection is not complete.
But also the detection of import is not complete since it only accepts the import statement at the very beginning of the line, which is not in line with the Java syntax definition.
As long as we can cope with sensible code, that should be good enough. I was just a little bit surprised when the ImportOrder eat half of my groovy file due to scattered imports.
Here again, my solution is not perfect. The solution will only sort the imports for the first class. If the second class in the same file has additional imports, it will not touch them.
But for me that is good enough. Having multiple classes in one file is a feature in Groovy which in some cases can make your code readable. Scattering the imports in your code is in very rare cases useful, but then just when you need just one or two more imports. Other wise you should really ave two files for two classes. However, in a clean-code scenario I do not need a sorting for these additional imports.

jbduncan

Although I do have a comment about a code block within ImportSorter.java, it's not clear to me that this change is wise.

google-java-format, which Spotless supports, purposely throws an exception upon encountering comments within the imports section, since it's not clear whether the comment was purposely inserted or not, since it may document something important, so it should be up to a human to decide whether it should be removed or placed elsewhere in the file.

jbduncan · 2017-05-17T10:33:08Z

lib/src/main/java/com/diffplug/spotless/java/ImportSorter.java

+				isMultiLineComment = false;
+				if (!next.contains("/*")) {
+					continue;
+				}


I'm not entirely sure that this if..continue block is needed.

AFAICT, !next.contains("/*") can never be true at this point. I can explain my reasoning if you wish, but it's a bit long-winded, I've not tested it with JUnit or a debugger, and there's no harm to this block staying anyway AFAICT.

You have this situation in the following case:

/* //isMultiLineComment = true import what.so.ever */ //isMultiLineComment = false

In the old code what.so.ever would have been part of the refactored imports, so it was originally commented out.
If I refactor it to an inner class, it becomes easier to read. But at first I wanted to assure you, that I actually did not alter much in the initial logic of the method.

In the old code what.so.ever would have been part of the refactored imports, so it was originally commented out.

Sorry, I don't understand what this sentence is saying. Can you rephrase it for me?

Original behavior:

import i.need.this /* import i.don.t.need.that */

Results in

import i.need.this import i.don.t.need.that

New behavior detects multi-line comments. So result is:

import i.need.this

Sorry, just saw my mistake. I wanted to say:
In the old code what.so.ever would have been part of the refactored imports, though it was originally commented out.

Aha I see now. Thanks for the clarifications @fvgh. :)

jbduncan · 2017-05-17T10:51:45Z

Hi @fvgh, as I mentioned in my review, it's not clear to me that this code change is wise.

google-java-format, which Spotless supports, purposely throws an exception upon encountering comments within an imports section, since it's not clear whether the comment was purposely inserted or not, and it may document something important. So it makes some sense to me that it should be up to a human to decide whether it should be removed or placed elsewhere in the file, rather than let Spotless indiscriminately remove them.

But then again, most users (if not all users) of Spotless will also be using a VCS like Git, so they can revert or change commits if an important comment was accidentally removed. So maybe this isn't such a big problem... 🤔

fvgh · 2017-05-17T19:21:02Z

throws an exception upon encountering comments within the imports section

This is an option. I thought that the current behavior to eat comments was intended, so I did not change it. But let's do that in a separated PR, since it is unrelated to my changes.

they can revert or change commits

I think if you run a code formatter over your complete code without source control, well, ...

Refactoring the code and use an inner class is definitely an option here. But in the current form I think it is easier to see what I changed in the behavior. So for me it is important that we have both a common understanding of the behavioral change before I propose a refactored code.
The unit test files I provided should give a compressed view on the changed behavior. Once we agreed on it, I can offer you, as a second non-squashed commit, a proposal for an inner class solution.
I trust my UT to guarantee that I don't destroy anything. As regression test for the original behavior I will check with JUnit again.
Let me know if you have more questions on the logic and afterwards I can provide you with a refactored code if you like.

jbduncan · 2017-05-17T21:58:43Z

I thought that the current behavior to eat comments was intended, so I did not change it.

Aha, I didn't realise that. Let's keep the comment-eating behaviour then.

I'm fine with the tests, and it looks like they pass, so no problems there.

jbduncan · 2017-05-17T22:07:55Z

Basically, this LGTM now. 👍

Enhance Import Sorter/Order to cope with multi-line comments and misp…

18af99d

…laced imports

fvgh requested a review from jbduncan May 17, 2017 04:15

fvgh mentioned this pull request May 17, 2017

Add missing groovy plugin check #110

Merged

jbduncan reviewed May 17, 2017

View reviewed changes

fvgh merged commit b325e5c into master May 18, 2017

fvgh deleted the import_order_robustness branch May 18, 2017 04:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Import Sorter to cope with multi-line comments and misplaced imports #112

Import Sorter to cope with multi-line comments and misplaced imports #112

fvgh commented May 17, 2017 •

edited

Loading

nedtwigg commented May 17, 2017

fvgh commented May 17, 2017

nedtwigg commented May 17, 2017

fvgh commented May 17, 2017 •

edited

Loading

jbduncan left a comment

jbduncan May 17, 2017

fvgh May 17, 2017 •

edited

Loading

jbduncan May 17, 2017

fvgh May 18, 2017

fvgh May 18, 2017

jbduncan May 18, 2017

jbduncan commented May 17, 2017

fvgh commented May 17, 2017 •

edited

Loading

jbduncan commented May 17, 2017

jbduncan commented May 17, 2017

Import Sorter to cope with multi-line comments and misplaced imports #112

Import Sorter to cope with multi-line comments and misplaced imports #112

Conversation

fvgh commented May 17, 2017 • edited Loading

nedtwigg commented May 17, 2017

fvgh commented May 17, 2017

nedtwigg commented May 17, 2017

fvgh commented May 17, 2017 • edited Loading

jbduncan left a comment

Choose a reason for hiding this comment

jbduncan May 17, 2017

Choose a reason for hiding this comment

fvgh May 17, 2017 • edited Loading

Choose a reason for hiding this comment

jbduncan May 17, 2017

Choose a reason for hiding this comment

fvgh May 18, 2017

Choose a reason for hiding this comment

fvgh May 18, 2017

Choose a reason for hiding this comment

jbduncan May 18, 2017

Choose a reason for hiding this comment

jbduncan commented May 17, 2017

fvgh commented May 17, 2017 • edited Loading

jbduncan commented May 17, 2017

jbduncan commented May 17, 2017

fvgh commented May 17, 2017 •

edited

Loading

fvgh commented May 17, 2017 •

edited

Loading

fvgh May 17, 2017 •

edited

Loading

fvgh commented May 17, 2017 •

edited

Loading