Improve error messages from CLI invocations #592

edoardopirovano · 2021-06-24T16:42:05Z

This PR improves the error messages we get from CodeQL CLI invocations that went wrong. In particular, it captures the stderr of the program (up to a limit of 20,000 characters), and creates an error message that includes this as well as the command being run and the exit code. This error will bubble up to the top level where it should be caught and included in the status reports that get uploaded.

Merge / deployment checklist

Confirm this change is backwards compatible with existing workflows.
Confirm the readme has been updated if necessary.
Confirm the changelog has been updated if necessary.

adityasharad

Looks reasonable, though I think Andrew or Robert should take a look too.

adityasharad · 2021-06-24T17:18:12Z

src/codeql.ts

    listeners: {
      stdout: (data: Buffer) => {
        output += data.toString();
      },
+      stderr: (data: Buffer) => {
+        const toRead = Math.min(maxErrorSize - error.length, data.length);
+        error += data.toString("utf8", 0, toRead);


Unfortunately I don't think we're guaranteed UTF-8 encoded output. But that's hopefully rare, and partially incomplete errors are better than no errors.

Yeah I think it's reasonable to assume most of the errors we encounter will be UTF8. Certainly I think anything the CLI outputs typically is, unless a string we include in the error (file path, snippet from a QL file, etc.) isn't. I think it's okay if those cases don't get a perfect error message in the status reports.

aeisenberg · 2021-06-24T21:03:07Z

src/codeql.ts

+ * (2) It avoids us hitting the limit of how much data we can send in our
+ *     status reports on GitHub.com.
+ */
+const maxErrorSize = 20_000;


Not a big deal, but is there a reason why you chose 20,000? Do you know what the max size for a status report is?

The max size of fields in the status report is 100kb and UTF8 characters can be up to four bytes so the max length of the message is 25k characters (assuming we're really unlucky and all the characters happen to require four bytes). I subtracted 5k to allow other space for the command invocation and arguments.

Nice. Thanks for the explanation. Could you add something like that to the comment above just so this context doesn't get lost?

The max size of fields in the status report is 100kb

Just curious, where did this information come from? For my own benefit really as I didn't know what the limit was or where it comes from.

Ah, I misunderstood your comment on the issue as meaning that you knew 100kb was the limit rather than it being a suggestion. The database column allows storing up to 2GB apparently, but I'm not sure how much data the endpoint that receives status reports will accept.

Ah, no worries. 25k characters is probably fine anyway, though you could increase the limit and it would be fine.

The API should accept large volumes. Not up to 2GB but I think the limit is around 400MB so easily high enough. There's also protobuf in the pipeline there but the limit on that is also big enough for anything we might want to send.

aeisenberg

This is strictly better than we had before, and so I think it should be merged, but do you know if this fixes the error reporting problems discussed earlier?

edoardopirovano · 2021-06-24T22:25:29Z

This is strictly better than we had before, and so I think it should be merged, but do you know if this fixes the error reporting problems discussed earlier?

It will certainly make the error message better than what we had before. Whether they'll be sufficiently better to actually gleam information from I think is something we'll have to establish once the data starts flowing in. Perhaps, we'll start to see patterns of common errors and we'll want to make special cases for them to distinguish them - for instance, clearly marking certain errors as user/configuration errors and others as CLI ones.

At the moment, however, we can't really do this because all the messages in status reports say the exact same thing (CLI failed with exit code 2) so we don't really know what errors we need to distinguish. I suggest we wait a week or two from where this is merged and then take a look at what errors we have gotten and if/how we can improve them further.

edoardopirovano requested review from aeisenberg and robertbrignull June 24, 2021 16:42

edoardopirovano requested a review from a team as a code owner June 24, 2021 16:42

adityasharad reviewed Jun 24, 2021

View reviewed changes

aeisenberg reviewed Jun 24, 2021

View reviewed changes

aeisenberg approved these changes Jun 24, 2021

View reviewed changes

Improve error messages from CLI invocations

3b83544

edoardopirovano force-pushed the better-errors branch from 89d93cd to 3b83544 Compare June 24, 2021 22:27

edoardopirovano enabled auto-merge (rebase) June 24, 2021 22:27

edoardopirovano merged commit 40852fa into main Jun 24, 2021

edoardopirovano deleted the better-errors branch June 24, 2021 22:38

edoardopirovano mentioned this pull request Jun 25, 2021

Remove misleading comment #595

Merged

github-actions bot mentioned this pull request Jun 28, 2021

Merge main into v1 #601

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve error messages from CLI invocations #592

Improve error messages from CLI invocations #592

edoardopirovano commented Jun 24, 2021

adityasharad left a comment

adityasharad Jun 24, 2021

edoardopirovano Jun 24, 2021

aeisenberg Jun 24, 2021

edoardopirovano Jun 24, 2021

aeisenberg Jun 24, 2021

edoardopirovano Jun 24, 2021

robertbrignull Jun 25, 2021

edoardopirovano Jun 25, 2021

robertbrignull Jun 25, 2021

aeisenberg left a comment

edoardopirovano commented Jun 24, 2021 •

edited

Loading

Improve error messages from CLI invocations #592

Improve error messages from CLI invocations #592

Conversation

edoardopirovano commented Jun 24, 2021

Merge / deployment checklist

adityasharad left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aeisenberg left a comment

Choose a reason for hiding this comment

edoardopirovano commented Jun 24, 2021 • edited Loading

edoardopirovano commented Jun 24, 2021 •

edited

Loading