add `--lean` compiler flag; improve error message when `strutils.%` or addf throws by showing all useful context #14282

timotheecour · 2020-05-09T03:48:52Z

move tests/modules/tstrutils_insert_sep.nim to where it belongs, tests/stdlib/tstrutils.nim
test on c js vm
strutils.% now shows a more helpful error msg:

import std/strutils
let a = "a $key2 b" % ["key", "val"]

before PR:
Error: unhandled exception: invalid format string [ValueError]

after PR:
Error: unhandled exception: invalid char '$' at index: '2' for input: 'a $key2 b' [ValueError]

I found it quite useful while working on #14278; without this, you got cryptic errors when format string was wrong (eg doc/advopt.txt)
after this you get all useful context which should help you spot the error faster.
The error formatting is not set in stone so shouldn't be relied on in hardcoded tests.

future work

This can be moved to lib/std/private/miscdollars.nim to be usable in other modules that need to report parsing error with useful context:

json
perhaps Add rst parse option for continuing after parse error #14280 (comment)
(EDIT) merge the pre-existing badly named tests/stdlib/tstrutil.nim with tests/stdlib/tstrutils.nim

links

up to 20x faster jsonutils deserialization #18183 (comment)

kaushalmodi · 2020-05-28T22:33:46Z

If you are planning to commit this, I will close #14486.

timotheecour · 2020-05-29T01:25:08Z

If you are planning to commit this

ya I am planning to, will get to it

Araq · 2020-12-14T09:37:55Z

IMHO "invalid format string: " & formatstr as the error message would be sweet spot between usefulness and code size. Keep in mind that Nim's strategic target is embedded devices and people enjoy using the full Nim language (plus all of its stdlib) for it. As a alternative, add a specialization for --opt:size (when compileOption("opt", "size")).

timotheecour · 2020-12-14T10:29:43Z

sweet spot between usefulness and code size. Keep in mind that Nim's strategic target is embedded devices

raiseParseError is noinline, the difference in cgen'd code is only 27LOC (or even 16LOC if I don't allow optional msg [1]) out of 340 LOC just for addf:

nim r --stacktrace:off --excessivestacktrace:off main

import strutils
proc main()=
  echo "abc1 $foo1 " % ["goo", "bar0"]
  echo "abc $foo2 " % ["foo", "bar"]
main()

before PR:
addf is 325 LOC, invalidFormatString is 14 LOC
after PR:
addf is 325 LOC, raiseParseError is 41 LOC

nim --eval:'echo (41-14)/(325 + 14)'
0.07964601769911504

an overall 27 LOC increase (8% increase in addf) is IMO insignificant, especially given fact that any real program will dwarf that size in cgen, and raiseParseError can as mentioned be used in other contexts, for any parsing function; the code will then be reused and won't add to the code size.

There are plenty of things that will have much bigger impact on reducing code size that should be tried first, but if somehow that 8% increase is a problem, I can introduce and document a nimOptimizeSize to avoid the extra size (that flag could be used in similar situations), but I don't think we should conflate --opt:size with this, --opt:size shouldn't affect semantics, even for error messages.

[1] but msg is useful if we reuse raiseParseError in other parsing procs

Araq · 2020-12-14T10:45:37Z

I am not talking about "LOC" (lines of code), I'm talking about the resulting binary sizes. And while it's hardly significant for this "one little thing", these little things do add up. And on an embedded chip, I'm not willing to pay the price for debug conveniences in the final build, eventually I stop debugging my code and then I'm left with the bloat. So arguably, this should all be done for debug mode only.

I can introduce and document a nimOptimizeSize to avoid the extra size (that flag could be used in similar situations), but I don't think we should conflate --opt:size with this, --opt:size shouldn't affect semantics, even for error messages.

We already have debug vs release vs danger, adding more doesn't seem wise.

timotheecour · 2020-12-15T08:49:34Z

for reference here's binary sizes, using nim r --eval:'import os; echo getFileSize("/tmp/main")'
(du has coarser resolution, even with BLOCKSIZE=1 it hardcodes as 512)

nim c -o:/tmp/main --stacktrace:off --listfullpaths:off --excessivestacktrace:off main.nim

before PR: 80824
after PR: 85920
after PR, wo msg: 85872

same with -d:danger
before PR: 60160
after PR: 69352
after PR, wo msg: 69304

I'm not willing to pay the price for debug conveniences in the final build

fine but then let's do it properly and without conflating features (which always bites you back), via a proper dedicated flag --lean which can be combined with other flags:

--lean: libraries (eg stdlib) can query for this and produce leaner error messasges, which does affect observable semantics even in normal conditions (raising CatchableErrors is "exceptional" but "normal")
-d:danger -d:release --opt:size --opt:speed # affect optimization + checks/asserts; does not affect semantics in normal conditions, only affects semantics for defects
--panics # ditto, this only affects semantics for defects

It's easy enough for a user on scarce resource to combine however they wish, eg nim r --lean -d:danger --panics:on --opt:size main

this nicely solves fix #14905: new recommended flag for performance/benchmarking: -d:nimDisableAssertMsgs #14907 too: --lean can be used instead of -d:nimDisableAssertMsgs, with similar spirit
--lean can be used in many other places, and a user using -d:danger will never have to worry about whether their exceptions's messages get changed just because of -d:danger, since this would be controlled by a separate flag.
stdlib can be even more aggressive about shortening messages when --lean is passed since that's the intent; it could even be reduced to an error code, but that can be discussed separately

Araq · 2020-12-15T09:42:25Z

fine but then let's do it properly and without conflating features (which always bites you back), via a proper dedicated flag --lean which can be combined with other flags

When I want to optimize for size, it means I care about the size. Why shouldn't the stdlib adapt to my wish? How is --opt:size --lean simpler to use than just --opt:size? Yes, it's not only an "optimization" in the strictest sense of the word. At least make lean imply --opt:size --stackTrace:off then

timotheecour · 2020-12-15T10:28:09Z

PTAL, added --lean accordingly.

Why shouldn't the stdlib adapt to my wish?

because users would not expect --opt:size to affect semantics, including of messages in exceptions. Having a dedicated flag that is explicitly documented as enabling leaner messages (hence different semantics) enables even more aggressive size optimizations. So it's a win-win, at the price of 1 extra flag to learn about, but likely just for users targetting applications such as embedded.

At least make lean imply --opt:size --stackTrace:off then

done but we may need to revisit this decision soon; i'm not sure it's a good idea, because --lean can also be useful for other things, eg benchmarking or release/danger builds where user is fine about the aggressive size optimizations involving semantic changes (to exception messages etc), and --opt:size is mutually exclusive with --opt:speed

(I also made it imply --listfullpaths:off; at least these flags should be overrideable as usual, eg --lean --listfullpaths:on)

Keeping flags orthogonal is always the simplest/less buggy / most flexible; documentation is always there to help picking which flags for size/speed/debug.

timotheecour · 2021-02-01T23:31:41Z

ping @xflywind @Araq (i had to rebase to avoid bitrot conflicts)

Varriount · 2021-07-11T06:30:49Z

Is this really so important that it needs an entirely new compiler flag? How often are library writers actually going to check for this flag? And if it affects semantics, why would users use it? Affecting semantics means libraries depending on "regular" semantics may break, sometimes in unexpected ways.

stale · 2022-07-12T00:06:58Z

This pull request has been automatically marked as stale because it has not had recent activity. If you think it is still a valid PR, please rebase it on the latest devel; otherwise it will be closed. Thank you for your contributions.

timotheecour mentioned this pull request May 9, 2020

Add rst parse option for continuing after parse error #14280

Closed

timotheecour mentioned this pull request May 28, 2020

strutils: Make the invalidFormatString error a bit more informative #14486

Closed

timotheecour mentioned this pull request Dec 13, 2020

strutils: % or addf uses invalidFormatString which shows unhelpful error (should show context) timotheecour/Nim#451

Open

timotheecour force-pushed the pr_strutils_errmsg branch 2 times, most recently from bb3593e to 1183ab5 Compare December 14, 2020 04:12

timotheecour changed the title ~~[WIP] improve error message when strutils.addf throws by showing all useful context~~ improve error message when strutils.% or addf throws by showing all useful context Dec 14, 2020

timotheecour marked this pull request as ready for review December 14, 2020 04:16

timotheecour requested a review from ringabout December 14, 2020 04:41

timotheecour mentioned this pull request Dec 15, 2020

fix #14905: new recommended flag for performance/benchmarking: -d:nimDisableAssertMsgs #14907

Closed

timotheecour force-pushed the pr_strutils_errmsg branch from 1183ab5 to ad87c3a Compare December 15, 2020 10:22

timotheecour force-pushed the pr_strutils_errmsg branch from ad87c3a to e05d926 Compare December 15, 2020 17:53

timotheecour force-pushed the pr_strutils_errmsg branch from e7e03b2 to aad4aa5 Compare February 1, 2021 23:29

timotheecour changed the title ~~improve error message when strutils.% or addf throws by showing all useful context~~ add --lean compiler flag; improve error message when strutils.% or addf throws by showing all useful context Feb 1, 2021

timotheecour mentioned this pull request Feb 1, 2021

faster copyFile via zero-copy API's (copyfile, splice etc) nim-lang/RFCs#330

Open

9 tasks

timotheecour mentioned this pull request Jun 5, 2021

up to 20x faster jsonutils deserialization #18183

Merged

timotheecour added 6 commits July 7, 2021 12:16

improve errmsg in strutils.addf

edf5840

improve error message when addf throws by showing all useful context

e3b18eb

improve test names

c49e84c

improve strutils.% error message

0faffdf

--lean flag to aggresively optimize for binary size

b28520d

fixup

7d453f0

fixup

d61bfb4

timotheecour force-pushed the pr_strutils_errmsg branch from aad4aa5 to d61bfb4 Compare July 7, 2021 19:17

fixup

bfd69ba

timotheecour mentioned this pull request Jul 7, 2021

Extended side effect error messages #18418

Merged

fix after rebase

e760eb9

stale bot added the stale Staled PR/issues; remove the label after fixing them label Jul 12, 2022

konsumlamm mentioned this pull request Aug 4, 2022

Improve error message for strutils.addf #20157

Merged

Varriount closed this in #20157 Aug 5, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add `--lean` compiler flag; improve error message when `strutils.%` or addf throws by showing all useful context #14282

add `--lean` compiler flag; improve error message when `strutils.%` or addf throws by showing all useful context #14282

timotheecour commented May 9, 2020 •

edited

Loading

kaushalmodi commented May 28, 2020 •

edited

Loading

timotheecour commented May 29, 2020

Araq commented Dec 14, 2020

timotheecour commented Dec 14, 2020

Araq commented Dec 14, 2020 •

edited

Loading

timotheecour commented Dec 15, 2020 •

edited

Loading

Araq commented Dec 15, 2020

timotheecour commented Dec 15, 2020 •

edited

Loading

timotheecour commented Feb 1, 2021 •

edited

Loading

Varriount commented Jul 11, 2021

stale bot commented Jul 12, 2022

add --lean compiler flag; improve error message when strutils.% or addf throws by showing all useful context #14282

add --lean compiler flag; improve error message when strutils.% or addf throws by showing all useful context #14282

Conversation

timotheecour commented May 9, 2020 • edited Loading

future work

links

kaushalmodi commented May 28, 2020 • edited Loading

timotheecour commented May 29, 2020

Araq commented Dec 14, 2020

timotheecour commented Dec 14, 2020

Araq commented Dec 14, 2020 • edited Loading

timotheecour commented Dec 15, 2020 • edited Loading

Araq commented Dec 15, 2020

timotheecour commented Dec 15, 2020 • edited Loading

timotheecour commented Feb 1, 2021 • edited Loading

Varriount commented Jul 11, 2021

stale bot commented Jul 12, 2022

add `--lean` compiler flag; improve error message when `strutils.%` or addf throws by showing all useful context #14282

add `--lean` compiler flag; improve error message when `strutils.%` or addf throws by showing all useful context #14282

timotheecour commented May 9, 2020 •

edited

Loading

kaushalmodi commented May 28, 2020 •

edited

Loading

Araq commented Dec 14, 2020 •

edited

Loading

timotheecour commented Dec 15, 2020 •

edited

Loading

timotheecour commented Dec 15, 2020 •

edited

Loading

timotheecour commented Feb 1, 2021 •

edited

Loading