Strict typechecking mode #2077

jneem · 2024-10-23T03:54:20Z

This adds a --strict-typechecking flag that starts the typechecker in "checking" mode, equivalent to wrapping each file in (...) : _.

A couple of points I was unsure of:

this PR does it for every file, but maybe we only want it at the top level?
I noticed while testing that despite the "strict" in the name, it's sometimes more permissive than non-strict mode (because of type inference; see the added tests). Maybe "strict" mode should run both kinds of typechecking so it's actually strictly stricter?

Fixes #2049

github-actions · 2024-10-23T04:06:27Z

Bencher Report

Branch	2077/merge
Testbed	ubuntu-latest

⚠️ WARNING: The following Measure does not have a Threshold. Without a Threshold, no Alerts will ever be generated!
Latency
Click here to create a new Threshold
For more information, see the Threshold documentation.
To only post results if a Threshold exists, set the --ci-only-thresholds CLI flag.

Click to view all benchmark results

Benchmark	Latency	nanoseconds (ns)
fibonacci 10	📈 view plot ⚠️ NO THRESHOLD	494,100.00
foldl arrays 50	📈 view plot ⚠️ NO THRESHOLD	1,771,100.00
foldl arrays 500	📈 view plot ⚠️ NO THRESHOLD	6,748,500.00
foldr strings 50	📈 view plot ⚠️ NO THRESHOLD	7,256,500.00
foldr strings 500	📈 view plot ⚠️ NO THRESHOLD	63,521,000.00
generate normal 250	📈 view plot ⚠️ NO THRESHOLD	46,735,000.00
generate normal 50	📈 view plot ⚠️ NO THRESHOLD	2,101,700.00
generate normal unchecked 1000	📈 view plot ⚠️ NO THRESHOLD	3,247,200.00
generate normal unchecked 200	📈 view plot ⚠️ NO THRESHOLD	749,480.00
pidigits 100	📈 view plot ⚠️ NO THRESHOLD	3,230,800.00
pipe normal 20	📈 view plot ⚠️ NO THRESHOLD	1,490,300.00
pipe normal 200	📈 view plot ⚠️ NO THRESHOLD	10,218,000.00
product 30	📈 view plot ⚠️ NO THRESHOLD	831,740.00
scalar 10	📈 view plot ⚠️ NO THRESHOLD	1,516,400.00
sum 30	📈 view plot ⚠️ NO THRESHOLD	830,670.00

🐰 View full continuous benchmarking report in Bencher

yannham

I noticed while testing that despite the "strict" in the name, it's sometimes more permissive than non-strict mode (because of type inference; see the added tests). Maybe "strict" mode should run both kinds of typechecking so it's actually strictly stricter?

I think it's fine as it is; it's the expected semantics (as you say, it behaves is if we put : _ every where at the top of each file), and while it's more permissive, it's not accepting dubious programs. I would say that it's smarter rather than just more permissive.

Regarding tests, maybe we should add a test that imports are also properly strictly typechecked.

yannham · 2024-10-23T08:03:25Z

core/src/typecheck/mod.rs

@@ -1382,8 +1382,10 @@ pub fn type_check(
    t: &RichTerm,
    initial_ctxt: Context,
    resolver: &impl ImportResolver,
+    strict: bool,


Nitpick: there is an enum typecheck::pattern::TypecheckMode = { Walk, Enforce }. Maybe it would be better to use that instead of a boolean (boolean blindness and all that) ? In that case maybe hoist it up in the typecheck module, because it's not for patterns only anymore.

yannham · 2024-10-23T08:10:03Z

I think it's fine as it is; it's the expected semantics (as you say, it behaves is if we put : _ every where at the top of each file), and while it's more permissive, it's not accepting dubious programs. I would say that it's smarter rather than just more permissive.

Ok but now maybe I see your point: it might be surprising that some files require --strict-typechecking to work. On the other hand it would be sad and wasteful to just run typechecking a second time in walk mode. I'm tempted to say that it's on the user. Maybe providing a way to do the same thing with a pragma located in the file instead would be more resilient to this kind of issues, as now the decision is taken by the source program and not the invoker of the CLI?

jneem · 2024-10-23T08:44:51Z

I was mainly thinking from the point of view of someone wanting to run extra checks in CI. There, you don't really care if nickel typecheck is a little (or even a factor of 2) slower, but you'd be surprised if it misses out on some problems that are caught by a "normal" typecheck. I'm not sure that having pragmas in the file really solves this use case, though: I'm imagining that I have a normal nickel file that isn't supposed to need any special settings, and I just want to run some extra-strict checks on it.

yannham · 2024-10-23T08:59:45Z

Hmm, indeed.

By the way, should we make --strict-typechecking an argument of nickel typecheck only? Maybe that would make it clear that this is some kind of separate check, and then it would be acceptable to run typechecking in both mode - we don't penalize normal execution, which is what matters probably. In that case I tend to agree that maybe the double run is what you want.

And if it starts to be too slow, we can easily make it so that the typechecker just ignores any expression with a type or a contract annotation on the second run (some kind of shallow walk,so to speak), to make sure it doesn't duplicate any work from the strict pass.

Another solution would be to rerun the typechecker in strict mode upon failure in normal execution: if the program is typeable with a bit more effort, then we accept it. But I'm not sure it's such a good idea: if people start to rely on this behavior, then the default path would waste work in the first attempt at each execution.

yannham · 2024-10-23T09:01:58Z

And if it starts to be too slow, we can easily make it so that the typechecker just ignores any expression with a type or a contract annotation on the second run (some kind of shallow walk,so to speak), to make sure it doesn't duplicate any work from the strict pass.

Wait, we can't do that, because that won't catch your example. Maybe the converse is fine though: first run the normal pass, and for the strict pass, only reprocess what was untyped before? Since inference is more powerful, it seems like if a typed block passes the normal phase, then it would necessarily pass the strict one. Would have to think a bit harder to make sure.

cli/src/typecheck.rs

jneem added 2 commits October 22, 2024 15:38

Add interpreter config and strict typecheck mode

7a8ae3c

Add tests

99c5290

jneem requested a review from yannham October 23, 2024 03:54

yannham approved these changes Oct 23, 2024

View reviewed changes

jneem added 3 commits October 23, 2024 21:40

Make it typechecker-only, and run it twice

548f016

Add a test for imports

2b42e26

Test the cli also

7e42fa7

yannham reviewed Oct 23, 2024

View reviewed changes

cli/src/typecheck.rs Outdated Show resolved Hide resolved

jneem added 2 commits October 24, 2024 08:27

Fix benches, add a comment

2343582

Remove unused config module

6d9347b

jneem enabled auto-merge October 24, 2024 03:12

jneem added this pull request to the merge queue Oct 24, 2024

Merged via the queue into master with commit 0b31a5f Oct 24, 2024
5 checks passed

jneem deleted the interpreter-config branch October 24, 2024 03:21

BrewTestBot mentioned this pull request Nov 12, 2024

nickel 1.9.0 Homebrew/homebrew-core#197479

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Strict typechecking mode #2077

Strict typechecking mode #2077

jneem commented Oct 23, 2024

github-actions bot commented Oct 23, 2024 •

edited

Loading

yannham left a comment

yannham Oct 23, 2024

yannham commented Oct 23, 2024

jneem commented Oct 23, 2024

yannham commented Oct 23, 2024

yannham commented Oct 23, 2024

Strict typechecking mode #2077

Strict typechecking mode #2077

Conversation

jneem commented Oct 23, 2024

github-actions bot commented Oct 23, 2024 • edited Loading

Bencher Report

yannham left a comment

Choose a reason for hiding this comment

yannham Oct 23, 2024

Choose a reason for hiding this comment

yannham commented Oct 23, 2024

jneem commented Oct 23, 2024

yannham commented Oct 23, 2024

yannham commented Oct 23, 2024

github-actions bot commented Oct 23, 2024 •

edited

Loading