Concurrent type checking #202

ahejlsberg · 2025-01-01T23:30:04Z

This PR introduces concurrent type checking. By default, we now create four type checkers and divide programs into four equal parts that are type checked concurrently. Diagnostics are then collected from each of the checkers and merged into a single set of diagnostics. The division into equal parts is determined from the index of each particular source file in the program modulo the number of checkers. This means the partitioning is deterministic for the same set of source files, which in turn means the produced set of diagnostics remains stable across compilations. The set of diagnostics may however differ from a single-threaded run of the compiler since the compiler may choose different locations for elaborations.

Concurrent type checking isn't as perfectly parallelizable as parsing and binding because checking often resolves types across source file boundaries, potentially duplicating work between the four checkers. Indeed, concurrent type checking always consumes more memory (because of the duplicated types) and in the worst case is no faster than single-threaded checking. However, in practice, concurrent type checking appears to have a significant positive effect. For example, with single-threaded parsing, binding, and checking, the compile time for VSCode is around 12s, and with concurrent parsing and binding, but single threaded checking (as was the case before this PR), the compile time is 8.5s. With full concurrent parsing, binding, and type checking, the compile time drops to 4s at a cost of consuming 20% more memory. (Measured on an 8-core Intel Core i9.)

Note that four type checkers were experimentally chosen as a good balance between performance and memory consumption. With eight type checkers, VSCode compiles about 10% faster, but memory consumption further increases by 20%.

Also note that #200 is a prerequisite for this PR. Without #200, types are ordered by type IDs, which become non-deterministic with concurrent checking.

jakebailey · 2025-01-02T18:18:59Z

internal/compiler/utilities.go

-var nextMergeId atomic.Uint32
+var (
+	symbolMergeMutex sync.Mutex
+	nextMergeId      uint32
+)


With concurrent checkers, are we going to have a problem here? Exhausting int32 globally with parallel checking doesn't seem infeasible at all anymore...

It would take 4+ billion merged symbols, and then we'd really only have a problem if some of the symbols from 4+ billion in the past are still around to cause conflicts. I'm not too worried.

Given the old compiler had a max of 2^52, that's a lot of headroom we had before that could have been insulating us from this problem; I suspect that someone typing in a large project could actually get up to this limit, honestly...

# Conflicts: # internal/compiler/checker_test.go # internal/compiler/program.go

DanielRosenwasser · 2025-01-07T22:50:09Z

Will there be a way to run everything else in parallel, but checking in a single-threaded manner? Or to change the split-up count?

# Conflicts: # internal/compiler/checker_test.go # internal/compiler/program.go # internal/compiler/utilities.go

jakebailey · 2025-01-08T23:12:00Z

internal/compiler/utilities.go

+	symbolMergeMutex.Lock()
 	if symbol.MergeId == 0 {


I was initially confused about this, until I realized that the race is on symbol.MergeId itself (and not necessarily the global), since symbols can be shared between checkers.

jakebailey · 2025-01-08T23:15:16Z

Will there be a way to run everything else in parallel, but checking in a single-threaded manner? Or to change the split-up count?

Not in this PR there isn't; it's possible we could add some sort of flag to do that, since "concurrent checking" is the most dangerous thing comparatively.

ahejlsberg added 6 commits December 25, 2024 10:14

Deterministic ordering of types

Verified

This commit was signed with the committer’s verified signature.

pietroalbini Pietro Albini

GPG key ID: 3E06ABE80BAAF19C

Verified
Learn about vigilant mode

897de05

Remove unstable comparison

9520c1c

Fix potential index out of range

e5a7707

Include origin in union ordering

f5c1201

Favor ordering types by name

8a8f833

Concurrent type checking

6a837c3

ahejlsberg requested review from jakebailey, DanielRosenwasser and RyanCavanaugh January 1, 2025 23:30

Fix formatting

13aac5a

jakebailey reviewed Jan 2, 2025

View reviewed changes

Merge branch 'main' into concurrent-checking

6b29739

# Conflicts: # internal/compiler/checker_test.go # internal/compiler/program.go

ahejlsberg added 2 commits January 8, 2025 14:33

Merge branch 'main' into concurrent-checking

8f10fc8

# Conflicts: # internal/compiler/checker_test.go # internal/compiler/program.go # internal/compiler/utilities.go

Add comments

7f6d682

jakebailey reviewed Jan 8, 2025

View reviewed changes

jakebailey approved these changes Jan 8, 2025

View reviewed changes

ahejlsberg merged commit a172481 into main Jan 8, 2025
14 checks passed

jakebailey deleted the concurrent-checking branch January 13, 2025 20:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Concurrent type checking #202

Concurrent type checking #202

ahejlsberg commented Jan 1, 2025 •

edited

Loading

jakebailey Jan 2, 2025

ahejlsberg Jan 4, 2025

jakebailey Jan 6, 2025

DanielRosenwasser commented Jan 7, 2025 •

edited

Loading

jakebailey Jan 8, 2025

jakebailey commented Jan 8, 2025

Concurrent type checking #202

Concurrent type checking #202

Conversation

ahejlsberg commented Jan 1, 2025 • edited Loading

jakebailey Jan 2, 2025

Choose a reason for hiding this comment

ahejlsberg Jan 4, 2025

Choose a reason for hiding this comment

jakebailey Jan 6, 2025

Choose a reason for hiding this comment

DanielRosenwasser commented Jan 7, 2025 • edited Loading

jakebailey Jan 8, 2025

Choose a reason for hiding this comment

jakebailey commented Jan 8, 2025

ahejlsberg commented Jan 1, 2025 •

edited

Loading

DanielRosenwasser commented Jan 7, 2025 •

edited

Loading