inference: remove `throw` block deoptimization completely #49260

aviatesk · 2023-04-05T13:05:41Z

After experimenting with #49235, I started to question if we are getting any actual benefit from the throw block deoptimization anymore.

This commit removes the deoptimization from the system entirely.

Based on the numbers below, it appears that the deoptimization is not very profitable in our current Julia-level compilation pipeline, with the effects analysis playing a significant role in reducing latency.

Here are the updated benchmarks:

| Metric | master | #49235 | this commit |
|---|---|---|---|
| Base (seconds) | 15.579300 | 15.206645 | 15.42059 |
| Stdlibs (seconds) | 17.919013 | 17.667094 | 17.404586 |
| Total (seconds) | 33.499279 | 32.874737 | 32.826162 |
| Precompilation (seconds) | 53.488528 | 53.152028 | 53.152028 |
| First time `plot(rand(10,3))` [^1] | `3.432983 seconds (16.55 M allocations)` | `3.477767 seconds (16.45 M allocations)` | `3.539117 seconds (16.43 M allocations)` |
| First time `solve(prob, QNDF())(5.0)` [^2] | `4.628278 seconds (15.74 M allocations)` | `4.609222 seconds (15.32 M allocations)` | `4.547323 seconds (15.19 M allocations: 823.510 MiB)` |

[^1]: With precompilation of Plots.jl disabled.
[^2]: With precompilation of OrdinaryDiffEq disabled.
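
For readers unfamiliar with the term, the sketch below shows the kind of code a "`throw` block" refers to: a branch that unconditionally ends in `throw`. The function name `checked_sqrt` is hypothetical and does not come from this PR, and the comment describing what the deoptimization skipped is my reading of the discussion here, not a statement about the compiler's exact behavior.

```julia
# Hypothetical example; `checked_sqrt` is not part of the PR.
function checked_sqrt(x::Float64)
    if x < 0
        # This branch always ends in `throw`, so it is a "throw block".
        # Assumption: building the error message (string interpolation)
        # is the sort of work whose full inference the deoptimization
        # avoided in order to save compile time.
        throw(DomainError(x, "expected a non-negative value, got $x"))
    end
    return sqrt(x)
end

checked_sqrt(4.0)  # takes the non-throwing path
```

With the deoptimization removed, code on the error path is treated like any other code, which is what the benchmark columns above compare.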

aviatesk · 2023-04-05T13:06:04Z

@nanosoldier runbenchmarks("inference", vs=":master")

nanosoldier · 2023-04-05T14:00:59Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

vtjnash · 2023-04-06T00:35:31Z

I would like to remove it, but this seems surprisingly strongly negative on our benchmarks. Can we do something to reduce that, or is that just some sort of artifact of being a small test case?

aviatesk · 2023-04-06T07:46:53Z

Yeah, there seems to be profitability for small cases. I will add more examples to BaseBenchmarks.jl (by using existing packages) and see what it will say.

oscardssmith · 2023-04-20T18:48:14Z

I think we should merge this. Now that tab complete relies on proving termination and effectfree-ness, this PR also makes tab complete better.

aviatesk · 2023-09-20T09:13:37Z

@nanosoldier runbenchmarks("inference", vs=":master")

nanosoldier · 2023-09-20T09:15:52Z

Your job failed.

aviatesk · 2023-09-20T10:29:03Z

@nanosoldier runbenchmarks("inference", vs=":master")

nanosoldier · 2023-09-20T11:22:35Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

aviatesk · 2023-12-07T08:43:00Z

@nanosoldier runbenchmarks("inference", vs=":master")

nanosoldier · 2023-12-07T09:40:30Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

aviatesk · 2024-07-05T07:15:38Z

Although benchmarks suggest that this is indeed an effective optimization, it has been quite incomplete from the beginning and, more importantly, has started to cause numerous issues in other projects. So I will prioritize the development efficiency of those projects and remove this optimization nevertheless. We could try to come up with an alternative solution to improve latency for those code paths, but it doesn't seem to be a good idea to leave this situation as is.
I'm going to merge this once I confirm that CI passes.

topolarity · 2024-08-07T14:15:00Z

CI is green after #55356 - Good to merge?

aviatesk · 2024-08-07T18:45:33Z

@nanosoldier runbenchmarks("inference", vs=":master")

nanosoldier · 2024-08-07T20:19:02Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

oscardssmith · 2024-08-07T20:42:12Z

Looks like a few inference regressions, but nothing too surprising. IMO this is good to merge.

nsajko · 2024-08-25T20:52:03Z

This causes a weird inference regression, see #55583.

After investigating the cause of the invalidation reported in #55583, it was found that the issue arises only when `r` is propagated as an extended lattice element, such as `PartialStruct(UnitRange{Int}, ...)`, for the method of `getindex(::String, r::UnitRange{Int})`. Specifically, the path at https://github.com/JuliaLang/julia/blob/cebfd7bc66153b82c56715cb1cb52dac7df8eac8/base/compiler/typeinfer.jl#L809-L815 is hit, so the direct cause was the recursion limit for constant inference.

To explain in more detail, within the slow path of `nextind`, which is called inside `getindex(::String, ::UnitRange{Int})`, the 1-argument `@assert` is used: https://github.com/JuliaLang/julia/blob/cebfd7bc66153b82c56715cb1cb52dac7df8eac8/base/strings/string.jl#L211. The `print`-related code associated with this `@assert` in turn uses `getindex(::String, ::UnitRange{Int})`, which triggers the recursion limit. This recursion limit applies only to constant inference, which is why we saw this regression only for the `PartialStruct` case. Moreover, since the recursion limit is hit within the `@assert`-related code, this issue did not arise until now (i.e. until #49260 was merged).

As a solution, I considered improving the recursion limit itself, but decided that keeping the current code for the recursion limit of constant inference is safer. Ideally, this should be addressed on the compiler side, but there is certainly deep recursion in this case. As an easier solution, this commit resolves the issue by changing the 1-arg `@assert` to the 2-arg version.

- replaces #55583
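
The difference between the two `@assert` forms mentioned above can be sketched as follows. This is only an illustration of the macro's two call shapes; the helper `assert_message` is hypothetical, and the snippet does not reproduce the inference recursion described in the text.

```julia
# Hypothetical helper: run `f` and return the AssertionError message, if any.
function assert_message(f)
    try
        f()
        return nothing
    catch err
        return err isa AssertionError ? err.msg : rethrow()
    end
end

# 1-arg form: the message is derived from the asserted expression.
msg1 = assert_message() do
    @assert 1 > 2
end

# 2-arg form: the message is supplied explicitly by the caller.
msg2 = assert_message() do
    @assert 1 > 2 "one is not greater than two"
end
```

Supplying the message explicitly, as in the second call, is the shape the fix described above switches to.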

aviatesk changed the title ~~remove throw block deoptimization completely~~ inference: remove throw block deoptimization completely Apr 5, 2023

aviatesk force-pushed the avi/remove-throw-block-unopt branch from ffba552 to e7dee0e Compare April 5, 2023 13:29

aviatesk force-pushed the avi/throw-block-effects branch from bc32263 to ee45c04 Compare April 9, 2023 03:17

oscardssmith mentioned this pull request Apr 20, 2023

improve effects of objectid and getindex(::Dict) #49447

Merged

aviatesk force-pushed the avi/remove-throw-block-unopt branch from e7dee0e to a24372e Compare September 20, 2023 09:10

aviatesk force-pushed the avi/throw-block-effects branch from ee45c04 to 81287f2 Compare September 20, 2023 09:12

aviatesk force-pushed the avi/remove-throw-block-unopt branch from a24372e to 07165d7 Compare September 20, 2023 09:13

aviatesk force-pushed the avi/remove-throw-block-unopt branch from 07165d7 to 3ae3529 Compare September 20, 2023 09:31

aviatesk force-pushed the avi/throw-block-effects branch 6 times, most recently from c8a5046 to d77836e Compare September 26, 2023 04:54

Base automatically changed from avi/throw-block-effects to master September 26, 2023 06:44

aviatesk force-pushed the avi/remove-throw-block-unopt branch from 3ae3529 to 7328bb3 Compare September 26, 2023 09:26

aviatesk force-pushed the avi/remove-throw-block-unopt branch from 7328bb3 to b08925c Compare December 7, 2023 08:42

aviatesk force-pushed the avi/remove-throw-block-unopt branch from b08925c to 808bae1 Compare December 7, 2023 08:44

aviatesk force-pushed the avi/remove-throw-block-unopt branch from 808bae1 to 906bf12 Compare January 18, 2024 07:24

aviatesk added 4 commits August 2, 2024 03:34

fix up test

876e825

more test update

696d91c

update BuildSettings

af93df0

topolarity force-pushed the avi/remove-throw-block-unopt branch from 2853f37 to af93df0 Compare August 2, 2024 19:42

Profile: close files when assembling heap snapshot

a6f1246

topolarity mentioned this pull request Aug 2, 2024

Profile: close files when assembling heap snapshot #55356

Merged

Merge branch 'master' into avi/remove-throw-block-unopt

73a6687

JeffBezanson approved these changes Aug 7, 2024

JeffBezanson mentioned this pull request Aug 7, 2024

add --trim option for generating smaller binaries #55047

Merged

Merge branch 'master' into avi/remove-throw-block-unopt

27340ea

JeffBezanson merged commit 30d5a34 into master Aug 8, 2024
7 checks passed

JeffBezanson deleted the avi/remove-throw-block-unopt branch August 8, 2024 18:41

aviatesk added a commit that referenced this pull request Aug 9, 2024

inference: follow up #49260, remove no longer necessary functions

288c980

aviatesk added a commit that referenced this pull request Aug 9, 2024

inference: follow up #49260, remove no longer necessary functions (#5…

c3d0d67

…5430)

Seelengrab added a commit to Seelengrab/RequiredInterfaces.jl that referenced this pull request Aug 15, 2024

Fix tests on 1.12

37336f2

Due to JuliaLang/julia#49260, the constructor is now being inlined, so we need to check for `Expr(:new)` and not just `:call`.
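
A check of the kind this commit message describes might look like the sketch below. The helper name `is_call_or_new` is hypothetical and this is not the actual code from RequiredInterfaces.jl; it only illustrates accepting both expression heads.

```julia
# Hypothetical helper: accept either a `:call` expression or an
# `Expr(:new, ...)`, since an inlined constructor shows up as the latter.
is_call_or_new(ex) = ex isa Expr && ex.head in (:call, :new)

is_call_or_new(Expr(:call, :Foo, 1))  # a regular call expression
is_call_or_new(Expr(:new, :Foo, 1))   # an inlined constructor
```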

lazarusA pushed a commit to lazarusA/julia that referenced this pull request Aug 17, 2024

inference: remove throw block deoptimization completely (JuliaLang#…

cf951f7

…49260) Co-authored-by: Cody Tapscott <topolarity@tapscott.me> Co-authored-by: Oscar Smith <oscardssmith@gmail.com>

lazarusA pushed a commit to lazarusA/julia that referenced this pull request Aug 17, 2024

inference: follow up JuliaLang#49260, remove no longer necessary func…

7c4b40c

…tions (JuliaLang#55430)

nsajko mentioned this pull request Aug 25, 2024

fix a huge part of invalidation on defining a new Integer subtype #55583

Closed

aviatesk mentioned this pull request Aug 29, 2024

fix new type instability from getindex(::String, r::UnitRange{Int}) #55625

Merged

maleadt added a commit to JuliaGPU/GPUCompiler.jl that referenced this pull request Sep 17, 2024

Adapt to JuliaLang/julia#49260.

1a01e08

Keno added a commit to CedarEDA/DAECompiler.jl that referenced this pull request Sep 28, 2024

Generalize symbol type for debug scopes (#8)

54c02c1

* Generalize symbol type for debug scopes * More scope adjustment * Adjust to JuliaLang/julia#49260 * More Core.Compiler adjustments
