Eagerly invert plan on formation of AdjointPlan: correct eltype and remove output_size #113
Conversation
Codecov Report

```
@@            Coverage Diff             @@
##           master     #113      +/-   ##
==========================================
- Coverage   93.87%   93.70%   -0.17%
==========================================
  Files           5        5
  Lines         441      445       +4
==========================================
+ Hits          414      417       +3
- Misses         27       28       +1
```

☔ View full report in Codecov by Sentry.
In the common case, the scale of the inverse plan cancels with the normalization, so no extra scaling pass is needed at all; when it does not cancel, the leftover scaling can be applied in place so that `x` is only traversed once. For example:

```julia
function adjoint_mul(p::Plan{T}, x::AbstractArray, ::FFTAdjointStyle) where {T}
    dims = fftdims(p)
    N = normalization(T, size(p), dims)
    pinv = inv(p)
    if pinv isa ScaledPlan
        # Common case: the inverse plan's scale equals the normalization, so the
        # unscaled inner plan alone gives the adjoint. Otherwise, apply the
        # leftover factor in place with rmul!.
        return pinv.scale == N ? pinv.p * x : rmul!(pinv.p * x, pinv.scale / N)
    else
        return rmul!(pinv * x, inv(N))
    end
end
```
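(Here `rmul!` mutates the freshly allocated result of applying the plan, so the non-cancelling case costs one in-place pass over the output rather than a second allocation.)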
Implemented in f9b5de3. In the FFT case, a neat way to handle the case where the scaling does not cancel was to combine the extra scaling into the scaled plan, which then does the scaling itself (AbstractFFTs.jl/src/definitions.jl, line 268 in f9b5de3). In the RFFT case we already loop over `x` with a `map`, so we take the scaling out of the scaled plan and into the map.
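A minimal sketch of the first kind of fusion (illustrative only; `fuse_adjoint_scale` is a hypothetical helper name, not the exact code at that line):

```julia
using AbstractFFTs: ScaledPlan

# Hypothetical helper: fold the leftover adjoint factor (pinv.scale / N) into
# the ScaledPlan itself, so applying the plan performs a single combined
# scaling pass instead of scaling the output a second time afterwards.
fuse_adjoint_scale(pinv::ScaledPlan, N) = ScaledPlan(pinv.p, pinv.scale / N)
```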
Co-authored-by: Steven G. Johnson <stevenj@mit.edu>
Bump on merge
```julia
if pinv isa ScaledPlan && pinv.scale == N
    return pinv.p * x
else
    return (1/N * pinv) * x
```
@gaurav-arya This line introduced a `Float64` promotion that was not (necessarily) present in the previous implementation.
I don't quite follow, could you explain or give an example?
`1 / N` will be a `Float64`, which will promote everything. In the old code this did not happen.
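An assumed illustration of the concern (not from the thread), supposing `N` were an `Int`:

```julia
julia> N = 9;             # if N were an Int, e.g. a product of sizes

julia> s = 1 / N          # Int / Int yields Float64
0.1111111111111111

julia> eltype(s * rand(Float32, 2, 2))   # a Float64 scale promotes the result
Float64
```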
Apologies that I still don't follow: why does *where* we divide by `N` affect the promotion behavior?
And why is `N` a `Float64` in the first place, rather than a quantity of type `T`?
```julia
julia> using FFTW

julia> P = plan_fft(rand(Float32, 3, 3));

julia> P' * rand(Float32, 3, 3)
3×3 Matrix{ComplexF32}:
   3.48416+0.0im        -0.264397-0.859624im  -0.264397+0.859624im
 -0.0523138-0.781731im  -0.462729-0.257876im  -0.902054+0.397495im
 -0.0523138+0.781731im  -0.902054-0.397495im  -0.462729+0.257876im
```

Could you provide an example of the issue?
Bump
Bump -- I still don't follow your initial comment, since `Int64` and `Float32` promote to `Float32`. An example would be much appreciated.
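For reference, an assumed REPL check of that promotion rule:

```julia
julia> promote_type(Int64, Float32)    # mixed Int64/Float32 arithmetic stays Float32
Float32

julia> eltype(2 * rand(Float32, 2, 2)) # an Int scale does not promote Float32 data
Float32
```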
Bump. If this is a problem that exists on the main branch, could you make an issue for it, with a reproducer?
Sorry, somehow this conversation was lost in my stream of GitHub notifications.

> And why is N a Float64 in the first place, rather than a quantity of type T?

I assumed it would be an `Int` (and hence `1/N` would be a `Float64`), but you're right of course that it is of type `T`. Maybe one could even use `T = eltype(pinv)` here, since the computations are performed with `pinv`, but I think it should not matter. Possibly `inv(N)` would be a tiny bit safer, since then no `Int`s would be involved, but I can't think of any relevant case where it should matter that the inverse is computed in a way that involves an `Int`.
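An assumed REPL illustration of why either form stays in `Float32` once `N` has type `T`:

```julia
julia> N = one(Float32) / 9    # a normalization computed in type T = Float32
0.11111111f0

julia> (1 / N) isa Float32     # Int / Float32 promotes to Float32, not Float64
true

julia> inv(N) isa Float32      # inv(N) avoids involving an Int at all
true
```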
```julia
pinv = inv(p)
n = size(pinv, halfdim)
# Optimization: when pinv is a ScaledPlan, fuse the scaling into our map to
# ensure we do not loop over x twice.
scale = pinv isa ScaledPlan ? pinv.scale / 2N : 1 / 2N
```
Same here, this introduces `Float64` promotions.
```julia
julia> using FFTW

julia> P = plan_rfft(rand(Float32, 3, 3));

julia> P' * (P * rand(Float32, 3, 3))
3×3 Matrix{Float32}:
 1.35893  3.17063  3.99985
 1.15958  6.03116  3.67565
 3.10326  5.80633  3.89977
```
This PR resolves #110. After some thinking, I realized that the right way to do this is to actually call `inv(p)` in the formation of the adjoint plan. This makes `adjoint(p)` work the same way as `inv(p)`, in that caching occurs in plan formation rather than plan application.

As a consequence, I realized that `output_size` should not actually be the job of an adjoint style. Plan implementers are already required to manually figure out the output size for their plan when implementing `size(plan_inv(p))`, and hence it didn't actually make sense to have the adjoint style trait provide this. If we want to make the output size available for free without forming the inverse plan, that's a separate line of work (one that could also simplify the lives of the original plan implementers), but it shouldn't be the job of adjoint traits -- the goal of adjoint traits is to make adjoints easy given that all inverse functionality is implemented, and inverses already give the output size for free.

Note that this PR is technically breaking for implementers of new adjoint styles (by removing one required function), but does not affect users of existing adjoint styles.

Edit: this PR also includes the fusion optimization suggested in #113 (comment).
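A minimal sketch of the eager-inversion idea (hypothetical type, field, and function names; not the exact code merged in this PR):

```julia
using AbstractFFTs: Plan

# Hypothetical sketch: compute and store inv(p) when the adjoint plan is
# formed, so repeated applications of the adjoint reuse the cached inverse,
# just as repeated applications of inv(p) reuse a cached plan.
struct EagerAdjointPlan{T,P<:Plan} <: Plan{T}
    p::P        # the original plan
    pinv::Plan  # inverse, computed eagerly at construction
end

form_adjoint(p::Plan{T}) where {T} = EagerAdjointPlan{T,typeof(p)}(p, inv(p))
```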