Restructure sparse backends and replace subtyping by traits #40

gdalle · 2024-04-12T06:30:19Z

Motivation

This is a breaking PR (bumping version to v0.3.0) that restructures the handling of sparse backends, and cleans up the package in the process.

TLDR: Most (concrete) structs have not changed too much, but

their abstract subtypes are replaced by a mode trait

their sparse counterparts are replaced by the AutoSparse struct

Breaking API changes

Bump Julia compat lower bound to 1.10

Mode

Remove the abstract supertypes for the different modes (e.g. AbstractForwardMode and AbstractSparseForwardMode), and replace them with the mode trait. They were not documented but some packages used them anyway, like DifferentiationInterface.jl.
Make mode(ad::AbstractADType) return one of the following singletons: ForwardMode(), ReverseMode(), SymbolicMode() or the ambiguous ForwardOrReverseMode() (for cases like AutoEnzyme and AutoChainRules)
Assign the finite differences packages to ForwardMode

Concrete structs

Remove AutoModelingToolkit
Document the use of the mode field for AutoEnzyme: either nothing or a subtype of EnzymeCore.Mode (see What is the right way to specify the Enzyme mode? #24)
Remove the fdm = nothing default for AutoFiniteDifferences, since it always has to be a finite difference method
Add a tag and the associated type parameter to AutoPolyesterForwardDiff (see Don't use nothing Tag JuliaDiff/PolyesterForwardDiff.jl#18)
Remove the AutoSparseForwardDiff type and others like it, and turn them into deprecated constructors:

@deprecate AutoSparseForwardDiff(; kwargs...) AutoSparse(AutoForwardDiff(; kwargs...))

Non-breaking API changes

Export AbstractADType, which is useful for many downstream users
Document keyword argument constructors (part of the API now)
Add AutoSymbolics to replace AutoModelingToolkit
Add AutoTapir for https://github.com/withbayes/Tapir.jl

Sparsity

Introduce an AutoSparse struct that wraps a dense_ad backend with a sparsity_detector and a coloring_algorithm (as discussed in Change handling of sparse backends #38)
Define getter functions for this new struct
Define abstract types and default (trivial) choices for
- the sparsity detector (AbstractSparsityDetector, NoSparsityDetector)
- the coloring algorithm (AbstractColoringAlgorithm, NoColoringAlgorithm)
Define utilities for getting Jacobian/Hessian sparsity patterns based on an AbstractSparsityDetector:
- jacobian_sparsity
- hessian_sparsity
Define utilities for getting column/row colorings based on an AbstractColoringAlgorithm:
- column_coloring
- row_coloring

Other changes

Dependencies

Add ChainRulesCore.jl and EnzymeCore.jl as weak dependencies to define more specific mode dispatches
- based on ChainRulesCore.RuleConfig for ChainRulesCore.jl
- based on EnzymeCore.Mode for EnzymeCore.jl

Package quality

Split source code into several files
Add code coverage to CI (see Add Codecov action #41)
Add more tests
Include Aqua and JET to tests for code quality
Add and improve docstrings
Structure API reference in the docs with sections
Clarify README

Checklist

Appropriate tests were added
Any code changes were done in a way that does not break public API
All documentation related to code changes were updated
The new code follows the
contributor guidelines, in particular the SciML Style Guide and
COLPRAC.
Any new documentation only uses public API

Questions

Should we document type parameters of the concrete structs? I feel like this limits us in the range of non-breaking changes we can do: for instance, turning AutoPolyesterForwardDiff{chunksize} into AutoPolyesterForwardDiff{chunksize,T} is breaking because someone might have used the former as a constructor. That is why I explicitly document keyword constructors and fields.
Is the weak dependency handling okay on Julia < 1.9? The alternative would be to avoid defining the mode trait altogether, which might be preferable
Should we pull in Diffractor.jl and Zygote.jl in the test suite to check on actual instances of the RuleConfig paradigm? I define very simple RuleConfigs in runtests.jl but it's less reliable than taking the real ones

…thms

src/abstract.jl

src/sparse.jl

ChrisRackauckas · 2024-04-13T09:54:28Z

Something not mentioned in the top there is that this will require that ArrayInterface.jl take on an ADTypes.jl dependency, which is fine given that this is lightweight and all, but it should be noted that its ColoringAlgorithm needs a @deprecate ColoringAlgorithm = ADTypes.AbstractColoringAlgorithm which is then used in downstream implementation of its functions. fast_matrix_colors and matrix_colors live there since they are part of the extended sparse matrix interface, so that array types can define their coloring pattern overloads (things like BandedMatrices) without taking a full dependency on the AD libraries, but then specialize whenever such structured matrices are hit with the coloring algorithm (ex. BandedMatrices has a nice analytical solution based on the band size).

ChrisRackauckas · 2024-04-13T09:54:45Z

The symbols instead of abstract types thing is weird. That needs some explanation.

gdalle · 2024-04-13T10:00:00Z

Thanks for the review! Indeed you identified two parts where I was unsure:

The symbols instead of abstract types thing is weird. That needs some explanation.

See my comment #40 (comment)

will require that ArrayInterface.jl take on an ADTypes.jl dependency

Either that or the coloring names are dropped from ArrayInterface altogether? See my comment #40 (comment)

gdalle · 2024-04-13T10:02:31Z

its ColoringAlgorithm needs a @deprecate ColoringAlgorithm = ADTypes.AbstractColoringAlgorithm

I can rename AbstactColoringAlgorithm to ColoringAlgorithm to minimize the refactoring work

fast_matrix_colors and matrix_colors live there since they are part of the extended sparse matrix interface,

Not sure how to deal with those though. I can also rename column_coloring and row_coloring but these are different algorithms, and I don't see a similar toggle in matrix_colors?

ChrisRackauckas · 2024-04-13T10:04:43Z

I can rename AbstactColoringAlgorithm to ColoringAlgorithm to minimize the refactoring work

It's at least contained to two repos, so it's not too big of a deal if it's deprecated properly.

Not sure how to deal with those though. I can also rename column_coloring and row_coloring but these are different algorithms, and I don't see a similar toggle in matrix_colors?

Yeah that's mostly by adjoint. But the bigger thing would be updating all of the downstream analytical solutions to have these functions as well.

gdalle · 2024-04-13T10:08:20Z

But the bigger thing would be updating all of the downstream analytical solutions to have these functions as well.

If we keep the name matrix_colors and import ADTypes into ArrayInterface, there is nothing to change at all. But that would mean having only one coloring function, and if I want the row coloring I take the column coloring of the transpose?

ChrisRackauckas · 2024-04-13T10:11:20Z

But that would mean having only one coloring function, and if I want the row coloring I take the column coloring of the transpose?

That's what we currently do, but if you're going to setup general sparse AD then it's not a good idea since you won't easily know if someone passes a coloring pattern if it should be A or A'. So I like the idea of two functions, but the work of updating all of the analytical solutions in the extensions to the two functions needs to be done as well.

gdalle · 2024-04-13T10:16:21Z

I like the idea of two functions, but the work of updating all of the analytical solutions in the extensions to the two functions needs to be done as well.

I agree. Maybe ArrayInterface can keep defining matrix_colors (based on ADTypes.column_coloring), and that way as a transition we can just copy-paste the following lines in the downstream packages:

ADTypes.column_coloring(M) = ArrayInterface.matrix_colors(M)
ADTypes.row_coloring(M) = ArrayInterface.matrix_colors(M')

without changing the names in the actual function definitions for the time being

gdalle · 2024-04-13T10:16:53Z

Thanks for the comments! I'll update the PR with a trait mechanism and ping you again for validation if that's okay?

ChrisRackauckas · 2024-04-13T10:34:29Z

I agree. Maybe ArrayInterface can keep defining matrix_colors (based on ADTypes.column_coloring), and that way as a transition we can just copy-paste the following lines in the downstream packages:

Let's do the clean break. Such a clean break is the kind of thing that needs someone motivated in order to say yes. I don't see you stopping the DI push, so let's do it right. v0.3.0 here to signal the break, update the downstream in ArrayInterface and SparseDiffTools, and get sparse DI to do things the best way possible, and then integrate it into everything like NonlinearSolve.jl and DifferentialEquations.jl. You've got gusto and we've been sitting here waiting for something like DI to finally fix some of these remaining issues, so let's just pull the trigger and do it.

codecov · 2024-04-13T17:09:14Z

Welcome to Codecov 🎉

Once you merge this PR into your default branch, you're all set! Codecov will compare coverage reports and display results in all future pull requests.

Thanks for integrating Codecov - We've got you covered ☂️

gdalle · 2024-04-13T18:28:19Z

Not ready yet but close

gdalle · 2024-04-14T05:43:28Z

@ChrisRackauckas I have implemented the trait mechanism and used the opportunity for other small breaking changes in the concrete structs. All of it is summarized in the first comment, which I have edited: #40 (comment)
What do you think?

gdalle · 2024-04-15T05:27:16Z

For testing, while it's a hassle to add Diffractor and Zygote to the dependencies, I did perform the following check of my mode method definitions in the ChainRulesCore extension, so I'm reasonably confident:

julia> import Diffractor, Zygote

julia> using ADTypes  # on the PR branch

julia> ADTypes.mode(AutoChainRules(; ruleconfig=Zygote.ZygoteRuleConfig()))
ADTypes.ReverseMode()

julia> ADTypes.mode(AutoChainRules(; ruleconfig=Diffractor.DiffractorRuleConfig()))
ADTypes.ForwardOrReverseMode()

julia> @which ADTypes.mode(AutoChainRules(; ruleconfig=Zygote.ZygoteRuleConfig()))
mode(::AutoChainRules{RC}) where RC<:(ChainRulesCore.RuleConfig{>:ChainRulesCore.HasReverseMode})
     @ ADTypesChainRulesCoreExt ~/.julia/packages/ADTypes/bOklB/ext/ADTypesChainRulesCoreExt.jl:16

julia> @which ADTypes.mode(AutoChainRules(; ruleconfig=Diffractor.DiffractorRuleConfig()))
mode(::AutoChainRules{RC}) where RC<:(ChainRulesCore.RuleConfig{>:Union{ChainRulesCore.HasForwardsMode, ChainRulesCore.HasReverseMode}})
     @ ADTypesChainRulesCoreExt ~/.julia/packages/ADTypes/bOklB/ext/ADTypesChainRulesCoreExt.jl:22

Of course there is no ChainRules-compatible backend with a forward-only RuleConfig, so we'll never hit the third method

gdalle · 2024-04-15T05:30:32Z

I think we're there, the last few things to iron out are:

Should we really do the mode definition ourselves, or leave it to users?
If we define the mode, we need extensions for ChainRulesCore and EnzymeCore (fairly lightweight). Should we
- bump Julia compat to 1.9 to make sure they remain extensions everywhere
- accept that they are unconditional dependencies on <1.9
- use Requires instead but it's useless on >=1.9 (I don't like this one cause at the moment the package is deps-free on >=1.9 and we should keep it that way)
Should we bump ADTypes version to 1.0?
Should we leave JET in the test suite, even though it errors on nightly? It does a great job of catching typos, and I made it so that the badge would still be green if only nightly fails

ChrisRackauckas · 2024-04-15T11:11:54Z

If we define the mode, we need extensions for ChainRulesCore and EnzymeCore (fairly lightweight). Should we

We should define mode and at least bump to v1.9. I'd say just go to v1.10 knowing it's the next LTS and skip all of the other pain, there's other stuff to worry about in life.

Should we bump ADTypes version to 1.0?

Yes

Should we leave JET in the test suite, even though it errors on nightly? It does a great job of catching typos, and I made it so that the badge would still be green if only nightly fails

We just shouldn't run nightly, it's not for humans. We can set the new prerelease branch when it's ready, but since nightly isn't for packages (which is why the prerelease is being made) it's a waste of our time to be trying it.

Co-authored-by: Christopher Rackauckas <accounts@chrisrackauckas.com>

gdalle · 2024-04-15T11:21:18Z

I'd say just go to v1.10 knowing it's the next LTS and skip all of the other pain, there's other stuff to worry about in life.

Done.

We just shouldn't run nightly, it's not for humans.

Removed nightly from CI

gdalle · 2024-04-15T11:45:31Z

Once I get an approving review I'll merge, but I'd like to do two things before we register v1:

ask for feedback, either on Slack or Discourse
test it with DifferentiationInterface to make sure we didn't forget something essential

ChrisRackauckas · 2024-04-15T12:26:10Z

Yes no need to release right away, let's get downstream all set and make sure everyone is bought into this form.

Vaibhavdixit02

🎉🎉

Restructure sparse backends

02d6c58

gdalle requested review from Vaibhavdixit02 and ChrisRackauckas April 12, 2024 06:30

gdalle linked an issue Apr 12, 2024 that may be closed by this pull request

Change handling of sparse backends #38

Closed

gdalle mentioned this pull request Apr 12, 2024

Add Codecov action #41

Closed

TODO

1a77d59

gdalle removed the request for review from ChrisRackauckas April 12, 2024 15:37

Add utilities for working with sparsity detectors and coloring algori…

479e6e4

…thms

gdalle requested a review from ChrisRackauckas April 13, 2024 05:58

Deprecate old AutoSparse...

b1ae98a

ChrisRackauckas reviewed Apr 13, 2024

View reviewed changes

src/abstract.jl Outdated Show resolved Hide resolved

ChrisRackauckas reviewed Apr 13, 2024

View reviewed changes

src/sparse.jl Show resolved Hide resolved

ChrisRackauckas reviewed Apr 13, 2024

View reviewed changes

src/sparse.jl Outdated Show resolved Hide resolved

Remove subtyping, replace by traits

fe11e33

Finalize

35b57fa

gdalle changed the title ~~Restructure sparse backends~~ Restructure sparse backends and replace subtyping by traits Apr 14, 2024

gdalle requested a review from ChrisRackauckas April 14, 2024 05:44

gdalle added 6 commits April 15, 2024 07:05

Bump Julia compat to 1.9

866b625

Unbump

e331753

Avoid JET test on nightly

5970465

Cleaner version bounds for tests

34e73e0

Fix ambiguity

b745751

Fix bracket

5ad325e

gdalle requested review from Vaibhavdixit02 and ChrisRackauckas April 15, 2024 05:34

gdalle and others added 4 commits April 15, 2024 13:16

Update Project.toml

8623f64

Co-authored-by: Christopher Rackauckas <accounts@chrisrackauckas.com>

Jumpa compat 1.9, no test on nightly

7585c5b

Julia 1.10 compat

1149fe7

Remove JET version check

bc5edc4

This was linked to issues Apr 15, 2024

What is the right way to specify the Enzyme mode? #24

Closed

Add Codecov action #41

Closed

ChrisRackauckas approved these changes Apr 15, 2024

View reviewed changes

Vaibhavdixit02 approved these changes Apr 15, 2024

View reviewed changes

gdalle merged commit 47a7ec6 into main Apr 15, 2024
5 checks passed

Vaibhavdixit02 deleted the gd/sparse_overhaul branch April 15, 2024 13:15

This was referenced Apr 16, 2024

Issues & PRs in other repos JuliaDiff/DifferentiationInterface.jl#99

Closed

Add ADTypes 1.0 compat SciML/SciMLBase.jl#674

Merged

Support older Julia versions? #47

Closed

Support Julia 1.6 (again) #48

Merged

This was referenced May 1, 2024

Switch to DifferentiationInterface tpapp/LogDensityProblemsAD.jl#29

Closed

Support Julia 1.6 for Turing and others #52

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restructure sparse backends and replace subtyping by traits #40

Restructure sparse backends and replace subtyping by traits #40

gdalle commented Apr 12, 2024 •

edited

Loading

ChrisRackauckas commented Apr 13, 2024

ChrisRackauckas commented Apr 13, 2024

gdalle commented Apr 13, 2024

gdalle commented Apr 13, 2024 •

edited

Loading

ChrisRackauckas commented Apr 13, 2024

gdalle commented Apr 13, 2024

ChrisRackauckas commented Apr 13, 2024

gdalle commented Apr 13, 2024 •

edited

Loading

gdalle commented Apr 13, 2024 •

edited

Loading

ChrisRackauckas commented Apr 13, 2024

codecov bot commented Apr 13, 2024

gdalle commented Apr 13, 2024

gdalle commented Apr 14, 2024

gdalle commented Apr 15, 2024 •

edited

Loading

gdalle commented Apr 15, 2024 •

edited

Loading

ChrisRackauckas commented Apr 15, 2024

gdalle commented Apr 15, 2024

gdalle commented Apr 15, 2024

ChrisRackauckas commented Apr 15, 2024

Vaibhavdixit02 left a comment

Restructure sparse backends and replace subtyping by traits #40

Restructure sparse backends and replace subtyping by traits #40

Conversation

gdalle commented Apr 12, 2024 • edited Loading

Motivation

Breaking API changes

Non-breaking API changes

Other changes

Checklist

Questions

ChrisRackauckas commented Apr 13, 2024

ChrisRackauckas commented Apr 13, 2024

gdalle commented Apr 13, 2024

gdalle commented Apr 13, 2024 • edited Loading

ChrisRackauckas commented Apr 13, 2024

gdalle commented Apr 13, 2024

ChrisRackauckas commented Apr 13, 2024

gdalle commented Apr 13, 2024 • edited Loading

gdalle commented Apr 13, 2024 • edited Loading

ChrisRackauckas commented Apr 13, 2024

codecov bot commented Apr 13, 2024

Welcome to Codecov 🎉

gdalle commented Apr 13, 2024

gdalle commented Apr 14, 2024

gdalle commented Apr 15, 2024 • edited Loading

gdalle commented Apr 15, 2024 • edited Loading

ChrisRackauckas commented Apr 15, 2024

gdalle commented Apr 15, 2024

gdalle commented Apr 15, 2024

ChrisRackauckas commented Apr 15, 2024

Vaibhavdixit02 left a comment

Choose a reason for hiding this comment

gdalle commented Apr 12, 2024 •

edited

Loading

gdalle commented Apr 13, 2024 •

edited

Loading

gdalle commented Apr 13, 2024 •

edited

Loading

gdalle commented Apr 13, 2024 •

edited

Loading

gdalle commented Apr 15, 2024 •

edited

Loading

gdalle commented Apr 15, 2024 •

edited

Loading