Static broadcast #149
Conversation
Force-pushed from 8d9c142 to 7dc4035
Codecov Report

```
@@            Coverage Diff             @@
##             main     #149      +/-   ##
==========================================
- Coverage   79.98%   79.97%   -0.01%
==========================================
  Files         169      169
  Lines       44607    44619      +12
  Branches     9426     9431       +5
==========================================
+ Hits        35678    35685       +7
- Misses       6738     6741       +3
- Partials     2191     2193       +2
```
Looks good. I think we should go ahead and revert the Python and C perform code to fail with dynamic broadcasting like it did before (with a better error message than it used to have). That will highlight any functionality that may still be implicitly depending on dynamic broadcasting.
In addition, we should add the gradient broadcasting tests that showed up in the original issue in the Aesara repo.
Related, but not necessarily in this PR: we also have to disable the dynamic broadcasting that's done by RandomVariables. We should at least open an issue to track that.
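A minimal sketch of the behavior being requested here, assuming hypothetical names (`elemwise_perform_check`, `static_bcast` are illustrations, not PyTensor API): `perform` should reject runtime (dynamic) broadcasting with a clear error, while axes statically declared broadcastable stay allowed.

```python
import numpy as np

def elemwise_perform_check(inputs, static_bcast):
    # Sketch only: fail on runtime broadcasting unless the axis was
    # statically declared broadcastable at graph-construction time.
    shapes = [np.asarray(x).shape for x in inputs]
    for axis in range(len(shapes[0])):
        dims = {s[axis] for s in shapes}
        if len(dims) == 1:
            continue  # all inputs agree on this axis
        if 1 in dims and not static_bcast[axis]:
            raise ValueError(
                f"Runtime broadcasting on axis {axis} is not allowed; "
                "declare the dimension as statically 1 instead."
            )
        if 1 not in dims:
            raise ValueError(
                f"Input dimension mismatch on axis {axis}: {sorted(dims)}"
            )

# Statically declared broadcastable axis 0: accepted.
elemwise_perform_check([np.ones((1, 3)), np.ones((4, 3))], static_bcast=(True, False))
```

With `static_bcast=(False, False)` the same call would raise, which is exactly the signal that some code path was silently relying on dynamic broadcasting.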
Force-pushed from 7dc4035 to 22b436b
This PR is a Pandora's box. There were a lot of small fixes to the new broadcasting. I still find places where I'm unsure gradients will propagate correctly with dynamic broadcasting: https://github.com/aesara-devs/aesara/pulls?page=2&q=is%3Apr+broadcast+is%3Aclosed One of the concerns so far is the way infer shape works in:
```python
try:
    out_broadcastable = tuple(all(bcast) for bcast in zip(*broadcast_patterns))
except ValueError as e:
    raise ValueError(
        "Incompatible Elemwise input broadcasting pattern: "
```
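Taken in isolation, the pattern combination above works like this (`broadcast_patterns` here is made-up example data): an output axis is broadcastable only when it is broadcastable in every input.

```python
# Example data: per-input broadcastable flags for a 2-d Elemwise.
broadcast_patterns = [(True, False), (True, True)]

# zip(*...) groups the flags per axis; `all` requires unanimity.
out_broadcastable = tuple(all(bcast) for bcast in zip(*broadcast_patterns))

print(out_broadcastable)  # (True, False)
```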
How about something like: "Incompatible Elemwise input broadcasting: broadcasting is only allowed if the shape of the broadcasted axis is statically known to be one. Use `input.specify_shape` to inform PyTensor that a shape is 1."
I don't think we have to explain why we do it like this in the error message. We could also add an FAQ entry and link to that.
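A minimal sketch of the rule that message describes, in pure Python rather than PyTensor code (unknown dimensions are written as `None`, and the helper treats them conservatively): broadcasting along an axis is only accepted when that axis is statically known to be 1.

```python
def may_broadcast(static_shape_a, static_shape_b):
    # Hypothetical helper: return the combined static shape if the two
    # shapes are compatible under *static* broadcasting, else None.
    out = []
    for da, db in zip(static_shape_a, static_shape_b):
        if da == db:
            out.append(da)
        elif da == 1:      # statically known to be 1: broadcast allowed
            out.append(db)
        elif db == 1:
            out.append(da)
        else:
            return None    # mismatched or unknown (None) dims: rejected
    return tuple(out)

assert may_broadcast((1, 3), (4, 3)) == (4, 3)   # static 1 broadcasts
assert may_broadcast((None, 3), (4, 3)) is None  # unknown dim: rejected
```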
Yeah, I was afraid that might be the case. I think we should go ahead and merge the cases we know about, instead of waiting a long time to try and find everything in the first PR. It would be great if we had some way of testing this automatically, but I don't really know how that would work...
We need to revise `type.filter_variable`, `type.is_super`, and `type.in_same_class` to consider broadcasting flags. Those are called during rewrites to make sure the replacement types are compatible with the original types and/or to apply some simple operations if that would make the types equivalent (e.g., add a `specify_shape`, and now perhaps an `Unbroadcast`).
Also, we should check whether we can prevent dynamic broadcasting in the JAX dispatch of Elemwise. That doesn't need to happen in this PR, but we should confirm it can indeed be done.
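A toy illustration of the kind of subtype check involved, where static shapes use `None` for unknown dimensions (a hypothetical helper, not the real `Type.is_super`):

```python
def is_super_shape(general, specific):
    # `general` is a supertype of `specific` if every dimension it pins
    # down (i.e. that is not None) matches exactly -- including dims
    # pinned to 1, which is where the broadcasting flags come in.
    return all(g is None or g == s for g, s in zip(general, specific))

assert is_super_shape((None, 3), (4, 3))    # less specific: ok
assert not is_super_shape((1, 3), (4, 3))   # pinned broadcastable dim must match
```

A rewrite replacing a variable of the specific type with one of the general type would then insert a `specify_shape`-like node to recover the lost static information.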
Force-pushed from 692542d to 885ff0c
I've changed that. Seems like rewrites are important to check now. UPD: replacements indeed get broken.
As I go through the small fixes, more problems arise. While fixing the variable filtering, I just discovered:
```c
}
if (%(lv{j0})s_n{x0} != %(lv{j})s_n{x})
{{
    PyErr_Format(PyExc_ValueError,
                 "Input dimension mismatch; implicit broadcasting is not supported. "
                 "(input[%%i].shape[%%i] = %%lld, input[%%i].shape[%%i] = %%lld)",
```
It may be worth making the error conditional on the NumPy broadcasting case. In that case we can say it's not supported, link to the FAQ, and so on. In the case of a mismatch without a shape of 1, we should keep the vanilla error message. That is the error most users will be hitting as long as C is the default backend.
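What that conditional message could look like, sketched in Python rather than the C template above (`mismatch_error` is a hypothetical helper):

```python
def mismatch_error(axis, dims):
    # Mention runtime broadcasting only when a size-1 dim is actually
    # involved; otherwise keep the vanilla mismatch message.
    if 1 in dims:
        return (f"Input dimension mismatch on axis {axis} ({sorted(dims)}): "
                "runtime broadcasting is not supported; see the FAQ.")
    return f"Input dimension mismatch on axis {axis}: {sorted(dims)}"

assert "broadcasting" in mismatch_error(0, {1, 4})      # size-1 dim: FAQ hint
assert "broadcasting" not in mismatch_error(0, {3, 4})  # plain mismatch
```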
Force-pushed from 4971ac3 to 5facd79
The issue that pops up seems to have been introduced when refactoring the Alloc rewrites here: https://github.com/aesara-devs/aesara/pull/1102/files
Yes, it makes sense to revert those changes. The rewrite originally followed the broadcastable conventions, as you can see from the original issue aesara-devs/aesara#1094. It no longer respects them because it was rewritten to support dynamic broadcasting.
Yeah, the changes are in this specific commit.
Force-pushed from 5facd79 to e5b5294
@ricardoV94 @ferrine the conflict in
It will be fun to go back to this PR; a lot of rebase conflicts...
I think it might work better to take it piece by piece, maybe without attempting a direct git revert. I'll try to spin off a PR to reintroduce it for Elemwise. We can leave the Blas Ops for a later PR: #372
Closing, as we already made some progress elsewhere.
Motivation for these changes
Dynamic broadcasting creates tremendous graph obfuscation. While it is not visible in the forward pass, the backward pass always has to check whether broadcasting happened or not. That may sound simple, but it still creates 2^n if/else statements. Originally, Theano had static broadcasting, and this PR restores the same behavior.
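A NumPy sketch of why the backward pass branches: for `z = x + y`, the gradient with respect to `x` must sum-reduce the upstream gradient along every axis where `x` was broadcast at runtime, so each axis is an independent if/else (2^n combinations over n axes). The helper name is illustrative, not PyTensor code.

```python
import numpy as np

def grad_of_add_wrt_x(x_shape, upstream):
    # With dynamic broadcasting, each axis needs its own runtime branch:
    # was it broadcast (sum-reduce the gradient) or not (pass through)?
    g = upstream
    for axis, dim in enumerate(x_shape):
        if dim == 1 and upstream.shape[axis] != 1:
            g = g.sum(axis=axis, keepdims=True)  # this axis was broadcast
    return g

g = grad_of_add_wrt_x((1, 3), np.ones((4, 3)))
print(g.shape)  # (1, 3): reduced over the broadcast axis
```

With static broadcasting, the branch condition is known at graph-construction time and the if/else explosion disappears from the compiled graph.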
Related Issues and PRs
- `Elemwise.c_code`: aesara-devs/aesara#928
- `Type` inference in `RandomVariable` and `Scan`: aesara-devs/aesara#1253
- `TensorType.broadcastable` usage from `local_elemwise_alloc`: aesara-devs/aesara#1102

Implementation details
Checklist
Major / Breaking Changes
New features
Bugfixes
Documentation
Maintenance