fix #31758: out of bounds write in sparse broadcast #31763

fredrikekre · 2019-04-18T17:24:38Z

No description provided.

mbauman · 2019-04-18T18:07:26Z

Since it took a bit to figure out, here's Fredrik's and my understanding of what this code means to do: it's trying to get a good upper bound for the number of remaining stored entries in the array. We're in the zero-preserving case, so anywhere that all the arguments have zeros, the function evaluation will return zero. Computing the exact union of stored positions across all the arguments is expensive, so this (over)estimates it by simply taking the total number of nonzeros in all arguments; that's _sumnnzs(As...). We can further refine that estimate because we're partway through the algorithm: we know how many total stored values we've seen in the arguments and how many stored values they ended up creating. So we add in the total number of values we've stored so far (Ck) and subtract out the total number of stored values in all the arguments that led to that result (sum(ks) - N, where the - N accounts for the 1-based indices across the N arrays). Of course this could be more than the total possible number of elements, so we ensure it's no larger than length(C) with a min.

Put that all together and you can see that the entire estimate should go inside the same argument of that min call.
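As a rough illustration of the arithmetic described above, here is a minimal sketch in Julia. It is not the code from the patch: `storage_bound` and its parameter names are hypothetical, and the counts are passed in as plain integers rather than read from sparse arrays.

```julia
# Hypothetical helper, for illustration only (not the code from this PR).
#   lenC - length(C), the total number of elements in the result
#   Ck   - number of values stored in the result so far
#   nnzs - stored-entry counts of the N arguments (what _sumnnzs would add up)
#   ks   - current 1-based positions into each argument's stored entries
function storage_bound(lenC::Integer, Ck::Integer, nnzs, ks)
    N = length(ks)
    consumed = sum(ks) - N                 # stored entries already visited in the arguments
    estimate = Ck + sum(nnzs) - consumed   # stored so far + stored entries still to visit
    return min(lenC, estimate)             # the whole estimate sits inside `min`
end

storage_bound(100, 5, (10, 20), (4, 7))   # 5 + 30 - 9 = 26, below length(C)
storage_bound(20, 15, (10, 20), (4, 7))   # 15 + 30 - 9 = 36, clamped to 20
```

In the second call, clamping only the nonzero count would give 15 + min(20, 30) - 9 = 26, which exceeds the 20 elements the result can hold. That is why the full expression belongs inside the `min`.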

mbauman · 2019-04-18T18:10:37Z

Given that this often over-allocates, I wonder how much more efficient (or not) a simple push! based algorithm would be. We also wondered how much @inbounds actually affects performance in such a complicated algorithm. But those can be investigations for another day. This patch is straightforward, and let's not derail it by looking into other possible algorithms.

(cherry picked from commit c0c6f96)

Sacha0 · 2019-04-20T18:22:49Z

Good catch! :)

> We also wondered how much @inbounds actually affects performance in such a complicated algorithm.

Depends on which @inbounds you are thinking of; some of them impact performance substantially. I have a now-stale perf test suite for this code somewhere that I've long wanted to package up for nanosoldier in that mythical thing that is free time... 😄

KristofferC · 2019-04-20T19:17:03Z

I'm surprised that the inbounds annotations have a large effect since there are almost always conditionals and non propagate_inbounds function calls in the loop body. Do you recall any such example?
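For readers unfamiliar with the distinction drawn here, a minimal sketch of how `@inbounds` interacts with helper functions. The accessors `getplain` and `getprop` and the driver `sumwith` are hypothetical names made up for this illustration; they are not taken from the sparse broadcast code, and the sketch makes no claim about measured performance.

```julia
# Hypothetical accessors, for illustration only.
# Plain helper: per the Julia manual, the caller's @inbounds is propagated
# through only one layer of inlining, so the bounds check inside `v[i]`
# is not expected to be removed here.
getplain(v, i) = v[i]

# Marked helper: the caller's @inbounds context is passed through to `v[i]`.
Base.@propagate_inbounds getprop(v, i) = v[i]

function sumwith(f, v)
    s = zero(eltype(v))
    @inbounds for i in eachindex(v)
        s += f(v, i)   # the check inside `f` is elided only for the marked helper
    end
    return s
end

sumwith(getplain, [1, 2, 3])
sumwith(getprop, [1, 2, 3])
```

Both calls return the same sum; any difference would show up only in whether bounds checks remain in the generated code.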

Sacha0 · 2019-04-20T19:37:54Z

I recall perf measurements benefitting from certain inbounds annotations, but what with the many intervening months cannot recall precisely which measurements and annotations 🤷‍♀️. IIRC I started without the annotations, but found that I needed at least some subset of them to close the gap between the then-new generic implementations and the preexisting specialized implementations. Best!

fix #31758: out of bounds write in sparse broadcast

d2d5dc3

fredrikekre added the sparse (Sparse arrays), bugfix (This change fixes an existing bug), and broadcast (Applying a function over a collection) labels Apr 18, 2019

fredrikekre requested review from mbauman and Sacha0 April 18, 2019 17:24

fredrikekre added backport 1.0 labels Apr 18, 2019

mbauman approved these changes Apr 18, 2019

fredrikekre merged commit c0c6f96 into master Apr 19, 2019

fredrikekre deleted the fe/bc-sparse branch April 19, 2019 07:36

This was referenced Apr 19, 2019

Backports for 1.0.4 #30954

Merged

Backports for 1.2-RC1 #31727

Closed

KristofferC pushed a commit that referenced this pull request Apr 20, 2019

fix #31758: out of bounds write in sparse broadcast (#31763)

b0187e8

(cherry picked from commit c0c6f96)

KristofferC pushed a commit that referenced this pull request Apr 20, 2019

fix #31758: out of bounds write in sparse broadcast (#31763)

031c35c

(cherry picked from commit c0c6f96)

KristofferC pushed a commit that referenced this pull request Apr 20, 2019

fix #31758: out of bounds write in sparse broadcast (#31763)

11b8f59

(cherry picked from commit c0c6f96)

JeffBezanson removed the backport 1.0 label Jun 6, 2019

KristofferC removed the backport 1.2 label Jun 9, 2019

KristofferC pushed a commit that referenced this pull request Feb 20, 2020

fix #31758: out of bounds write in sparse broadcast (#31763)

4f4384a

(cherry picked from commit c0c6f96)
