aliasing and sparse broadcast #21693

Sacha0 · 2017-05-03T20:34:38Z

Sparse broadcast does not yet check whether the input arguments alias the output arguments, causing e.g.

julia> versioninfo()
Julia Version 0.7.0-DEV.16
Commit 5f296f3 (2017-05-03 18:24 UTC)
Platform Info:
  OS: macOS (x86_64-apple-darwin15.6.0)
  CPU: Intel(R) Core(TM) i7-3520M CPU @ 2.90GHz
  WORD_SIZE: 64
  BLAS: libopenblas (USE64BITINT DYNAMIC_ARCH NO_AFFINITY Sandybridge)
  LAPACK: libopenblas64_
  LIBM: libopenlibm
  LLVM: libLLVM-3.9.1 (ORCJIT, ivybridge)

julia> A = sprandn(10, 10, 0.1)
10×10 SparseMatrixCSC{Float64,Int64} with 12 stored entries:
  [3 ,  1]  =  -0.270259
  [1 ,  3]  =  0.555567
  [3 ,  3]  =  -0.360022
  [9 ,  3]  =  0.567621
  [10,  5]  =  0.638384
  [4 ,  6]  =  1.75078
  [5 ,  6]  =  0.138524
  [1 ,  8]  =  1.1261
  [4 ,  9]  =  1.3175
  [3 , 10]  =  1.72912
  [7 , 10]  =  1.27993
  [8 , 10]  =  -0.422809

julia> b = randn(10);

julia> broadcast!(/, A, A, b)
10×10 SparseMatrixCSC{Float64,Int64} with 100 stored entries:
  [1 ,  1]  =  NaN
  [2 ,  1]  =  NaN
  [3 ,  1]  =  NaN
  [4 ,  1]  =  NaN
  [5 ,  1]  =  NaN
  [6 ,  1]  =  NaN
  [7 ,  1]  =  NaN
  [8 ,  1]  =  NaN
  [9 ,  1]  =  NaN
  [10,  1]  =  NaN
  [1 ,  2]  =  NaN
  [2 ,  2]  =  NaN
  [3 ,  2]  =  NaN
  [4 ,  2]  =  NaN
  [5 ,  2]  =  NaN
  [6 ,  2]  =  NaN
  [7 ,  2]  =  NaN
  [8 ,  2]  =  NaN
  ⋮
  [2 ,  9]  =  NaN
  [3 ,  9]  =  NaN
  [4 ,  9]  =  NaN
  [5 ,  9]  =  NaN
  [6 ,  9]  =  NaN
  [7 ,  9]  =  NaN
  [8 ,  9]  =  NaN
  [9 ,  9]  =  NaN
  [10,  9]  =  NaN
  [1 , 10]  =  NaN
  [2 , 10]  =  NaN
  [3 , 10]  =  NaN
  [4 , 10]  =  NaN
  [5 , 10]  =  NaN
  [6 , 10]  =  NaN
  [7 , 10]  =  NaN
  [8 , 10]  =  NaN
  [9 , 10]  =  NaN
  [10, 10]  =  NaN

Best!

The text was updated successfully, but these errors were encountered:

stevengj · 2017-05-04T20:51:57Z

Maybe for now it should just throw an error? Seems like this would be difficult to make work unless the sparsity pattern is not changed by the broadcast.

Sacha0 · 2017-05-04T21:50:47Z

Seems like this would be difficult to make work unless the sparsity pattern is not changed by the broadcast.

Agreed, strict in place operation with input-output aliasing doesn't seem possible in general.

Maybe for now it should just throw an error?

My thinking as well. Making copies of the input arguments that alias the output argument would be the alternative / a potential future improvement. Best!

jebej · 2017-06-19T13:20:06Z

This should really throw an error instead of silently giving the wrong result.

Due to a bug in Base Julia: JuliaLang/julia#21693 the in-place operation doesn't work.

nalimilan · 2018-01-18T08:26:48Z

It would be nice to do something to prevent this bug from happening in 1.0. It gives a very bad impression of the language's safety. Making a copy of the input when it's === to the output would at least protect from corruption.

ChrisRackauckas · 2018-01-22T07:36:37Z

I think error is the better way to do safety here than copying. One normally uses .= to avoid copies, so it would lead to some hard to debug allocations when you assume that the operation won't copy. On the other hand, an error that says "you can't do aliasing here" is pretty clear and tells the programmer exactly what they need to change.

nalimilan · 2018-01-22T08:55:26Z

The problem is, the person who wrote the .= line (possibly deep in a call stack) might not have anticipated to get a sparse matrix, so if we raise an error the user who passed a sparse matrix wouldn't have any choice other than making it dense, which isn't generally possible (or very costly).

martinholters · 2018-01-22T16:46:29Z

Yes, an implementation that does a potentially unnecessary allocation seems way more useful than one that just errors. (But yea, the latter is still better than one that silently gives wrong results.)

ViralBShah · 2018-01-23T08:25:13Z

Does this have to block 1.0 given that sparse is now in stdlib?

nalimilan · 2018-01-23T08:57:34Z

Since it's a corruption bug, it can be fixed at any time, even if it was in Base, but it doesn't sound very serious to me to release the 1.0 version of a technical computing language with such a dangerous bug.

StefanKarpinski · 2018-01-23T15:01:06Z

I don't really see what the controversy here is. Why can't we check for aliasing and make a copy when we detect it? Surely making a copy is better than getting complete garbage? If possible, we can improve the performance in the future, but for now giving a correct answer seems like a no-brainer.

ViralBShah · 2018-01-23T16:42:53Z

Seems like there is no controversy. Just needs to get done.

StefanKarpinski · 2018-01-23T16:50:58Z

Ok, since this is a bug, it's not a feature freeze blocker, but we really should get it done for 1.0.

…ignment (#25890) This commit puts together the beginnings of an "interface" for detecting and avoiding aliasing between arrays before performing some mutating operation on one of them. `A′ = unalias(dest, A)` is the main entry point; it checks to see if `dest` and `A` might alias one another, and then either returns `A` or a copy of `A`. When it returns a copy of `A`, it absolutely must preserve the same type. As such, this function calls an internal `unaliascopy` function, which defaults to `copy` but ensures it returns the same type. `mightalias(A, B)` is a quick and conservative check to see if two arrays alias eachother. Instead of requiring every array to know about every other array type, it simply asks both arrays for their `dataids` and then looks for an empty intersection. Finally, `dataids(A)` returns a tuple of `UInt`s that describe something about the region of memory the array occupies. It defaults to simply `(objectid(A),)`. A custom implementation for an array with multiple fields of arrays would look something like `(dataids(A.a)..., dataids(A.b)..., ...)`. Fixes #21693, fixes #15209, and a pre-req to get tests passing for #24368 (since we had spot checks for `Array` aliasing, but moving to broadcast means we need a slightly more generic solution). These functions remain un-exported for now since I'd like us to use them a bit more before we officially support them for all custom arrays.

Sacha0 · 2018-02-27T19:22:53Z

Much thanks Matt! 🎉 :)

ChrisRackauckas · 2018-02-27T19:29:11Z

So does a .= b now perform a copy if the two arrays alias? I'm confused by what the effects of this are on slightly different cases than the OP.

mbauman · 2018-02-27T19:33:42Z

Yes, that's correct, with the slight caveat that it will not create a copy in the case where a === b, which is relevant (and important) for a .+= 2. In that case — and that case alone — we know that the iteration order within broadcast will not cause any spooky action at a distance.

If, however, b is a view of a (or any parts of a), then b could re-arrange the indices, changing the access pattern, and causing corruption.

jebej · 2018-02-27T19:57:26Z

What is the difference now between using broadcast with sparse arrays vs not using it?
Using dots allows fusion with regular arrays, but that doesn't seem easily doable with sparse arrays given the sparsity.

mbauman · 2018-02-27T20:01:52Z

That's an awfully broad question. I'm not sure I fully understand it. But sparse arrays do have specialized implementations of broadcasting that tries very hard to exploit the sparsity, and they do fuse. In-place assignment is only a big win if you already have an output array with the expected output sparsity pattern.

jebej · 2018-02-27T20:08:26Z

My question was mostly about fusion.

So something like A .+ 3.*B .- sin.(C)./2 will fuse and use less memory than without the dot. If so that's pretty cool :)

In-place assignment is only a big win if you already have an output array with the expected output sparsity pattern

Right, and this seems hard to predict without knowing the input in advance.

Sacha0 · 2018-02-27T20:13:14Z

To avoid allocation, knowing the result's storage size suffices; the pattern isn't strictly necessary. Though I am not certain under what conditions you would know the result's storage size but not its pattern :).

Sacha0 added sparse Sparse arrays broadcast Applying a function over a collection labels May 3, 2017

Sacha0 mentioned this issue Jun 2, 2017

Difference between .+= and += for sparse matrices [bug] #22183

Closed

jebej mentioned this issue Jun 19, 2017

[release-0.6] final backports for 0.6.0 #22398

Merged

4 tasks

mkborregaard added a commit to mkborregaard/Microbiome.jl that referenced this issue Dec 7, 2017

remove relativeabundance! implementation

d042a52

Due to a bug in Base Julia: JuliaLang/julia#21693 the in-place operation doesn't work.

mkborregaard added a commit to mkborregaard/Microbiome.jl that referenced this issue Dec 7, 2017

remove relativeabundance! implementation

4eee2e1

Due to a bug in Base Julia: JuliaLang/julia#21693 the in-place operation doesn't work.

yuyichao mentioned this issue Jan 17, 2018

Lost elements when .+= is used on sparse matrices #25611

Closed

StefanKarpinski added the bug Indicates an unexpected problem or unintended behavior label Jan 18, 2018

StefanKarpinski added this to the 1.0 milestone Jan 18, 2018

JeffBezanson modified the milestones: 1.0, 1.0.x Jan 23, 2018

mbauman mentioned this issue Feb 5, 2018

More thorough aliasing detection in nonscalar copy, broadcast and assignment #25890

Merged

mbauman closed this as completed in #25890 Feb 27, 2018

pablosanjose mentioned this issue Jan 14, 2022

Bug in map! for sparse matrices with aliasing JuliaSparse/SparseArrays.jl#26

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

aliasing and sparse broadcast #21693

aliasing and sparse broadcast #21693

Sacha0 commented May 3, 2017

stevengj commented May 4, 2017

Sacha0 commented May 4, 2017 •

edited

Loading

jebej commented Jun 19, 2017

nalimilan commented Jan 18, 2018

ChrisRackauckas commented Jan 22, 2018

nalimilan commented Jan 22, 2018

martinholters commented Jan 22, 2018

ViralBShah commented Jan 23, 2018

nalimilan commented Jan 23, 2018

StefanKarpinski commented Jan 23, 2018

ViralBShah commented Jan 23, 2018

StefanKarpinski commented Jan 23, 2018

Sacha0 commented Feb 27, 2018

ChrisRackauckas commented Feb 27, 2018

mbauman commented Feb 27, 2018 •

edited

Loading

jebej commented Feb 27, 2018

mbauman commented Feb 27, 2018

jebej commented Feb 27, 2018

Sacha0 commented Feb 27, 2018

aliasing and sparse broadcast #21693

aliasing and sparse broadcast #21693

Comments

Sacha0 commented May 3, 2017

stevengj commented May 4, 2017

Sacha0 commented May 4, 2017 • edited Loading

jebej commented Jun 19, 2017

nalimilan commented Jan 18, 2018

ChrisRackauckas commented Jan 22, 2018

nalimilan commented Jan 22, 2018

martinholters commented Jan 22, 2018

ViralBShah commented Jan 23, 2018

nalimilan commented Jan 23, 2018

StefanKarpinski commented Jan 23, 2018

ViralBShah commented Jan 23, 2018

StefanKarpinski commented Jan 23, 2018

Sacha0 commented Feb 27, 2018

ChrisRackauckas commented Feb 27, 2018

mbauman commented Feb 27, 2018 • edited Loading

jebej commented Feb 27, 2018

mbauman commented Feb 27, 2018

jebej commented Feb 27, 2018

Sacha0 commented Feb 27, 2018

Sacha0 commented May 4, 2017 •

edited

Loading

mbauman commented Feb 27, 2018 •

edited

Loading