Error in A_ldiv_B! with sparse matrix #7488

CodeLenz · 2014-07-01T18:50:30Z

Hi.

I just found a bug in A_ldiv_B! when used with a sparse matrix
factored with lufact!

The error is trigered with

A = spzeros(3,3)
A[1,1] = 1
A[2,2] = 1
A[3,3] = 1
b = ones(3)

lufact!(A)

A\b  --> trigs the error.

ERROR: BoundsError()
in istriu at sparse/sparsematrix.jl:1748
in unsafe_copy! at array.jl:41
in A_ldiv_B! at linalg/sparse.jl:209

julia> versioninfo()
Julia Version 0.3.0-prerelease+3952
Commit b2fd3af (2014-06-30 01:39 UTC)
Platform Info:
System: Windows (x86_64-w64-mingw32)
CPU: Intel(R) Core(TM) i5-3320M CPU @ 2.60GHz
WORD_SIZE: 64
BLAS: libopenblas (USE64BITINT DYNAMIC_ARCH NO_AFFINITY)
LAPACK: libopenblas
LIBM: libopenlibm

It was working with the previous versions.

Sincerely yours,
Eduardo.

The text was updated successfully, but these errors were encountered:

ViralBShah · 2014-07-01T19:01:58Z

@JeffBezanson It was strange that the above commit fixed this bug. I wonder if this is a compiler issue.

tkelman · 2014-07-01T19:28:35Z

And this wasn't caught by any of the existing tests in test/sparse.jl, at least not on my machine (and I can reproduce the error at the same commit). Wouldn't hurt to throw in another, right?

andreasnoack · 2014-07-01T19:33:53Z

@ViralBShah I don't think that the commit changes anything for this example. With latest master I still get the BoundError.

@CodeLenz I am not sure it is a bug. When calling lufact! you destroy the input matrix which is signalled by the exclamation mark. The row and column pointers are transformed to zero base and therefore not valid in Julia anymore.

What you might want to do is either

A = speye(3)
b = ones(3)
Af = lufact!(A)
Af\b

or

A = speye(3)
b = ones(3)
lufact(A) # No exclamation mark
A\b

CodeLenz · 2014-07-01T19:51:15Z

Hi. It worked with the previous versions. As I understand, lufact!(A) stores the LU factors of
A in place and A\b makes use of it to solve the system. But I am new to Julia, so if its my mistake, I am realy sorry for the noise.

jiahao · 2014-07-01T19:53:36Z

lufact!(A) stores the LU factors of A in place and A\b makes use of it to solve the system.

If you mean to factorize A and use the factorization to solve a linear problem, you'll need to save the factorization object and call backslash on that.

julia> B=copy(A); Z=lufact!(B)
UMFPACK LU Factorization of a 3-by-3 sparse matrix
Ptr{Void} @0x00007f85ab78d8c0

julia> typeof(B) #Julia still thinks it is a sparse matrix
SparseMatrixCSC{Float64,Int64} (constructor with 2 methods)

julia> Z\b
3-element Array{Float64,1}:
 1.0
 1.0
 1.0

julia> @which Z\b #Uses the factorization object
\{TF<:Number,TB<:Number,N}(F::Factorization{TF<:Number},B::AbstractArray{TB<:Number,N}) at linalg/factorization.jl:798

julia> @which A\b #Doesn't use the factorization object
@\{TA,TB,N}(A::AbstractArray{TA,2},B::AbstractArray{TB,N}) at linalg/generic.jl:227

A\b will solve the system without using the factorization. But since lufact!(A) destroys A, there is no guarantee that A represents a valid Julia object, and so calling A\b can fail.

CodeLenz · 2014-07-01T19:57:39Z

@jiahao I see ...thanks. I will look more carefully before posting another bug report. But I have no idea why it used to work before and the ! mark lead me to this conclusion....so....my bad.

Thank you all for this amazing tool. I just found it and I am porting all my tools (FInite element and optimization) to it.

jiahao · 2014-07-01T19:58:26Z

But I have no idea why it used to work before

It could be that you were just lucky before, since the specific memory access patterns can vary a lot.

JeffBezanson · 2014-07-01T20:02:42Z

The documentation could use some extra clarification here. In the vast majority of cases, f!(A) means that the result is written into A. Instead, this is a strange kind of function that returns a result but destroys its input in the process.

jiahao · 2014-07-01T20:04:11Z

Is there a way to make it an error to access A after calling lufact!(A)? I honestly have no idea what UMFPACK does to A.

ViralBShah · 2014-07-01T20:17:24Z

I do think that we should not allow lufact!(A) where the trashed input is then unusable. @dmbates What do you think? This seems like it will cause more users to trip up, expecting dense matrix like behaviour. I don't think that documentation is sufficient for this pitfall.

andreasnoack · 2014-07-01T20:20:26Z

All of the xfact! functions work something like this. For the dense methods, the input matrix stores information about the factorisation, but it is only useful if used through the factorisation object, not the input matrix.

ViralBShah · 2014-07-01T20:21:14Z

Yes, I guess that even though the solution is returned in the input, it is not really in usable form.

mlubin · 2014-07-01T21:01:53Z

How about a test for valid indices when calling A\b?

dmbates · 2014-07-01T21:18:29Z

A simple test for valid indices is whether the first element of the colptr member is 0 or 1. To be a valid Julia matrix it should be 1. To be a valid SuiteSparse matrix it should be 0.

It turns out that the only difference in behavior between lufact and lufact! with regard to the sparse matrix is whether the function decrements the indices of the original matrix in place. I think I would agree with @ViralBShah that the best thing to do is to remove the lufact! method for sparse matrices. The sparse case is different from the dense case in that you can't overwrite a sparse matrix with its factorization in most cases. Thus the lufact! method doesn't make sense unless you count saving the allocations of the index vectors. I initially thought that would be helpful if the sparse matrix was the result of other functions and was not going to be named but, as we have seen, it just leads to confusion.

tkelman · 2014-07-01T21:23:28Z

The right (medium-level, with reusable factorizations - A\b works fine now as the high level way) way to eventually do sparse is separate symbolic and numerical factorizations. symfact only looks at sparsity structure and returns a preallocated object with enough predicted space needed for numfact!. Something like this isn't exposed yet in the umfpack interfaces, is it?

ViralBShah · 2014-07-02T04:07:45Z

I am pretty sure such a thing will be there in the umfpack interface.

tkelman · 2014-07-02T04:36:10Z

Whoops, yep, there they are in umfpack.jl. Maybe an area for future refactoring and generalization to expose those, I think it could be doable to present a unified medium-level API that all of umfpack, cholmod, pardiso, mumps, hsl, etc could work through. I think pardiso and mumps even let you specify whether your indices are 0-based or 1-based, so the input formatting question is different.

It could be almost-as-dangerous (maybe more since it doesn't error?) and confusing to accidentally misuse a dense matrix that has had lufact! called on it. For dense it's not as obvious that the array now contains factorization data since there isn't the same validity check we get with 0-vs-1 indexing in sparse. lufact! for sparse does save a little bit of copying if you don't plan on using the input matrix again while making calls to umfpack, plus it's an easily reversible operation to turn it back into a valid 1-based Julia sparse matrix (assuming you're aware of exactly what's going on behind the scenes).

As @dmbates said, "in place" is not so much a thing with sparse. You do often want multiple factorizations in a row with the same sparsity pattern but changed nzval.

ViralBShah · 2014-07-02T04:40:30Z

Yeah, I think this is the right thing to do. It is certainly doable in 0.3.

Jutho · 2014-07-02T05:49:13Z

All the xfact! methods potentially destroy the input. It is good to have them when you can afford to loose the original matrix, sparse matrix, but like a lot of the ! functions, even those that do not destroy any input (transpose!, permutedims!, A_mul_B! family), it might be possible not to export them, so as not to confuse new users.

mlubin · 2014-07-02T05:57:18Z

I think cheap error checking is an easy way to avoid confusion. Removing the ! methods also breaks generic code that can work with both dense and sparse matrices.

jiahao · 2014-07-02T21:22:59Z

Another possibility is to simply nudge the pointer values back to 1-based indexing after calling UMFPACK functions.

dmbates · 2014-07-02T21:24:09Z

@jiahao Seems reasonable.

ViralBShah · 2015-02-14T11:07:26Z

We have now removed ! methods for sparse factorizations.

ViralBShah · 2015-02-14T11:08:43Z

@andreasnoack Should we disallow cholfact! too? It gives a DomainError currently:

julia> cholfact!(sparse(rand(5,5)))


ERROR: DomainError:

 in chol! at linalg/cholesky.jl:133
 in chol! at linalg/cholesky.jl:71
 in cholfact! at linalg/cholesky.jl:98

ViralBShah · 2015-02-14T11:09:26Z

I see that it has a fallback to the generic implementation. Perhaps we need an explicit method to disallow it for sparse?

andreasnoack · 2015-02-14T11:28:03Z

I think it is safe to restrict that method to StridedMatrix. That should fix the issue, so I'll do that.

andreasnoack · 2015-02-14T11:35:16Z

Fixed in 9c72dec

tkelman · 2015-02-14T17:34:40Z

Removing the ! methods also breaks generic code that can work with both dense and sparse matrices.

This is still a potential concern. But I don't think our current cholfact! API is really generic enough to cover sparse effectively in its current form. What would probably be better for dense-sparse generality is deciding on (then implementing, testing, and documenting it properly) a good medium-level API for reusable sparse factorizations, something like symfact and numfact!. Dense can be expressed as the special-case performance optimization here, within a framework that's generic enough to do sparse correctly. symfact for dense just happens to be a no-op (or identity, returning the same array space).

andreasnoack · 2015-02-14T19:25:46Z

I agree. The generic cholfact! computes the factorization correctly, but is not useful for sparse matrices. It could be interesting to experiment with sparse factorizations in Julia, but we should probably do the experiments outside base.

ViralBShah · 2015-02-15T06:16:07Z

In case you did not already know, see: https://github.com/JuliaSparse/MultiFrontalCholesky.jl

tkelman · 2015-02-15T06:32:16Z

That's great to see a start for a pure-Julia sparse solver implementation. So far that's working at a pretty low level though, the API refinement will need to happen somewhat separately from and in parallel with the pure-Julia solver implementations. Ideally we want a design somewhat like MathProgBase where different low-level solvers (some in Julia, some in C or C++ or Fortran) can plug into a uniform set of interfaces, to present the same API to consumers (whether users directly working with sparse linear algebra, or higher-level libraries like optimization solvers).

This probably should happen outside of base to start with. Though to truly resolve JuliaLang/LinearAlgebra.jl#136 and make sparse matrices first-class citizens, the way base Julia deals with dense matrices will eventually have to adapt in order to generically handle either dense or sparse matrices for all operations.

ViralBShah added bug labels Jul 1, 2014

ViralBShah added this to the 0.3 milestone Jul 1, 2014

JeffBezanson added the regression label Jul 1, 2014

ViralBShah closed this as completed in 6e0d943 Jul 1, 2014

JeffBezanson removed regression labels Jul 1, 2014

ViralBShah reopened this Jul 1, 2014

ViralBShah closed this as completed Feb 14, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error in A_ldiv_B! with sparse matrix #7488

Error in A_ldiv_B! with sparse matrix #7488

CodeLenz commented Jul 1, 2014

ViralBShah commented Jul 1, 2014

tkelman commented Jul 1, 2014

andreasnoack commented Jul 1, 2014

CodeLenz commented Jul 1, 2014

jiahao commented Jul 1, 2014

CodeLenz commented Jul 1, 2014

jiahao commented Jul 1, 2014

JeffBezanson commented Jul 1, 2014

jiahao commented Jul 1, 2014

ViralBShah commented Jul 1, 2014

andreasnoack commented Jul 1, 2014

ViralBShah commented Jul 1, 2014

mlubin commented Jul 1, 2014

dmbates commented Jul 1, 2014

tkelman commented Jul 1, 2014

ViralBShah commented Jul 2, 2014

tkelman commented Jul 2, 2014

ViralBShah commented Jul 2, 2014

Jutho commented Jul 2, 2014

mlubin commented Jul 2, 2014

jiahao commented Jul 2, 2014

dmbates commented Jul 2, 2014

ViralBShah commented Feb 14, 2015

ViralBShah commented Feb 14, 2015

ViralBShah commented Feb 14, 2015

andreasnoack commented Feb 14, 2015

andreasnoack commented Feb 14, 2015

tkelman commented Feb 14, 2015

andreasnoack commented Feb 14, 2015

ViralBShah commented Feb 15, 2015

tkelman commented Feb 15, 2015

Error in A_ldiv_B! with sparse matrix #7488

Error in A_ldiv_B! with sparse matrix #7488

Comments

CodeLenz commented Jul 1, 2014

ViralBShah commented Jul 1, 2014

tkelman commented Jul 1, 2014

andreasnoack commented Jul 1, 2014

CodeLenz commented Jul 1, 2014

jiahao commented Jul 1, 2014

CodeLenz commented Jul 1, 2014

jiahao commented Jul 1, 2014

JeffBezanson commented Jul 1, 2014

jiahao commented Jul 1, 2014

ViralBShah commented Jul 1, 2014

andreasnoack commented Jul 1, 2014

ViralBShah commented Jul 1, 2014

mlubin commented Jul 1, 2014

dmbates commented Jul 1, 2014

tkelman commented Jul 1, 2014

ViralBShah commented Jul 2, 2014

tkelman commented Jul 2, 2014

ViralBShah commented Jul 2, 2014

Jutho commented Jul 2, 2014

mlubin commented Jul 2, 2014

jiahao commented Jul 2, 2014

dmbates commented Jul 2, 2014

ViralBShah commented Feb 14, 2015

ViralBShah commented Feb 14, 2015

ViralBShah commented Feb 14, 2015

andreasnoack commented Feb 14, 2015

andreasnoack commented Feb 14, 2015

tkelman commented Feb 14, 2015

andreasnoack commented Feb 14, 2015

ViralBShah commented Feb 15, 2015

tkelman commented Feb 15, 2015