Matmul/generic #17

DavidMertz · 2017-05-03T21:39:22Z

Assuming we can merge the minimal https://github.com/mrocklin/sparse/pull/16 that simply implements the @ operator for known types, this is proposal 1 for behavior with lists/tuples.

The gist of this approach is that if a user want to coerce lists to arrays, they must either explicitly create COOs with that behavior as COO(..., toarray_other=True) or call sparse.dot(..., make_array=True) (or my_coo.dot(..., make_array=True)).

Test for both success and failure are included for these opt-in behaviors.

…al code, not NumPy's

mrocklin · 2017-05-03T21:42:36Z

Just as an FYI I am currently -1 on these keyword arguments. My reasons include the following:

I think that tracking them throughout all operations will be add development cost
They're not part of the standard numpy API (which has somehow solved this problem with other means)
No active user has shown up with this need

Generally speaking if we're going to add something new to the array computing API I think that the conversation needs to start with the numpy community. I don't think that this project is the right place to expand the array computing user API. Perhaps raise an issue there?

DavidMertz · 2017-05-03T22:00:51Z

Note that I have just submitted an alternative approach in https://github.com/mrocklin/sparse/pull/18. These two PRs/branches are mutually exclusive. The opt-in keywords were my idea to answer @mrocklin's concern about coercion happening implicitly.

DavidMertz · 2017-05-03T22:06:53Z

Generally speaking if we're going to add something new to the array computing API I think that the conversation needs to start with the numpy community. I don't think that this project is the right place to expand the array computing user API. Perhaps raise an issue there?

There's not really the same issue for the NumPy community. NumPy happily coerces non-array collections to arrays implicitly and without asking (so does scipy.sparse, FWIW). The only reason we might not want that implicit behavior is because Dask sparse arrays might be too big to do this without excessive memory cost.

jakevdp · 2017-05-03T22:35:55Z

so does scipy.sparse, FWIW

As I've pointed out, scipy.sparse does coercion much more carefully. I think @mrocklin's complaint was not with coercion per se, but with the way you initially went about that coercion.

There are good solutions to this in numpy and scipy, and I think this version of the PR can be closed.

DavidMertz · 2017-05-04T00:27:55Z

As I've pointed out, scipy.sparse does coercion much more carefully.

This really is not true. scipy.sparse does extra work to check for compatible shapes of things that are already arrays. But the entirety of the actual coercion code is the following:

# If it's a list or whatever, treat it like a matrix
other_a = np.asanyarray(other)

Currently sparse doesn't perform as much shape massaging as scipy.sparse, but the type coercion is exactly the same. In fact, since the type coercion comes after the shape massaging, if a user of scipy.sparse passes in a list-of-lists it won't benefit from the shape massaging, even if it could be made compatible by doing so.

But that's completely orthogonal to the type coercion step in this PR.

jakevdp · 2017-05-04T04:19:35Z

Yes, it uses np.asanyarray. But first it special-cases scalars and matrices, and afterward it double-checks that it hasn't just created a zero-dimensional object array. That's what I mean when I say it's more careful.

jakevdp · 2017-05-04T04:27:07Z

In any case, the main issue with the approach in this PR is not the specifics of coercion, but the additional keyword arguments. I would not be in favor of adding that level of additional API complexity.

DavidMertz · 2017-05-04T05:05:14Z

@jakevdp Are you OK with #18 instead. I.e. "always coerce if the object is specifically a list or tuple". In that approach, the test in scipy will never be fulfilled, e.g.:

 if other_a.ndim == 0 and other_a.dtype == np.object_:
     # Not interpretable as an array; return NotImplemented so that
     # other's __rmul__ can kick in if that's implemented.
     return NotImplemented

That's not to say we couldn't fail in other ways with #18. But not in ways where scipy.sparse succeeds. E.g.:

In [1]: import numpy as np
In [2]: from sparse import COO
In [3]: l = [1, 2, 3, 4, 5]
In [4]: a = np.array(l)
In [5]: sa = COO(a)
In [6]: l @ sa
Out[6]: array(55, dtype=int64)
In [7]: ['a','b','c','d','e'] @ sa
[...]
TypeError: no supported conversion for types: (dtype('int64'), dtype('<U'))
In [8]: [6,7,8,9] @ sa
[...]
ValueError: shape-mismatch for sum

jakevdp · 2017-05-04T18:58:56Z

Yes, I'm mostly fine with #18 – it's a good start and we can expand from there to work for more general inputs.

mrocklin · 2017-05-10T23:55:48Z

OK to close this?

DavidMertz · 2017-05-11T00:03:02Z

Yes, but see my note in #18.

David Mertz added 7 commits May 2, 2017 08:49

Add @ operator

924118c

test for __rmatmul__

3a7b720

Better implementation of __rmatmul__ and test that exercises the actu…

4e3e343

…al code, not NumPy's

Not final (probably), but implement both directions of COO @ list

45770f6

Tweak to pass Travis CI (flake8 is really picky)

9656393

Improve logic for coercion to (dense) array

d385699

PEP8 small fix

e4191ca

flake8 disrespects PEP 8 :-(

fe8eef2

DavidMertz mentioned this pull request May 4, 2017

Matmul/generic2 #18

Closed

hameerabbasi closed this Dec 27, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Matmul/generic #17

Matmul/generic #17

Uh oh!

DavidMertz commented May 3, 2017 •

edited

Loading

Uh oh!

mrocklin commented May 3, 2017

Uh oh!

DavidMertz commented May 3, 2017

Uh oh!

DavidMertz commented May 3, 2017

Uh oh!

jakevdp commented May 3, 2017 •

edited

Loading

Uh oh!

DavidMertz commented May 4, 2017

Uh oh!

jakevdp commented May 4, 2017

Uh oh!

jakevdp commented May 4, 2017

Uh oh!

DavidMertz commented May 4, 2017 •

edited

Loading

Uh oh!

jakevdp commented May 4, 2017

Uh oh!

mrocklin commented May 10, 2017

Uh oh!

DavidMertz commented May 11, 2017

Uh oh!

Uh oh!

Uh oh!

Matmul/generic #17

Matmul/generic #17

Uh oh!

Conversation

DavidMertz commented May 3, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mrocklin commented May 3, 2017

Uh oh!

DavidMertz commented May 3, 2017

Uh oh!

DavidMertz commented May 3, 2017

Uh oh!

jakevdp commented May 3, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DavidMertz commented May 4, 2017

Uh oh!

jakevdp commented May 4, 2017

Uh oh!

jakevdp commented May 4, 2017

Uh oh!

DavidMertz commented May 4, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jakevdp commented May 4, 2017

Uh oh!

mrocklin commented May 10, 2017

Uh oh!

DavidMertz commented May 11, 2017

Uh oh!

Uh oh!

DavidMertz commented May 3, 2017 •

edited

Loading

jakevdp commented May 3, 2017 •

edited

Loading

DavidMertz commented May 4, 2017 •

edited

Loading