-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RFC: add API to enforce a specified dimensionality #494
Comments
Hi @arogozhnikov, thanks for your interesting proposal. I have a few questions:
Perhaps I missed something obvious...? |
Hi @leofang
Proposal does not introduce any new requirement for any backend. If backend is define-by-run (all shapes known), If backend is symbolic,
That's what I mention in the first line.
|
Thanks @arogozhnikov, interesting proposal. I have to admit I find it a little hard to wrap my head around - probably because I have worked with and on eager libraries a lot, and don't have concrete use cases in mind where I've needed something like this.
Let me try to unpack this a little.
If you have more real-world code example to point at, that would help me I think. On a more philosophical level - we don't have APIs that produce arrays with unknown dimensionality, so do we actually need to be able to deal with that? I may be missing something here, but it seems to me like we should also want/need APIs that produce such arrays in order to need a function to parse those shapes? |
There are two parts:
Thanks, this was a typo (changed name while writing proposal), just corrected that, now it is
That's right, it should be calculable like that, but for unknown shapes (symbols)
For cases I want to address here, you only need to compute Additionally, currently there is no way to enforce minimal or exact dimensionality. E.g. if first axis is batch, and the last axis is channel, but between them there may be >= 0 axes, there is no way to express this requirement. Expression
You mean - do you need to deal with arbitrary shapes? I am thinking from einops perspective (probably the first adoption would come from lib devs). Einops gets tensors from user, and users define tensors in whatever framework, and can pass tensors of any dimensionality, symbols too, so that's an important piece for me.
Some common cases:
Now, to be fair, there is a number of cases that this proposal won't address. Like, if you have ndimage, and want to make subsampling / upsampling over one specific axis, this is doable with tools currently in a standard. But if one wants to do this for every axis, unfortunately I see no simple universal tool for it right now. In other words, when axes of variable dimensionality should be manipulated, it's hard to put into symbolic tracing without specific functionality. A bit hard to do this without some kind of generic |
Agreed, that matters too.
For minimal dimensionality, I've never been too happy with if x.ndim < 2: # example for atleast_2d
x = reshape_to_2d_by_prepending_axes(x) then that does seem useful.
Dealing with arbitrary shapes - kind of, but perhaps not exactly. You must as a library author indeed anticipate getting array inputs which can have any dimensionality. I think the difference between "arbitrary shape" and "unknown shape" here is whether
@leofang is planning to propose a
Thanks for the examples @arogozhnikov. My intuition here is that your proposed function has two goals here:
(2) is probably a superset of (1). I have to think more about the implications here, and whether the API surface added for (2) also makes sense in eager libraries, or if that's then always a "do-nothing function call". It'd be great to get some input from maintainers of MXNet, Dask, and TensorFlow here - I think those are the main beneficiaries of (2). |
Hmm, didn't think about it, that's a good point. I can imagine adding one more constant like
with a restriction that all OptionalAxis should be on the left.
yes
That's correct. Additionally: known dimensionality with unknown components still precludes a number of operations. So the goal is to have a function that can address points above and deal with all scenarios (known shape, unknown components, unknown number of dimensions).
Completely agree. |
Previous discussion: #97
Resolution was to ignore the case of data-dependent (unknown) dimensionalities.
I have a suggestion how to implement it by adding a new function to the API with a specific interface.
This allows to support variable dimensionality in a large number of practical cases.
Additionally, it allows to partially control dimension sizes that are currently unknown.
Example 1:
enforces dimensionality to be 3, will fail for inputs of other dimensionalities.
If dimensionality is not yet known - will fail in runtime at this operation.
Elements
d1
,d2
andd3
are integers or symbols, not Nones.Returned
x
is either the same tensor, or tensor with attached information about shape.Main reason for returning a tensor is that operation could not be cut out of the graph if resulting shape is not used to obtain result.
Example 2:
expects input to have 3 dimensions, first axis matches to (maybe) symbol d1_input, last axis of length 4.
Example 3:
expects input to have 3 or more dimensions. First axis is of length 1, second of any length, last of length 3.
Note here, that for ellipsis a tuple with two objects is returned:
axes
corresponds to x.shape[2:-1], andn_elements
is a product of elements inaxes
.axes
is either a tuple ofUnion[int, Symbol]
(if dimensionality is known)or a Symbol for a tuple of unknown size (if dimensionality is unknown), same type as used to represent unknown shape.
n_elements
is either int (if size of all axes is known) or symbol foraxis
.Signature
I see this as a good intermediate point: sufficient to deal with multiple practical cases while requiring framework developers to maintain very little.
Downstream package dev perspective
There is a generic path to deal with these cases by first reducing to a fixed dimensionality, and converting back afterwards.
Example: last dimension should be rgb, should convert to hsl. Assuming we have a function that implements conversion for 2d array.
tf.assert(...)
and bind them in the graph, which can't be done in a cross-framework mannerFrameworks perspective
While the function is novel, it does not introduce any new entities: symbols for axes and symbols for shapes (like TensorShape) should exist anyway.
Function behavior overlaps with and extends
tf.set_shape
by supporting ellipsis.For frameworks without data-dependent shapes, this is straightforward to implement based on already exposed shape property.
The text was updated successfully, but these errors were encountered: