
Array constants are generated on the host during AD #7093

Closed
gnecula opened this issue Jun 24, 2021 · 3 comments · Fixed by #7102

@gnecula (Collaborator) commented Jun 24, 2021

I noticed that the function abstract_arrays.zeros_like_shaped_array has code like np.broadcast_to(np.array(0, aval.dtype), aval.shape) (using regular NumPy). These values will be computed on the host during tracing and then may be passed to the device. Is this the desired behavior?

For example, when we take gradients of constant functions:

import jax
import numpy as np
x = np.ones((10, 10), dtype=np.float32)
jax.make_jaxpr(jax.grad(lambda x: 42.))(x)

we get

{ lambda a ; b.
  let 
  in (a,) }

where a 10x10 array of zeros is built on the host during tracing and passed in as the constant a.
Wouldn't it be better to compute the array of zeros directly on the device?

{ lambda  ; a.
   let  b = broadcast_in_dim[ broadcast_dimensions=(  )
                            shape=(10, 10) ] 0.0
  in (b,) }
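For concreteness, a device-side construction along those lines might look like the following (a sketch only; zeros_like_on_device is a hypothetical name, not the actual JAX helper):

import numpy as np
from jax import lax

def zeros_like_on_device(aval):
    # Only a scalar zero is created on the host; the broadcast to the full
    # shape is staged out as a broadcast_in_dim and runs on the device.
    return lax.broadcast_in_dim(np.array(0, aval.dtype), aval.shape,
                                broadcast_dimensions=())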

Incidentally, this also affects things like jax2tf, where we may end up with large constants embedded in the resulting graph, when instead we could embed the computation that produces them.

@gnecula gnecula added the bug Something isn't working label Jun 24, 2021
@gnecula gnecula assigned gnecula and mattjj and unassigned gnecula Jun 24, 2021
@gnecula gnecula added enhancement New feature or request and removed bug Something isn't working labels Jun 24, 2021
@gnecula gnecula changed the title from "Array constants are generate on the host during AD for integer functions" to "Array constants are generated on the host during AD for integer functions" Jun 24, 2021
@mattjj (Collaborator) commented Jun 24, 2021

This was intended at one point, pre-lazy-sublanguage #1668 and pre-omnistaging #3370, since there is special logic for handling broadcasted numpy constants efficiently in XLA lowering. (That's why it's written with np.broadcast_to, which uses stride tricks for the broadcasted representation, unlike np.zeros!) But that special handling exists only for XLA lowering, not for jax2tf lowering.
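For reference, a quick NumPy-only illustration of that stride-trick difference (not JAX code, just showing why the broadcasted representation is cheap on the host):

import numpy as np

z_view = np.broadcast_to(np.array(0, np.float32), (10, 10))  # read-only view of one scalar
z_full = np.zeros((10, 10), np.float32)                      # real 10x10 buffer

print(z_view.strides)  # (0, 0): every element aliases the single scalar
print(z_full.strides)  # (40, 4): 400 bytes actually allocated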

I think we don't need this logic anymore. There might be edge cases where it's more efficient in eager mode, but I can't think of one, and they're probably not that interesting.

I agree we should replace it with jnp operations, since that will benefit jax2tf! I can take that on.

@gnecula (Collaborator, Author) commented Jun 25, 2021

Thanks, I can also do it.

@gnecula gnecula changed the title from "Array constants are generated on the host during AD for integer functions" to "Array constants are generated on the host during AD" Jun 25, 2021
@gnecula gnecula self-assigned this Jun 25, 2021
gnecula added commits to gnecula/jax that referenced this issue Jun 25, 2021, with the message:
Fixes: jax-ml#7093
Also fixes type checking in jax2tf, because now we have to be careful
about computations that have result float0 (the broadcast_in_dim used
to compute the zeros).
@gnecula (Collaborator, Author) commented Jun 25, 2021

@mattjj I have started on this in #7102. The actual change to use lax instead of np is very small, but there are interactions with jax2tf (fixed) and with numpy 3.6.13 (not yet fixed). I can continue to work on this, just FYI, so that you don't start afresh.

gnecula added further commits to gnecula/jax that referenced this issue Jul 25–26, 2021, with the same commit message as above.