
jit does not respect backend type when the function has no arguments #1431

Closed
romanngg opened this issue Oct 2, 2019 · 2 comments · Fixed by #1668
romanngg (Contributor) commented Oct 2, 2019

Example:

from jax.api import jit
import jax.random as random

def x():
  return random.normal(random.PRNGKey(1), (2, 3))

print(x().device_buffer.device())
print(jit(x)().device_buffer.device())
print(jit(x, backend='cpu')().device_buffer.device())

All of these print GpuDevice(id=0), while the last one should be a CPU device.

jekbradbury (Contributor) commented:
Functions without arguments can't meaningfully be jitted, since jit relies on data dependence to trace primitives inside the function (it injects tracer values for each function argument, and follows them through the function). That is, the return value of x() is actually a constant that was computed op-by-op (on the default device) when jit attempted to trace x(), and not the result of a jitted XLA computation at all.
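As an illustration of the data-dependence point (a hypothetical variant of the example above, not taken from this thread): if the PRNG key is passed as an argument, jit has a traced input to follow, and the backend argument should then be honored.

from jax.api import jit
import jax.random as random

def x_with_key(key):
  # The output now depends on a traced argument, so jit stages the whole
  # computation rather than seeing only a trace-time constant.
  return random.normal(key, (2, 3))

print(jit(x_with_key, backend='cpu')(random.PRNGKey(1)).device_buffer.device())  # expected: a CPU device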

We don't yet have a way to specify the device used for op-by-op computations; it might make sense for such a mechanism to be dynamically scoped (i.e., a context manager) since we can't use data dependence for scoping the way we do for jit.
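Later JAX releases did add a dynamically scoped mechanism of this kind, jax.default_device, which can be used as a context manager to choose the device for op-by-op computations. A minimal sketch, assuming a JAX version recent enough to have it (it did not exist when this issue was filed):

import jax
import jax.numpy as jnp

cpu = jax.devices('cpu')[0]
with jax.default_device(cpu):
  # Op-by-op (non-jitted) computations in this scope are placed on the CPU device.
  y = jnp.ones((2, 3)) * 2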

levskaya (Collaborator) commented Oct 5, 2019

If for some reason you really need to use this idiom, you can force data dependence on constants (sometimes handy to prevent them from being folded out) by using lax.tie_in:

from jax.api import jit
import jax.random as random
from jax import lax

dummy = 0

def x(_placeholder):
  return lax.tie_in(_placeholder, random.normal(random.PRNGKey(1), (2, 3))) 

print(x(dummy).device_buffer.device())  # --> gpu
print(jit(x)(dummy).device_buffer.device())  # --> gpu
print(jit(x, backend='cpu')(dummy).device_buffer.device())  # --> cpu

mattjj self-assigned this Jan 7, 2020
mattjj added a commit that referenced this issue Jan 8, 2020
Before this commit, this computation would avoid materializing the iota
array at trace time:

  @jit
  def f(x):
    m, n = x.shape
    return x + np.arange(n)

But this one would materialize the iota array at trace time and stage it
into the computation as a potentially large array constant:

  @jit
  def f(x):
    m, n = x.shape
    return x + np.arange(m)[:, None]

The difference is that previously operations like broadcasts,
transposes, and reshapes that add singleton dimensions (as above) would
force otherwise lazy values to be materialized, while after this commit
broadcasts, transposes, and reshapes are all lazy operations that only
update metadata on their input rather than compiling and executing XLA
computations and producing new buffers.

Also, np.eye and np.tri become lazy (in addition to np.zeros, np.ones, np.full).

This commit replaces the ad-hoc "lazy device constant" system, which was
used to get the simpler behavior in the first example above.

Incidentally fixes #1431

See #1668 for more.
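As a sketch of what "Incidentally fixes #1431" means for the original report (assuming the lazy-constant behavior described above), the zero-argument example should now respect the requested backend:

from jax.api import jit
import jax.random as random

def x():
  return random.normal(random.PRNGKey(1), (2, 3))

# Per the "Incidentally fixes #1431" note above, this should now return an
# array backed by a CPU device rather than the default GPU device.
print(jit(x, backend='cpu')().device_buffer.device())  # expected: a CPU device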
mattjj mentioned this issue Jan 8, 2020