
Don't force .cpu() on all PyTorch outputs #1052

Open
ricardoV94 opened this issue Oct 29, 2024 · 8 comments
Labels: backend compatibility, enhancement (New feature or request), torch (PyTorch backend)

Comments

@ricardoV94 (Member) commented Oct 29, 2024

This whole thing (i.e., calling out.cpu()) is suboptimal. I think we don't need it for JAX (which returns JAX arrays, not numpy arrays), because np.asarray works with them, and I guess it doesn't work for torch tensors.

This should only be needed for updated shared variables, where we have to convert to a common type because they could be used in multiple functions with distinct backends.

Perhaps we should expand a bit on the TorchLinker to perform the updates itself, and only force conversion when that's the case. This is already supported by Function:

if getattr(self.vm, "need_update_inputs", True):
    # Update the inputs that have an update function
    for input, storage in reversed(
        list(zip(self.maker.expanded_inputs, input_storage))
    ):
        if input.update is not None:
            storage.data = outputs.pop()
else:
    outputs = outputs[: self.n_returned_outputs]

Originally posted by @ricardoV94 in #1032 (comment)
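For context, a minimal sketch (not from the codebase) of why the conversion differs between backends; it assumes torch is installed and only comments out the GPU case, which requires a CUDA device:

import numpy as np
import torch

t = torch.ones(3)
np.asarray(t)               # fine: CPU tensors expose __array__

t_grad = torch.ones(3, requires_grad=True)
# np.asarray(t_grad)        # raises: numpy() can't be called on a tensor that requires grad
np.asarray(t_grad.detach())  # explicit detach is needed

# For tensors on an accelerator, numpy conversion raises outright,
# so an explicit transfer is required:
# t_gpu = torch.ones(3, device="cuda")
# np.asarray(t_gpu.cpu())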

@Ch0ronomato (Contributor)

I would really like to work on this if possible; it has burned me a few times.

@ricardoV94 (Member, Author)

> I would really like to work on this if possible; it has burned me a few times.

Of course :)

@Ch0ronomato (Contributor) commented Nov 4, 2024

@ricardoV94: I'm wondering if this overlaps with the issue I found when messing around with pymc + py[torch|tensor]: #1065. What I'm wondering is whether the linker should be smart enough to know when to do

result.detach().numpy()

Then the issue with pymc should in theory be solved.

@ricardoV94 (Member, Author)

The problem is that this fails when the data is on the GPU. Is there a cheap way to know when it is and when it's not? Just wrap it in a try/except?

@Ch0ronomato (Contributor)

Yeah, x.device gives you the current location of the tensor. As long as we check for cpu it's fairly straightforward (GPU device names vary).
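A minimal sketch of that check (the to_numpy helper name and its placement are hypothetical, just to illustrate the idea being discussed):

import numpy as np
import torch

def to_numpy(out):
    # Hypothetical helper: convert a PyTorch output to numpy,
    # transferring off the accelerator only when needed.
    if isinstance(out, torch.Tensor):
        if out.device.type != "cpu":
            out = out.cpu()          # explicit transfer for cuda/mps/... tensors
        return out.detach().numpy()  # detach in case the tensor tracks gradients
    return np.asarray(out)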

@ricardoV94 (Member, Author)

> Yeah, x.device gives you the current location of the tensor. As long as we check for cpu it's fairly straightforward (GPU device names vary).

Wanna try that? It's still suboptimal to always force the transfer, but probably fine for a rough use of the backend. We may allow user control with custom linker settings in the future.

@Ch0ronomato (Contributor)

Would we combine this with the suggestion you had earlier as well?

> Perhaps we should expand a bit on the TorchLinker to perform the updates itself, and only force conversion when that's the case. This is already supported by Function.

@ricardoV94 (Member, Author)

> Would we combine this with the suggestion you had earlier as well?
> Perhaps we should expand a bit on the TorchLinker to perform the updates itself, and only force conversion when that's the case. This is already supported by Function.

Let's skip the updates idea for now and force everything to numpy once it's out. Otherwise you'll have the same sort of problems you saw in your PyMC tests.
