Make the default device for policies "cpu" #828

ernestum · 2023-12-11T14:04:53Z

When there is a GPU available, using "auto" as the default device does not seem to be desirable (see #825).

This PR sets the default device to "cpu" which hopefully just works in most cases.

Fixes #825

codecov · 2023-12-11T14:17:48Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (629ef9a) 95.64% compared to head (39647d7) 95.66%.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #828      +/-   ##
==========================================
+ Coverage   95.64%   95.66%   +0.01%     
==========================================
  Files         102      102              
  Lines        9655     9655              
==========================================
+ Hits         9235     9236       +1     
+ Misses        420      419       -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

tomtseng · 2023-12-11T21:24:11Z

Hmm so the error is this backtrace when running examples/quickstart.py:

  File "examples/quickstart.py", line 83, in <module>
    bc_trainer.train(n_epochs=1)
  File "/home/ttseng/imitation/src/imitation/algorithms/bc.py", line 495, in train
    training_metrics = self.loss_calculator(self.policy, obs_tensor, acts)
  File "/home/ttseng/imitation/src/imitation/algorithms/bc.py", line 130, in __call__
    (_, log_prob, entropy) = policy.evaluate_actions(

On bc.py line 130, tensor_obs is on device auto which assigns it to a GPU, but acts is on the CPU. Could the solution be to move acts onto the same device as tensor_obs? The proposed fix in this PR is to make the device argument default to cpu, but that means that this code still breaks if the user sets the argument to something else

tomtseng · 2023-12-11T21:28:19Z

On line 494 of bc.py, right before the failing line 495, we have acts = util.safe_to_tensor(batch["acts"], device=self.policy.device). Despite the device argument seemingly suggesting that acts should be moved to the same device as the policy & observation, but looking at the implementation of util.safe_to_tensor, acts doesn't actually get moved if it's already a tensor. My instinct would be to either change util.safe_to_tensor's behavior, and/or to add a line acts = acts.to(self.policy.device)

ernestum · 2023-12-15T10:57:05Z

Good point and thanks for the suggestions! I will start a new PR with your proposal.

ernestum · 2023-12-15T12:56:28Z

Closing this in favor of #831

Make the default device for policies "cpu".

39647d7

ernestum requested a review from tomtseng December 11, 2023 14:05

ernestum mentioned this pull request Dec 15, 2023

Ensure safe_to_tensor moves tensors to the specified device. #831

Open

ernestum closed this Dec 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make the default device for policies "cpu" #828

Make the default device for policies "cpu" #828

ernestum commented Dec 11, 2023 •

edited

Loading

codecov bot commented Dec 11, 2023

tomtseng commented Dec 11, 2023

tomtseng commented Dec 11, 2023

ernestum commented Dec 15, 2023

ernestum commented Dec 15, 2023

Make the default device for policies "cpu" #828

Make the default device for policies "cpu" #828

Conversation

ernestum commented Dec 11, 2023 • edited Loading

codecov bot commented Dec 11, 2023

Codecov Report

tomtseng commented Dec 11, 2023

tomtseng commented Dec 11, 2023

ernestum commented Dec 15, 2023

ernestum commented Dec 15, 2023

ernestum commented Dec 11, 2023 •

edited

Loading