This repository has been archived by the owner on Jun 13, 2024. It is now read-only.

feature request: batched get_optimistic_exploration_action #3

Open

samuelstanton opened this issue Dec 20, 2019 · 1 comment

samuelstanton commented Dec 20, 2019

Would it be straighforward to implement a batched version of get_optimistic_exploration_action?

The text was updated successfully, but these errors were encountered:

Collaborator

quanvuong commented Dec 20, 2019

Hi Samuel,

It is doable (a few hours effort), but not straightforward (< 20 minutes).

This is because:

the function get_optimistic_exploration_action requires computing one backward pass and per-sample gradient (and not sum of gradients over a batch of input).
AFAIK, pytorch doesn't support computing per-sample gradient.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.