Skip to content

Commit

Permalink
Passing policy_state correctly into policy.action() function when com…
Browse files Browse the repository at this point in the history
…puting Q values.

PiperOrigin-RevId: 402601593
Change-Id: Ibb58d385835ccf9a1062be8a7a281a7f4dd92bdc
  • Loading branch information
Yao Lu authored and copybara-github committed Oct 12, 2021
1 parent e769fec commit 2dc591b
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion tf_agents/agents/qtopt/qtopt_agent.py
Original file line number Diff line number Diff line change
Expand Up @@ -684,7 +684,8 @@ def _compute_next_q_values(self, next_time_steps, info, network_state=()):
if not self._in_graph_bellman_update:
return info['target_q']

next_action_policy_step = self._policy.action(next_time_steps)
next_action_policy_step = self._policy.action(
next_time_steps, network_state)

if self._enable_td3:
q_values_target_delayed, _ = self._target_q_network_delayed(
Expand Down

0 comments on commit 2dc591b

Please sign in to comment.