Skip to content

Commit

Permalink
Fix: offpolicy test with gradient_step at the beginning
Browse files Browse the repository at this point in the history
  • Loading branch information
StephenArk30 committed Apr 29, 2021
1 parent 740a021 commit ba20279
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tianshou/trainer/offpolicy.py
Original file line number Diff line number Diff line change
Expand Up @@ -96,7 +96,7 @@ def offpolicy_trainer(
test_collector.reset_stat()
test_in_train = test_in_train and train_collector.policy == policy
test_result = test_episode(policy, test_collector, test_fn, start_epoch,
episode_per_test, logger, gradient_step, reward_metric)
episode_per_test, logger, env_step, reward_metric)
best_epoch = start_epoch
best_reward, best_reward_std = test_result["rew"], test_result["rew_std"]

Expand Down

0 comments on commit ba20279

Please sign in to comment.