problem on the function of new_cal_re() in fullpace_env.py #24

WAYKEN-TSE · 2023-03-16T03:55:59Z

i know that the this function is used to calculate the extrinsic reward, but when doing PPO to update the network, the advantage function only include the intrinsic reward(advantages = rollouts.returns[:-1] - rollouts.value_preds[:-1]),then how can the extrinsic reward influence the policy network and what does this function do

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

problem on the function of new_cal_re() in fullpace_env.py #24

problem on the function of new_cal_re() in fullpace_env.py #24

WAYKEN-TSE commented Mar 16, 2023

problem on the function of new_cal_re() in fullpace_env.py #24

problem on the function of new_cal_re() in fullpace_env.py #24

Comments

WAYKEN-TSE commented Mar 16, 2023