timelimit for reinforcement learning tasks #1

YangyangFu · 2023-04-01T23:30:56Z

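Currently, for dynamic system control tasks, we end each episode only when it reaches the maximum number of steps, and we treat that cutoff as a true terminal state. This may cause inefficient learning for on-policy algorithms like PPO, because the value target at the cutoff is not bootstrapped from the next state even though the underlying task would continue.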
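To make the distinction concrete, here is a minimal sketch of how return targets could separate a true terminal state from a time-limit cutoff. The function name `bootstrapped_returns` and its arguments are hypothetical and not part of this repository; `next_values[t]` is assumed to be a critic's estimate of the state reached after step `t`.

```python
from typing import List


def bootstrapped_returns(
    rewards: List[float],
    terminated: List[bool],
    truncated: List[bool],
    next_values: List[float],
    gamma: float = 0.99,
) -> List[float]:
    """Discounted return targets for one rollout (hypothetical helper).

    terminated[t]: the task really ended after step t, so no future reward.
    truncated[t]:  the episode was cut by the time limit after step t, so the
                   tail is estimated with next_values[t] instead of zero.
    """
    returns = [0.0] * len(rewards)
    # If the rollout stops mid-episode, bootstrap from the last next-state value.
    running = next_values[-1] if rewards else 0.0
    for t in reversed(range(len(rewards))):
        if terminated[t]:
            running = 0.0
        elif truncated[t]:
            running = next_values[t]
        returns[t] = rewards[t] + gamma * running
        running = returns[t]
    return returns


# Same two-step rollout, ending by time limit vs. by true termination.
by_time_limit = bootstrapped_returns(
    [1.0, 1.0], [False, False], [False, True], [0.0, 10.0], gamma=0.5
)
by_termination = bootstrapped_returns(
    [1.0, 1.0], [False, True], [False, False], [0.0, 10.0], gamma=0.5
)
```

Treating the cutoff as termination discards the estimated value of the state where the episode was cut, which is the bias this issue is about.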

References:

Time Limit in Reinforcement Learning
time limit wrapper in gym: https://github.com/openai/gym/blob/master/gym/wrappers/time_limit.py#L19
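As a rough illustration of the wrapper idea, the sketch below reports the time limit through a separate `truncated` flag rather than folding it into the terminal signal. It is not gym's actual class: `TimeLimitSketch` and `ConstantEnv` are hypothetical, and the toy environment's `step` is assumed to return `(obs, reward, terminated)`.

```python
class ConstantEnv:
    """Toy environment that never terminates on its own (hypothetical)."""

    def reset(self):
        return 0.0

    def step(self, action):
        # Returns (observation, reward, terminated).
        return 0.0, 1.0, False


class TimeLimitSketch:
    """Hypothetical stand-in for a time-limit wrapper; not gym's implementation."""

    def __init__(self, env, max_episode_steps):
        self.env = env
        self.max_episode_steps = max_episode_steps
        self.elapsed_steps = 0

    def reset(self):
        self.elapsed_steps = 0
        return self.env.reset()

    def step(self, action):
        obs, reward, terminated = self.env.step(action)
        self.elapsed_steps += 1
        # The time limit is reported separately from true termination.
        truncated = self.elapsed_steps >= self.max_episode_steps
        return obs, reward, terminated, truncated


env = TimeLimitSketch(ConstantEnv(), max_episode_steps=3)
env.reset()
flags = []
for _ in range(3):
    _, _, terminated, truncated = env.step(action=None)
    flags.append((terminated, truncated))
```

With the two flags kept apart, a training loop can still end the rollout at the limit while bootstrapping the value of the final state.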

Good discussions:

stable-baselines3: [Bug] Infinite horizon tasks are handled like episodic tasks DLR-RM/stable-baselines3#284

YangyangFu self-assigned this Apr 1, 2023
