train baseline - memory consumption increases with iterations #157
Comments
Oh, that's bad! Are you actually seeing this fail as a result? Normally I see a RAM increase, but eventually RLlib somehow clears it up. However, this seems like more of an RLlib issue than an issue with this library (I suspect). Would you mind reposting this in their GitHub issues?
Yes. Unfortunately, it typically fails on most longer runs. I'll repost on the RLlib GitHub as well. I'm currently getting this issue after simply cloning the current repo and running train_baseline.
Hi, that's really good to know! Thank you for updating us on this. I'll examine it as well when I get a chance, but I suspect it's an RLlib issue rather than something on our end. I don't think there's any memory that persists across environment rollouts.
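One way to test that claim at the Python level is a minimal sketch using the standard-library tracemalloc module (note: any leak inside Ray's native C++ layer would not show up here, and the "run a few iterations" step is a placeholder, not the repo's actual loop):

```python
# Sketch: compare heap snapshots across a few training iterations to see
# which Python allocations persist. Native (C/C++) memory is not tracked.
import tracemalloc

tracemalloc.start()
before = tracemalloc.take_snapshot()

# ... run a handful of train_baseline iterations here ...

after = tracemalloc.take_snapshot()
for stat in after.compare_to(before, "lineno")[:10]:
    print(stat)  # top 10 allocation sites that grew between snapshots
```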
Sounds good. For now, could you possibly share your current environment/setup, the one in which RLlib clears up memory automatically? Possibly as a Docker container?
Upon running the train_baseline command in Python 3, the workers seem to consume RAM with each iteration without properly freeing it up.
On some runs, I have seen memory consumption grow by roughly 40 MB per iteration, which scaled to e.g. 10,000 iterations amounts to about 400 GB (40 MB × 10,000). Since this is in RAM, it makes longer experiments impossible to run.
Additionally, please note that this appears to be distinct from Ray's object store: upon termination, the object store size was quite small (~20 MB), whereas each worker.__PolicyEvaluator() had ~2 GB allocated, with multiple such workers present.
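To make the per-iteration growth measurable, a minimal sketch using psutil (an assumption; it is not part of this repo) could log the resident set size of the driver and its worker processes between iterations. The agent.train() loop below is illustrative, not the repo's exact entry point:

```python
# Minimal memory-tracking sketch. Assumes `psutil` is installed; the
# `agent.train()` call stands in for whatever train_baseline runs per iteration.
import os
import psutil

def rss_mb(proc):
    """Resident set size of a process in MB."""
    return proc.memory_info().rss / 1024 ** 2

def log_memory(iteration):
    driver = psutil.Process(os.getpid())
    workers = driver.children(recursive=True)  # Ray workers forked from the driver
    total_workers = sum(rss_mb(w) for w in workers)
    print(f"iter {iteration}: driver {rss_mb(driver):.1f} MB, "
          f"{len(workers)} workers totaling {total_workers:.1f} MB")

# for i in range(10000):
#     result = agent.train()  # hypothetical handle to the RLlib trainer
#     log_memory(i)
```

Logging both driver and worker RSS separately should show whether the ~2 GB per worker accumulates steadily or plateaus once RLlib's cleanup kicks in.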