Deep reinforcement learning for multi-agent coverage path planning

Run this code with Ubuntu 18.04, Python 3.6, torch==1.1.0, and OpenCV 2.0.
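
To confirm that a machine matches the versions listed above before running anything, a small check script can help. The following is only a sketch and not part of this repository: the function names `version_matches` and `check_environment` are hypothetical, and the import names `torch` and `cv2` are assumed to be how the listed packages are imported.

```python
import importlib
import platform

# Versions copied from the requirements above. The import names
# ("torch", "cv2") are assumptions, not taken from this repository.
EXPECTED_PYTHON = "3.6"
EXPECTED_MODULES = {"torch": "1.1.0", "cv2": "2.0"}


def version_matches(found, expected):
    """Return True if `found` begins with the dotted components of `expected`.

    A local build suffix such as "+cpu" is ignored.
    """
    found_parts = found.split("+")[0].split(".")
    expected_parts = expected.split(".")
    return found_parts[:len(expected_parts)] == expected_parts


def check_environment():
    """Return a dict mapping component name to (found_version_or_None, ok)."""
    py_version = platform.python_version()
    report = {"python": (py_version, version_matches(py_version, EXPECTED_PYTHON))}
    for module_name, expected in EXPECTED_MODULES.items():
        try:
            module = importlib.import_module(module_name)
        except ImportError:
            # Package is not installed (or cannot be loaded).
            report[module_name] = (None, False)
            continue
        found = getattr(module, "__version__", None)
        ok = found is not None and version_matches(str(found), expected)
        report[module_name] = (found, ok)
    return report


if __name__ == "__main__":
    for name, (found, ok) in check_environment().items():
        print("{:8s} found={} ok={}".format(name, found, ok))
```

Running the script prints one line per component. An `ok=False` entry means the installed version differs from the one listed above, or the package is missing.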